Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Cornell-AGI
university
Activity Feed
Follow
9
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
GitBag
authored
a paper
about 2 months ago
Prompt Curriculum Learning for Efficient LLM Post-Training
GitBag
authored
a paper
6 months ago
Pre-trained Large Language Models Learn Hidden Markov Models In-context
GitBag
updated
a collection
6 months ago
Accelerating RL for LLM Reasoning with Optimal Advantage Reg
View all activity
Team members
1
Cornell-AGI
's models
20
Sort: Recently updated
Cornell-AGI/apo_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
7
Cornell-AGI/ppo_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
37
Cornell-AGI/rebel_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
16
Cornell-AGI/grpo_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
10
Cornell-AGI/grpo_math_qwen2.5_1.5b
Text Generation
•
2B
•
Updated
May 5
•
15
Cornell-AGI/ppo_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
18
Cornell-AGI/rebel_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
9
Cornell-AGI/apo_math_qwen2.5_3b
Text Generation
•
3B
•
Updated
May 5
•
9
Cornell-AGI/grpo_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 5
•
14
Cornell-AGI/ppo_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 5
•
24
Cornell-AGI/rebel_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 4
•
7
Cornell-AGI/apo_math_qwen2.5_7b
Text Generation
•
8B
•
Updated
May 4
•
10
•
1
Cornell-AGI/REFUEL-Llama-3-Armo-iter_2
8B
•
Updated
Oct 8, 2024
•
4
Cornell-AGI/REFUEL-Llama-3-Armo-iter_1
8B
•
Updated
Oct 8, 2024
•
4
Cornell-AGI/REBEL-Llama-3-Armo-iter_3
8B
•
Updated
Sep 2, 2024
•
5
•
2
Cornell-AGI/REBEL-Llama-3-Armo-iter_2
8B
•
Updated
Sep 2, 2024
•
9
•
1
Cornell-AGI/REBEL-Llama-3-Armo-iter_1
8B
•
Updated
Sep 2, 2024
•
5
•
1
Cornell-AGI/REBEL-Llama-3-epoch_2
Text Generation
•
Updated
Sep 1, 2024
•
10
•
3
Cornell-AGI/REBEL-Llama-3
Text Generation
•
Updated
Sep 1, 2024
•
17
•
1
Cornell-AGI/REBEL-OpenChat-3.5
Text Generation
•
Updated
Sep 1, 2024
•
20
•
1