Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
D
Anna4242
Follow
21world's profile picture
CrocodileGreen's profile picture
Disperser5601's profile picture
3 followers
·
4 following
AI & ML interests
None yet
Recent Activity
updated
a model
26 days ago
Anna4242/qwen25-7b-multihop-grpo-checkpoint-200
published
a model
26 days ago
Anna4242/qwen25-7b-multihop-grpo-checkpoint-200
updated
a model
26 days ago
Anna4242/qwen25-7b-singlehop-grpo-checkpoint-200
View all activity
Organizations
Anna4242
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
26 days ago
Anna4242/qwen25-7b-multihop-grpo-checkpoint-200
8B
•
Updated
26 days ago
•
9
published
a model
26 days ago
Anna4242/qwen25-7b-multihop-grpo-checkpoint-200
8B
•
Updated
26 days ago
•
9
updated
a model
26 days ago
Anna4242/qwen25-7b-singlehop-grpo-checkpoint-200
8B
•
Updated
26 days ago
•
9
published
a model
26 days ago
Anna4242/qwen25-7b-singlehop-grpo-checkpoint-200
8B
•
Updated
26 days ago
•
9
updated
a model
29 days ago
Anna4242/qwen25-3b-instruct-grpo-merged
3B
•
Updated
29 days ago
•
11
published
a model
29 days ago
Anna4242/qwen25-3b-instruct-grpo-merged
3B
•
Updated
29 days ago
•
11
updated
a model
29 days ago
Anna4242/qwen25-3b-base-grpo
Text Generation
•
Updated
29 days ago
•
25
published
a model
30 days ago
Anna4242/qwen25-3b-base-grpo
Text Generation
•
Updated
29 days ago
•
25
updated
a dataset
30 days ago
Anna4242/grpo-training-plots
Viewer
•
Updated
30 days ago
•
1.41k
•
37
published
a dataset
30 days ago
Anna4242/grpo-training-plots
Viewer
•
Updated
30 days ago
•
1.41k
•
37
updated
a model
30 days ago
Anna4242/qwen25-7b-full-sft-multihop
8B
•
Updated
30 days ago
•
9
published
a model
30 days ago
Anna4242/qwen25-7b-full-sft-multihop
8B
•
Updated
30 days ago
•
9
updated
a model
30 days ago
Anna4242/qwen25-3b-full-sft-multihop
3B
•
Updated
30 days ago
•
7
published
a model
30 days ago
Anna4242/qwen25-3b-full-sft-multihop
3B
•
Updated
30 days ago
•
7
updated
a model
30 days ago
Anna4242/qwen25-7b-sft-grpo-checkpoint-200
Reinforcement Learning
•
Updated
30 days ago
published
a model
30 days ago
Anna4242/qwen25-7b-sft-grpo-checkpoint-200
Reinforcement Learning
•
Updated
30 days ago
updated
a model
about 1 month ago
Anna4242/qwen25-3b-original-sft-ep1-grpo-checkpoint-200
Text Generation
•
Updated
Nov 27
published
a model
about 1 month ago
Anna4242/qwen25-3b-original-sft-ep1-grpo-checkpoint-200
Text Generation
•
Updated
Nov 27
updated
a model
about 1 month ago
Anna4242/Qwen2.5-7B-Instruct-onlyrl-step-1000
8B
•
Updated
Nov 26
•
2
published
a model
about 1 month ago
Anna4242/Qwen2.5-7B-Instruct-onlyrl-step-1000
8B
•
Updated
Nov 26
•
2
Load more