-
AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft
Text Generation • 8B • Updated • 4 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en-sft
Text Generation • 8B • Updated • 77 • 1 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-high-tweet-1m-en-sft
Text Generation • 8B • Updated • 1 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft
Text Generation • 8B • Updated • 2
Yifan Wang
AmberYifan
AI & ML interests
None yet
Recent Activity
published
a model
about 12 hours ago
AmberYifan/Qwen3-4B-MATH-GRPO-len-control-tuned
published
a model
1 day ago
AmberYifan/Qwen3-4B-OpenR1Math-MARL-structure-v2
published
a model
1 day ago
AmberYifan/Qwen3-4B-Polaris-MARL-structure-v2
Organizations
LLMs Can Get "Brain Rot"!
-
AmberYifan/qwen2.5-7b-instruct-full-pretrain-control-tweet-1m-en-sft
Text Generation • 8B • Updated • 4 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-junk-tweet-1m-en-sft
Text Generation • 8B • Updated • 77 • 1 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-high-tweet-1m-en-sft
Text Generation • 8B • Updated • 1 -
AmberYifan/qwen2.5-7b-instruct-full-pretrain-mix-mid-tweet-1m-en-sft
Text Generation • 8B • Updated • 2
DRIFT
Learning from Abundant User Dissatisfaction in Real-World Preference Learning
models
638
AmberYifan/Qwen3-4B-MATH-GRPO-len-control-tuned
Updated
AmberYifan/Qwen3-4B-OpenR1Math-MARL-structure-v2
Updated
AmberYifan/Qwen3-4B-Polaris-MARL-structure-v2
Updated
AmberYifan/Qwen3-4B-MATH-MARL-structure-v2
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-structure
Updated
AmberYifan/Qwen3-1.7B-MATH-MARL-tuned
Updated
AmberYifan/Qwen3-4B-MATH-MARL-tuned
Updated
AmberYifan/Qwen3-4B-MATH-GRPO-tuned
Updated
AmberYifan/Qwen3-4B-MATH-MARL-structure-loop-penalty-v2-32
Updated
AmberYifan/Qwen3-4B-OpenR1Math-MARL-structure-loop-penalty-v2
Updated
datasets
28
AmberYifan/seed-data
Viewer
•
Updated
•
491
•
30
AmberYifan/dsat-data
Viewer
•
Updated
•
10.6k
•
17
AmberYifan/sat-data
Viewer
•
Updated
•
4.43k
•
18
AmberYifan/mistral-v0.1-spin-hhrlhf
Viewer
•
Updated
•
5.5k
•
21
AmberYifan/sft-spin-filter
Updated
•
2
AmberYifan/sft-spin-kcenter-5k
Viewer
•
Updated
•
5.5k
•
3
AmberYifan/gsm8k-sft
Viewer
•
Updated
•
8.79k
•
5
AmberYifan/sft-spin-v
Viewer
•
Updated
•
50.5k
•
9
AmberYifan/safeRLHF-SFT
Viewer
•
Updated
•
83.4k
•
16
AmberYifan/SPIN-trans-DPOformat
Viewer
•
Updated
•
55k
•
6