Hugging Face
Open to Work
Peijia Qin
t2ance
2 followers · 3 following
AI & ML interests
None yet
Recent Activity
Updated a dataset about 9 hours ago: t2ance/selection-lcb-sft-warmup-forcing
Published a dataset about 9 hours ago: t2ance/selection-lcb-sft-warmup-forcing
Updated a dataset about 10 hours ago: t2ance/BCB-Selection-Data-8192
Organizations
None yet
t2ance's models (59)
Sort: Recently updated
t2ance/CodeRM-GRPO-4B-bs96-nrp • Updated 1 day ago
t2ance/atts-grpo-8b-warmstart155-b63r16 • Updated 4 days ago
t2ance/atts-grpo-8b-sft-2gpu-bs96 • Updated 5 days ago
t2ance/sft_qwen3_8b_merged • 8B • Updated 7 days ago • 21
t2ance/CodeRM-SFT-Haiku500-4B • 4B • Updated 8 days ago • 19
t2ance/CodeRM-GRPO-Selection-8B • 8B • Updated 19 days ago • 40.6k • 1
t2ance/CodeRM-Bilevel-GRPO-4B • 4B • Updated 20 days ago • 103 • 1
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-K8s-v2 • Updated 22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v13-ThinkingMasked • Updated 22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-v12-NoThinking • Updated 22 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v11 • Updated 23 days ago • 1
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v9 • Updated 26 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v6 • Updated 26 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v5 • Updated 27 days ago
t2ance/mle-playbooks • Updated 27 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v4 • Updated 27 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v3 • Updated 27 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT-v2 • Updated 28 days ago
t2ance/CodeRM-SFT-Warmup-Selection-4B-Merged • 4B • Updated 28 days ago • 7.82k
t2ance/sft-4b-onpolicy-rejection-sampling • Updated 28 days ago
t2ance/CodeRM-OnlineGRPO-Selection-8B-Domain-SFT-K8s • Updated 28 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain-SFT • Updated 28 days ago
t2ance/CodeRM-SFT-Warmup-Selection-8B-Merged • 8B • Updated 28 days ago • 7.72k
t2ance/CodeRM-SFT-Warmup-Selection-8B • Text Generation • Updated 28 days ago • 14
t2ance/CodeRM-SFT-Warmup-Selection-4B • Text Generation • Updated 28 days ago • 14
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain-SmallMeta • Updated 29 days ago
t2ance/CodeRM-OnlineGRPO-Selection-4B-Domain • Updated 30 days ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-CrossDomain • Updated about 1 month ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Heuristic • Updated about 1 month ago
t2ance/CodeRM-OnlineGRPO-Selection-1.7B-Baseline • Updated Mar 24