Predict human preferences for LLM responses.
Binfeng Xu
billxbf
AI & ML interests
evolving back to apes
Recent Activity
upvoted a paper about 2 hours ago
Polar: Agentic RL on Any Harness at Scale
updated a model 28 days ago
billxbf/qwen3.5-4b-pi-polar