arxiv:2510.23393
Farid Bagirov
kraalfar
AI & ML interests
None yet
Recent Activity
authored
a paper
about 3 hours ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N
Sampling via max@k Optimisation
upvoted
a
paper
about 8 hours ago
The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N
Sampling via max@k Optimisation
upvoted
a
paper
about 9 hours ago
Diff-XYZ: A Benchmark for Evaluating Diff Understanding