Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper • 2509.24372 • Published 30 days ago • 8
Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information Paper • 2505.06046 • Published May 9 • 15
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22, 2024 • 48
Challenges and Applications of Large Language Models Paper • 2307.10169 • Published Jul 19, 2023 • 49