What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity Paper • 2511.15593 • Published 8 days ago • 54
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 10 days ago • 130
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 174
A Family of Pretrained Transformer Language Models for Russian Paper • 2309.10931 • Published Sep 19, 2023 • 5
RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark Paper • 2010.15925 • Published Oct 29, 2020
Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP models Paper • 2202.07791 • Published Feb 15, 2022
Findings of the The RuATD Shared Task 2022 on Artificial Text Detection in Russian Paper • 2206.01583 • Published Jun 3, 2022 • 1
Vote'n'Rank: Revision of Benchmarking with Social Choice Theory Paper • 2210.05769 • Published Oct 11, 2022
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20 • 192
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20 • 192
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Paper • 2310.03502 • Published Oct 5, 2023 • 78