rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper β’ 2501.04519 β’ Published Jan 8 β’ 285
Facilitating large language model Russian adaptation with Learned Embedding Propagation Paper β’ 2412.21140 β’ Published Dec 30, 2024 β’ 18
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper β’ 2412.21187 β’ Published Dec 30, 2024 β’ 41
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks β’ 8 items β’ Updated Jul 31 β’ 27
Material Anything: Generating Materials for Any 3D Object via Diffusion Paper β’ 2411.15138 β’ Published Nov 22, 2024 β’ 50
RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models Paper β’ 2409.19989 β’ Published Sep 30, 2024 β’ 18
Colorful Diffuse Intrinsic Image Decomposition in the Wild Paper β’ 2409.13690 β’ Published Sep 20, 2024 β’ 14
jina-embeddings-v3: Multilingual Embeddings With Task LoRA Paper β’ 2409.10173 β’ Published Sep 16, 2024 β’ 34
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. β’ 323 items β’ Updated about 20 hours ago β’ 361
LLM Pruning and Distillation in Practice: The Minitron Approach Paper β’ 2408.11796 β’ Published Aug 21, 2024 β’ 57
JPEG-LM: LLMs as Image Generators with Canonical Codec Representations Paper β’ 2408.08459 β’ Published Aug 15, 2024 β’ 45
In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation Paper β’ 2408.00397 β’ Published Aug 1, 2024 β’ 12
Leaderboards and benchmarks β¨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... β’ 91 items β’ Updated Feb 28 β’ 114
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. β’ 39 items β’ Updated Jul 21 β’ 373
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper β’ 2405.01434 β’ Published May 2, 2024 β’ 56