Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published Oct 20 • 66 • 5
lmstudio-community/Mistral-Large-Instruct-2407-GGUF Text Generation • 123B • Updated Aug 29, 2024 • 79 • 13
lmstudio-community/Meta-Llama-3-8B-Instruct-GGUF Text Generation • 8B • Updated May 3, 2024 • 7.5k • 187
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19, 2024 • 59
Running on CPU Upgrade 13.7k Open LLM Leaderboard 🏆 13.7k Track, rank and evaluate open LLMs and chatbots