Does It Tie Out? Towards Autonomous Legal Agents in Venture Capital Paper • 2512.18658 • Published 8 days ago • 8
Performance Gaps in Multi-view Clustering under the Nested Matrix-Tensor Model Paper • 2402.10677 • Published Feb 16, 2024
Investigating Regularization of Self-Play Language Models Paper • 2404.04291 • Published Apr 4, 2024 • 1
Do Vision and Language Encoders Represent the World Similarly? Paper • 2401.05224 • Published Jan 10, 2024
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30 • 68
Running • The Ultra-Scale Playbook • 3.6k • The ultimate guide to training LLM on large GPU Clusters
Article Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. May 15 • 36
Running on CPU Upgrade • Open LLM Leaderboard • 13.8k • Track, rank and evaluate open LLMs and chatbots
Article LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark Jan 2 • 41
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Nov 6 • 87