Article ModernVBERT: Towards Smaller Visual Document Retrievers By paultltc and 4 others • 25 days ago • 41
Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications Paper • 2509.17671 • Published Sep 22 • 9
Guided Decoding and Its Critical Role in Retrieval-Augmented Generation Paper • 2509.06631 • Published Sep 8 • 10
Article Guided Decoding and Its Critical Role in Retrieval-Augmented Generation: A Deep Dive into Structured LLM Outputs By nmmursit and 7 others • Sep 8 • 16
Article Theoretical Limitations of Embedding Models and Their Applications in Turkish: An In-Depth Look By nmmursit and 1 other • Sep 4 • 15
Article 🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders By adaamko and 1 other • Aug 31 • 14
Article Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications By nmmursit and 5 others • Aug 29 • 27
Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face Jul 29 • 190
EuroHPC Benchmark Access Collection Funding: EuroHPC JU Benchmark Access Grant No. EHPC-BEN-2024B11-003 Infrastructure: IT4Innovations National Supercomputing Center (Karolina) • 23 items • Updated Sep 26 • 2
Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 1.14k
Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • Jan 31 • 51
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Feb 25 • 19