view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 253
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 73
view article Article Building Conversational AI: A Deep Dive into Voice Agent Architectures and Best Practices By abdeljalilELmajjodi • Sep 2 • 9
Moroccan Darija LLMs Collection Language Models that speaks Moroccan darija (ary) • 9 items • Updated Feb 20 • 4
view article Article Seeing Isn’t Understanding: The Spatial Reasoning Gap in Vision-Language Models By KBayoud • Jul 13 • 8
view article Article Creating your custom Ghibli Text-to-Image model By atlasia and 3 others • May 1 • 18
view article Article Atlaset Dataset for Moroccan Darija: From Data Collection, Analysis, to Model Trainings By atlasia and 1 other • Mar 6 • 26
view article Article Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect By atlasia and 2 others • Feb 10 • 14
view article Article TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation By imomayiz and 4 others • Jan 10 • 33
view article Article Finding Moroccan Arabic (Darija) in Fineweb 2 By omarkamali and 3 others • Dec 8, 2024 • 23
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 15 items • Updated Jun 11 • 12