SindBERT, the Sailor: Charting the Seas of Turkish NLP • Paper 2510.21364 • Published 30 days ago
The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models • Paper 2510.13996 • Published Oct 15
Llama-GENBA-10B: A Trilingual Large Language Model for German, English and Bavarian • Paper 2509.05668 • Published Sep 6