Bilingual LMs ( L1 {es fr de pl tr ar zh} + L2 en ) trained on Cultura-X for L1 and FineWebEdu (L2)
Suchir Salhan
suchirsalhan
AI & ML interests
Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.
Recent Activity
upvoted a collection about 17 hours ago
Learner Essay Datasets updated a Space 1 day ago
BabyLM-community/BabyLM-Leaderboard-2026 published a model 2 days ago
Beetle-FineWeb/beetle-bilingual-balanced-b4-fineweb-deu-eng