This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task.
			
	
	AI & ML interests
Multilingual NLP, underserved languages
			Organization Card
		
		 Open Language Data Initiative
Welcome!
The Open Language Data Initiative (OLDI) empowers language communities around the globe to contribute to a database that forms the foundation of today’s machine translation and natural language processing work. We invite community, academic, and industry members to contribute to key datasets that are essential to expanding the reach of language technology.
For more information, visit oldi.org.