Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments Paper • 2502.17956 • Published Feb 25
Thai instruction dataset list Collection Thai instruction datasets that are high quality and are not low-quality translations produced by Google Translate • 14 items • Updated 24 days ago • 2
Mangosteen: An Open Thai Corpus for Language Model Pretraining Paper • 2507.14664 • Published Jul 19 • 7
Datasets for Pretrained Thai LLM Collection A list of datasets for pretrained Thai LLMs, by PyThaiNLP • 25 items • Updated Aug 5 • 14