Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments Paper • 2502.17956 • Published Feb 25, 2025
Thai instruction dataset list Collection Thai instruction datasets that have high quality and doesn't are the translated dataset by Google translate (low quality) • 14 items • Updated Oct 9, 2025 • 2
Mangosteen: An Open Thai Corpus for Language Model Pretraining Paper • 2507.14664 • Published Jul 19, 2025 • 7
Datasets for Pretrained Thai LLM Collection List Datasets for pretrained Thai LLM by PyThaiNLP • 25 items • Updated Aug 5, 2025 • 14