Large Scale Turkish Corpora Collection A collection for high quality, large scale Turkish corpora for model training. • 7 items • Updated 18 days ago • 2
Turkish Encoder-only Models Collection Encoder-only Transformer models pre-trained for Turkish. • 19 items • Updated May 9 • 3
Turkish Vision-Language Datasets Collection Collection of Turkish vision-language datasets. • 30 items • Updated Jul 8 • 11
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Oct 22 • 151
99eren99/ColBERT-ModernBERT-base-Turkish-uncased Sentence Similarity • 0.1B • Updated May 21 • 13 • 5