Feature Extraction
sentence-transformers
PyTorch
Chinese
English
bert
sentence-similarity
mteb
RAG
Eval Results (legacy)
text-embeddings-inference
Instructions to use DMetaSoul/Dmeta-embedding-zh with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use DMetaSoul/Dmeta-embedding-zh with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("DMetaSoul/Dmeta-embedding-zh") sentences = [ "The weather is lovely today.", "It's so sunny outside!", "He drove to the stadium." ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [3, 3] - Inference
- Notebooks
- Google Colab
- Kaggle
| import logging | |
| import functools | |
| from mteb import MTEB | |
| from sentence_transformers import SentenceTransformer | |
| logging.basicConfig(level=logging.INFO) | |
| logger = logging.getLogger("main") | |
| # task_list | |
| task_list = ['Classification', 'Clustering', 'Reranking', 'Retrieval', 'STS', 'PairClassification'] | |
| # languages | |
| task_langs=["zh", "zh-CN"] | |
| model_name = "DMetaSoul/Dmeta-embedding" | |
| model = SentenceTransformer(model_name) | |
| # normalize_embeddings should be true for this model | |
| model.encode = functools.partial(model.encode, normalize_embeddings=True) | |
| evaluation = MTEB(task_types=task_list, task_langs=task_langs) | |
| evaluation.run(model, output_folder=f"results/zh/{model_name.split('/')[-1]}") |