LSX-UniWue
/

ModernGBERT_1B

Feature Extraction

text-embeddings-inference

Model card Files Files and versions

Julia287 commited on Jul 4

Commit

5630386

·

verified ·

1 Parent(s): d093b44

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -69,7 +69,16 @@ model = get_peft_model(model, peft_config)
 ### Intermediate Checkpoints
 In addition to the final model checkpoint, we publish intermediate checkpoints throughout the full training process as unique branches in this repository.
 ### Performance
 We evaluate our models across a broad range of tasks. For natural language understanding, we use the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark, and for embedding capabilities, we use the [German MTEB](http://mteb-leaderboard.hf.space/?benchmark_name=MTEB%28deu%2C+v1%29) benchmark (after unsupervised fine-tuning of every model on the German mMARCO portion). The following table provides a comparison of this encoder with other German and multilingual encoders. See our [preprint](https://arxiv.org/abs/2505.13136) for more details about the evaluation.

 ### Intermediate Checkpoints
 In addition to the final model checkpoint, we publish intermediate checkpoints throughout the full training process as unique branches in this repository.
+A specific checkpoint can be loaded like this:
+```python
+from transformers import AutoTokenizer, AutoModelForMaskedLM
+model_id = "LSX-UniWue/ModernGBERT_1B"
+revision = "base-head-12000-ckpt"
+tokenizer = AutoTokenizer.from_pretrained(model_id, revision=revision)
+model = AutoModelForMaskedLM.from_pretrained(model_id, revision=revision)
+```
 ### Performance
 We evaluate our models across a broad range of tasks. For natural language understanding, we use the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark, and for embedding capabilities, we use the [German MTEB](http://mteb-leaderboard.hf.space/?benchmark_name=MTEB%28deu%2C+v1%29) benchmark (after unsupervised fine-tuning of every model on the German mMARCO portion). The following table provides a comparison of this encoder with other German and multilingual encoders. See our [preprint](https://arxiv.org/abs/2505.13136) for more details about the evaluation.