jpacifico/Aramis-2B-BitNet-bf16

jpacifico committed on Aug 17

Commit 4495094 · verified · 1 Parent(s): 135a742

Update README.md

Files changed (1)

README.md +24 -5

@@ -79,15 +79,34 @@ Evaluations were performed using [LM Eval Harness](https://github.com/EleutherAI
 | jpacifico/bitnet-dpo-merged-modelstock7            | **51,62**              |
-## Usage
-You can run this model using my [Colab notebook](https://github.com/jpacifico/Chocolatine-LLM/blob/main/Chocolatine_14B_inference_test_colab.ipynb)
-You can also run this model using the following code:
-## Last checkpoint
 ### Merge Method
 This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [jpacifico/bitnet-dpo-merged-modelstock-retrain](https://huggingface.co/jpacifico/bitnet-dpo-merged-modelstock-retrain) as a base.
@@ -119,7 +138,7 @@ tokenizer_source: jpacifico/bitnet-dpo-merged-modelstock-retrain
 ```
-## Limitations
 Not tuned for coding or formal math; prefer specialized variants if those are critical.
 No explicit chain-of-thought training; improvements come from bilingual DPO + merging.

 | jpacifico/bitnet-dpo-merged-modelstock7            | **51,62**              |
+### Reproducibility
+All benchmark results reported here were obtained using [LM Eval Harness](https://github.com/EleutherAI/lm-evaluation-harness).
+The following example reproduces the **ARC-Challenge (0-shot)** evaluation for this model:
+```bash
+HF_ALLOW_CODE_EVAL=1 lm-eval --model hf \
+  --model_args pretrained=jpacifico/modelstock7,dtype=bfloat16 \
+  --tasks arc_challenge \
+  --device cuda:0 --batch_size 8 \
+  --seed 42 \
+  --num_fewshot 0 \
+  --confirm_run_unsafe_code \
+  --trust_remote_code
+```
+- All results were computed with LM Eval Harness v0.4.9
+- Results may vary slightly between runs depending on settings such as the random seed and batch size
+- The same procedure was used to evaluate all tasks presented in the benchmark tables
+# Usage with `bitnet.cpp`
+You can run this model using my demo [Colab notebook](https://github.com/jpacifico/) TBD
+Please refer to the [bitnet.cpp](https://github.com/microsoft/BitNet) GitHub repository for detailed compilation steps, usage examples, and command-line options.
+# Last checkpoint
 ### Merge Method
 This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [jpacifico/bitnet-dpo-merged-modelstock-retrain](https://huggingface.co/jpacifico/bitnet-dpo-merged-modelstock-retrain) as a base.
 ```
+# Limitations
 Not tuned for coding or formal math; prefer specialized variants if those are critical.
 No explicit chain-of-thought training; improvements come from bilingual DPO + merging.
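
The Reproducibility section added by this commit states that the same procedure was used for every task in the benchmark tables, but only shows the ARC-Challenge command. The following is a minimal sketch, not the author's script, of how that command could be repeated over a task list. The `run_task` function, the `TASKS` list, and the `DRY_RUN` switch are hypothetical names introduced for illustration; the model id and all flags are copied from the command in the diff. Only `arc_challenge` is listed because the diff does not name the harness task ids or few-shot settings used for the other benchmarks.

```bash
#!/bin/sh
# Sketch: repeat the evaluation command from the README for a list of tasks.
set -eu

MODEL="jpacifico/modelstock7"   # model id exactly as written in the README command
TASKS="arc_challenge"           # space-separated task ids; extend as needed (assumption)
DRY_RUN="${DRY_RUN:-1}"         # hypothetical switch: 1 = print the command, 0 = run it

export HF_ALLOW_CODE_EVAL=1     # same environment variable as in the README command

# Build the lm-eval command for one task, then print or execute it.
run_task() {
  task="$1"
  set -- lm-eval --model hf \
    --model_args "pretrained=${MODEL},dtype=bfloat16" \
    --tasks "$task" \
    --device cuda:0 --batch_size 8 \
    --seed 42 \
    --num_fewshot 0 \
    --confirm_run_unsafe_code \
    --trust_remote_code
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"    # dry run: show the command only
  else
    "$@"         # requires lm-eval and a CUDA device
  fi
}

for task in $TASKS; do
  run_task "$task"
done
```

By default the script only prints each command, so it can be checked before committing GPU time; running it with `DRY_RUN=0` executes the harness for each listed task.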