fixed typos to README
Browse files
README.md
CHANGED
|
@@ -76,8 +76,8 @@ https://portal.nousresearch.com/
|
|
| 76 |
|
| 77 |

|
| 78 |
|
| 79 |
-
*Reasoning ON benchmarks
|
| 80 |
-
*Upper bound determined by measuring the % gained over Hermes 3 3 & 70b by MATH_VERIFY compared to eleuther eval harness, which ranged
|
| 81 |
|
| 82 |
## Benchmarks in **Non-Reasoning Mode** against Llama-3.1-8B-Instruct
|
| 83 |
|
|
|
|
| 76 |
|
| 77 |

|
| 78 |
|
| 79 |
+
*Reasoning ON benchmarks acquired by running HuggingFace's open-r1 reasoning mode evaluation suite, and scores for reasoning mode OFF acquired by running LM-Eval-Harness Benchmark Suite*
|
| 80 |
+
*Upper bound determined by measuring the % gained over Hermes 3 3 & 70b by MATH_VERIFY compared to eleuther eval harness, which ranged between 33% and 50% gain in MATH Hard benchmark on retested models by them compared to eval harness reported scores*
|
| 81 |
|
| 82 |
## Benchmarks in **Non-Reasoning Mode** against Llama-3.1-8B-Instruct
|
| 83 |
|