Update README.md
README.md
CHANGED
@@ -27,7 +27,7 @@ pipeline_tag: text-generation
 
 # Qwen2.5-Lumen-14B
 
-* *Direct preference optimization finetuned for 3 epoch
+* *Direct preference optimization finetuned for 3 epoch*
 
 
 
@@ -41,6 +41,8 @@ Trained [Qwen2.5-14B-Instruct] for 2 epochs on [jondurbin/gutenberg-dpo-v0.1] sa
 
 [Tanliboy](https://huggingface.co/tanliboy) trained [Qwen2.5-14B-Instruct] for 1 epoch on [HuggingFaceH4/ultrafeedback_binarized].
 
+*Mass checkpoint merged, Based on Qwen2.5-14B-Instruct.*
+
 ## Merge
 
 * Merged with a sophosympatheia <b>SLERP</b> *Ultrafeedback-Binarized DPO* and *Gutenberg DPO*