Update read.me
README.md CHANGED
@@ -1,5 +1,14 @@
 ---
 license: apache-2.0
+language:
+- en
+- zh
+- es
+base_model:
+- Qwen/Qwen3-Next-80B-A3B-Instruct
+library_name: adapter-transformers
+tags:
+- agent
 ---
 
 Trained on [mistral-7b](https://huggingface.co/mistralai/Mistral-7B-v0.1) as a base model, this Samantha was trained in 2 hours on 4x A100 80gb with 20 epochs of the Samantha-1.1 dataset.
@@ -62,4 +71,4 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
 | TruthfulQA (0-shot) | 46.08 |
 | Winogrande (5-shot) | 76.8 |
 | GSM8K (5-shot) | 16.0 |
-| DROP (3-shot) | 11.22 |
+| DROP (3-shot) | 11.22 |