Update Model Card
README.md CHANGED
@@ -184,7 +184,7 @@ print(pipeline([{"role": "system", "content": f"detailed thinking {thinking}"},{
A large variety of training data was used for the knowledge distillation phase that precedes the post-training pipeline; three of these datasets were FineWeb, Buzz-V1.2, and Dolma.

-The data for the multi-stage post-training phases
+The data for the multi-stage post-training phases is a compilation of SFT and RL data that supports improvements in the math, code, general reasoning, and instruction-following capabilities of the original Llama instruct model.

Prompts were either sourced from public and open corpora or synthetically generated. Responses were synthetically generated by a variety of models, with some prompts containing responses for both reasoning-on and reasoning-off modes, to train the model to distinguish between the two modes. This model was improved with Qwen.
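For context, the `detailed thinking {thinking}` system prompt visible in the hunk header is how the reasoning-on/off distinction described above is exercised at inference time. Below is a minimal sketch of that toggle using the `transformers` text-generation pipeline; the model id is a placeholder and the dtype and generation settings are illustrative assumptions, not values taken from this card.

```python
import torch
from transformers import pipeline

# Placeholder id -- substitute the checkpoint this model card describes.
model_id = "your-org/your-model"

pipe = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.bfloat16,  # assumption; use what your hardware supports
    device_map="auto",
    max_new_tokens=512,          # illustrative generation budget
)

# Reasoning mode is toggled purely through the system prompt:
# "detailed thinking on" for long chain-of-thought responses,
# "detailed thinking off" for direct answers.
thinking = "on"

messages = [
    {"role": "system", "content": f"detailed thinking {thinking}"},
    {"role": "user", "content": "Write a haiku about GPUs."},
]

print(pipe(messages))
```

Because the toggle is just a system-prompt string, the same weights serve both modes, which is why the post-training data pairs some prompts with both reasoning-on and reasoning-off responses.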