Gravel 4B

Continued pretraining of qingy2024/Qwen2.5-4B on 143M tokens from the 4-plus subset of HuggingFaceTB/finemath.
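Since this is a base (non-chat) checkpoint, plain text completion is the natural way to try it. The following is a minimal sketch, not the author's documented usage: it assumes the weights load through the standard `transformers` causal-LM classes, and it uses the repo id `qingy2024/Gravel-3.8B-Base` as shown in the model tree on this page. The helper names `load_gravel` and `complete` are hypothetical.

```python
# Assumption: the checkpoint is compatible with the generic Auto* classes.
MODEL_ID = "qingy2024/Gravel-3.8B-Base"  # repo id as listed in the model tree


def load_gravel(model_id: str = MODEL_ID):
    """Load the tokenizer and the model in BF16 (hypothetical helper).

    Requires the `torch` and `transformers` packages. They are imported
    lazily so this module can be imported without them installed.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # BF16 matches the tensor type the page lists for the stored weights.
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    return tokenizer, model


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Greedy-style text continuation of `prompt` (hypothetical helper)."""
    tokenizer, model = load_gravel()
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode the full sequence (prompt plus continuation) back to text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("The derivative of x^2 is")` downloads the weights on first use and returns the prompt followed by the model's continuation. Recent `transformers` releases also accept `dtype=` in place of `torch_dtype=`.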

Format: Safetensors
Model size: 4B params
Tensor type: BF16

Model tree for qingy2024/Gravel-3.8B-Base

Base model: Qwen/Qwen2.5-3B
Finetuned (2): this model
Quantizations: 1 model

Dataset used to train qingy2024/Gravel-3.8B-Base: HuggingFaceTB/finemath