janhq/250409-llama-3.2-3b-instruct-grpo-01-no-retry


250409-llama-3.2-3b-instruct-grpo-01-no-retry / model_merged_16bit

6.44 GB

1 contributor

History: 1 commit

Latest commit: ae624d8 (verified) by thinhlpg, 7 months ago: Upload folder using huggingface_hub

File                              Size       Storage
config.json                       993 Bytes
generation_config.json            166 Bytes
model-00001-of-00002.safetensors  4.97 GB    xet
model-00002-of-00002.safetensors  1.46 GB    xet
model.safetensors.index.json      20.9 kB
special_tokens_map.json           454 Bytes
tokenizer.json                    17.2 MB    xet
tokenizer_config.json             54.7 kB

Last commit for every file: Upload folder using huggingface_hub, 7 months ago
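
The two model-0000N-of-00002.safetensors shards are tied together by model.safetensors.index.json. As a minimal sketch, assuming that file follows the usual Hugging Face sharded-checkpoint layout (a "weight_map" object mapping tensor names to shard filenames), the shards it references could be listed as below. The function name shard_files and the miniature index are hypothetical; the real index contents are not reproduced here.

```python
import json


def shard_files(index: dict) -> list[str]:
    """Return the sorted, de-duplicated shard filenames named in a safetensors index."""
    return sorted(set(index["weight_map"].values()))


# Hypothetical miniature index: the tensor names and total_size are
# illustrative placeholders, not values taken from this repository.
example_index = json.loads("""
{
  "metadata": {"total_size": 0},
  "weight_map": {
    "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
    "model.layers.0.self_attn.q_proj.weight": "model-00001-of-00002.safetensors",
    "model.norm.weight": "model-00002-of-00002.safetensors"
  }
}
""")

print(shard_files(example_index))
```

With a downloaded copy of the folder, the same function could be applied to the result of json.load on model.safetensors.index.json to check that every shard it names is present on disk.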