Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Fayaz
/
grpo_legal_extractor_qwen3_4b_V0
like
0
Transformers
Safetensors
Generated from Trainer
sft
unsloth
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
grpo_legal_extractor_qwen3_4b_V0
/
merges.txt
Commit History
Fayaz/law_extraction_qwen3_4b_grpo
342fb7e
verified
Fayaz
commited on
Jun 23