HANI-LAB/Med-REFL-MedReason-8B-lora
Tags: Question Answering, Safetensors, English, medical, medical-reasoning, lora, dpo, reflection
arxiv: 2506.13793
License: apache-2.0
main
Med-REFL-MedReason-8B-lora/README.md
Commit History
b048e80 (verified): Update README.md, CityU-Zongxian committed on Jun 19
779462d (verified): Update README.md, CityU-Zongxian committed on Jun 19
26524e6 (verified): Update README.md, CityU-Zongxian committed on Jun 10
f1c544d (verified): initial commit, CityU-Zongxian committed on Jun 10