Qwen2.5-14B-Instruct-BesiegeField-CatapultRL

Qwen2.5-14B-Instruct fine-tuned with Gemini-2.5-Pro synthetic cold-start data and reinforcement-learning optimized for the Catapult task inside the BesiegeField environment.

πŸ“Ž Links

If you found this model useful, please cite:

@article{zhang2025besiegefield,
  title={Agentic Design of Compositional Machines},
  author={Zhang, Wenqian and Liu, Weiyang and Liu, Zhen},
  journal={arXiv preprint arXiv:2510.14980},
  year={2025}
}
Downloads last month
10
Safetensors
Model size
15B params
Tensor type
BF16
Β·
Video Preview
loading

Model tree for Godheritage/Qwen2.5-14B-Instruct-BesiegeField-CatapultRL

Base model

Qwen/Qwen2.5-14B
Finetuned
(216)
this model