Qwen2.5-14B-Instruct-BesiegeField-CatapultRL

Qwen2.5-14B-Instruct fine-tuned with Gemini-2.5-Pro synthetic cold-start data and reinforcement-learning optimized for the Catapult task inside the BesiegeField environment.

📎 Links

Project Page: https://besiegefield.github.io/
GitHub: https://github.com/Godheritage/BesiegeField
arXiv: https://arxiv.org/abs/2510.14980

If you found this model useful, please cite:

@article{zhang2025besiegefield,
  title={Agentic Design of Compositional Machines},
  author={Zhang, Wenqian and Liu, Weiyang and Liu, Zhen},
  journal={arXiv preprint arXiv:2510.14980},
  year={2025}
}

Downloads last month: 10

Safetensors

Model size

15B params

Tensor type

BF16

Video Preview

Reinforcement Learning

Model tree for Godheritage/Qwen2.5-14B-Instruct-BesiegeField-CatapultRL

Base model

Qwen/Qwen2.5-14B

Finetuned

Qwen/Qwen2.5-14B-Instruct

Finetuned

(216)

this model