Godheritage commited on
Commit
20eddc6
·
verified ·
1 Parent(s): 0300caa

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -0
README.md ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - qwen2.5
5
+ - 14b
6
+ - reinforcement-learning
7
+ - besiegefield
8
+ - catapult
9
+ - gemini-2.5-pro
10
+ - synthetic-data
11
+ - instruct
12
+ - transformers
13
+ language:
14
+ - en
15
+ base_model:
16
+ - Qwen/Qwen2.5-14B-Instruct
17
+ ---
18
+
19
+ # Qwen2.5-14B-Instruct-BesiegeField-CatapultRL
20
+
21
+ **Qwen2.5-14B-Instruct** fine-tuned with **Gemini-2.5-Pro synthetic cold-start data** and reinforcement-learning optimized for the **Catapult task** inside the **BesiegeField** environment.
22
+
23
+ # 📎 Links
24
+ - **Project Page:** https://besiegefield.github.io/
25
+ - **GitHub:** https://github.com/Godheritage/BesiegeField
26
+ - **arXiv:** https://arxiv.org/abs/2510.14980
27
+
28
+ If you found this model useful, please cite:
29
+ ```bibtex
30
+ @article{zhang2025besiegefield,
31
+ title={Agentic Design of Compositional Machines},
32
+ author={Zhang, Wenqian and Liu, Weiyang and Liu, Zhen},
33
+ journal={arXiv preprint arXiv:2510.14980},
34
+ year={2025}
35
+ }
36
+ ```