Add link to project page
#6 · opened by nielsr (HF Staff)

README.md CHANGED

````diff
@@ -1,14 +1,14 @@
 ---
-license: apache-2.0
-datasets:
-- Tongyi-Zhiwen/DocQA-RL-1.6K
 base_model:
 - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
+datasets:
+- Tongyi-Zhiwen/DocQA-RL-1.6K
+library_name: transformers
+license: apache-2.0
+pipeline_tag: text-generation
 tags:
 - long-context
 - large-reasoning-model
-pipeline_tag: text-generation
-library_name: transformers
 ---
 
 # QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
@@ -23,6 +23,7 @@
 [](https://github.com/Tongyi-Zhiwen/QwenLong-L1)
 [](https://modelscope.cn/models/iic/QwenLong-L1-32B)
 [](https://huggingface.co/Tongyi-Zhiwen/QwenLong-L1-32B)
+[Project Page](https://huggingface.co/Tongyi-Zhiwen/QwenLong-L1-32B)
 
 <!-- **Authors:** -->
 
@@ -155,8 +156,10 @@ try:
 except ValueError:
     index = 0
 
-thinking_content = tokenizer.decode(output_ids[:index], skip_special_tokens=True).strip("\n")
+thinking_content = tokenizer.decode(output_ids[:index], skip_special_tokens=True).strip("\n")
+content = tokenizer.decode(output_ids[index:], skip_special_tokens=True).strip("\n")
 
 print("thinking content:", thinking_content)
 print("content:", content)
@@ -335,11 +338,18 @@ PROJ_DIR="<YOUR_PROJ_DIR_HERE>"
 DATA="<YOUR_DATA_HERE>" # e.g., docmath, frames, 2wikimqa, hotpotqa, musique, narrativeqa, qasper
 python ${PROJ_DIR}/eval/${DATA}_verify.py \
     --save_dir "${PROJ_DIR}/results/${DATA}" \
-    --save_file "${MODEL_NAME}" \
-    --judge_model "deepseek-chat" \
+    --save_file \"${MODEL_NAME}\" \
+    --judge_model \"deepseek-chat\" \
     --batch_size 20
 ```
 
+## 🌐 Join the Community
+
+Chinese users can scan QR codes to join WeChat/DingTalk groups.
+
+| WeChat | DingTalk |
+|--------|----------|
+|  |  |
+
 ## 📝 Citation
 
 If you find this work relevant to your research or applications, please feel free to cite it!
@@ -350,4 +360,8 @@
 journal={arXiv preprint arXiv:2505.17667},
 year={2025}
 }
 ```
+
+## ⭐️ Star History
+
+[](https://star-history.com/#Tongyi-Zhiwen/QwenLong-L1&Timeline)
````