--- license: apache-2.0 --- # JT-Math-8B-Instruct
We are excited to introduce JT-Math-8B-Instruct, a powerful 8-billion parameter model specialized for mathematical reasoning. It achieves state-of-the-art performance on major math benchmarks among models of its size. JT-Math-8B-Instruct is fine-tuned from Jiutian-Math-8B-Base and has been optimized through a comprehensive process involving Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) to enhance its mathematical problem-solving abilities and instruction-following capabilities. For full transparency and reproducibility, please refer to our technical report which details our training recipe and pipeline. ## Model Details 🚀 The **JT-Math-8B-Instruct** is an 8-billion parameter language model built on the **Jiutian LLM architecture** with a **context length of 32,768 tokens**. Its development involved two key stages: initial pre-training of the **JT-Math-8B-Base** model on a diverse corpus of text and mathematical data, followed by a two-stage instruction tuning process. This tuning began with **Supervised Fine-Tuning (SFT)**, where the model was trained on a high-quality, multilingual dataset of mathematical problems and solutions in both English and Chinese to grasp problem-solving patterns. Subsequently, **Reinforcement Learning (RL)** was applied within an 8K context window to enhance reasoning accuracy, minimize logical fallacies, and align the model more closely with human preferences for clear and correct mathematical solutions. ## Model Downloads We release the following model to support a wide range of applications: | Model Name | Context Length | Hugging Face Link | ModelScope Link | Notes | | ------------------- | -------------- | -------------------------------------------------------- | ------------------------------------------------------------ | --------------------------------------------------- | | JT-Math-8B-Instruct | 32K | [Link](https://huggingface.co/JT-LM/JT-Math-8B-Instruct) | [Link](https://www.modelscope.cn/models/JiuTian-AI/JT-Math-8B-Instruct) | Instruction-tuned for general math problem-solving. | ------ ## Evaluation Results JT-Math-8B-Instruct demonstrates state-of-the-art performance on key mathematical benchmarks, outperforming other open-source models in the ~8B parameter class. Below is a summary of our evaluation results: ![alt text](