---
library_name: transformers
pipeline_tag: text-generation
base_model:
- meta-llama/Llama-3.2-1B
---

## UFT

This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984).

Code: https://github.com/liumy2010/UFT
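
Since the metadata declares `library_name: transformers` and `pipeline_tag: text-generation`, the checkpoint can presumably be loaded with the standard causal-LM classes. The following is a minimal sketch, not the authors' documented usage: `REPO_ID` is a hypothetical placeholder for this repository's Hub id, and the helper names and decoding settings are illustrative assumptions rather than values from the paper.

```python
from typing import Any

# Hypothetical placeholder: replace with this repository's actual Hub id.
REPO_ID = "<user>/<repo>"


def generation_kwargs(max_new_tokens: int = 64) -> dict[str, Any]:
    """Greedy-decoding settings (illustrative assumption, not from the paper)."""
    return {"max_new_tokens": max_new_tokens, "do_sample": False}


def generate(prompt: str, repo_id: str = REPO_ID, max_new_tokens: int = 64) -> str:
    """Load the checkpoint and return the decoded prompt plus continuation."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)

    # Tokenize to PyTorch tensors and decode greedily.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, **generation_kwargs(max_new_tokens))
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Question: What is 12 * 7?\nAnswer:", repo_id=...)` with the real repository id downloads the weights on first use. The prompt format here is only an example; the linked code repository is the place to check for the format used in training.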
    
## References

* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)