---
license: apache-2.0
---

# VaPR-Qwen2VL-7B
This model is an artifact of the work **VaPR – Vision-language Preference alignment for Reasoning (accepted at COLM 2025).** 📂 [Project Website](https://vap-r.github.io/)


## Base Model
- **Base Model**: [Qwen2VL 7B Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)

## Training Data
This model has been fine-tuned on the **VaPR-30k subset** dataset available at: [VaPR-30k](https://huggingface.co/datasets/VaPR-UCLA/vapr-30k)

## Citation

If you use this model, please cite the following paper:

```bibtex
inproceedings{
wadhawan2025vapr,
title={Va{PR} - Vision-language Preference alignment for Reasoning},
author={Rohan Wadhawan and Fabrice Y Harel-Canada and Zi-Yi Dou and Suhaila Shakiah and Robinson Piramuthu and Nanyun Peng},
booktitle={Second Conference on Language Modeling},
year={2025},
url={https://openreview.net/forum?id=uBAubFwymy}
}