Update README.md
Browse files
README.md
CHANGED
|
@@ -1,29 +1,29 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
language:
|
| 4 |
-
- en
|
| 5 |
-
base_model:
|
| 6 |
-
- Qwen/Qwen2.5-VL-7B-Instruct
|
| 7 |
-
pipeline_tag: reinforcement-learning
|
| 8 |
-
tags:
|
| 9 |
-
- IQA
|
| 10 |
-
- Reasoning
|
| 11 |
-
- VLM
|
| 12 |
-
- Pytorch
|
| 13 |
-
- R1
|
| 14 |
-
|
|
|
|
|
|
|
| 15 |
|
| 16 |
# VisualQuality-R1-7B
|
| 17 |
-
This is the
|
| 18 |
Paper link: [arXiv](https://arxiv.org/abs/2505.14460)<br>
|
| 19 |
Code link: [github](https://github.com/TianheWu/VisualQuality-R1)
|
| 20 |
|
| 21 |
> The first NR-IQA model enhanced by RL2R, capable of both quality description and rating through reasoning.
|
| 22 |
|
| 23 |
|
| 24 |
-
|
| 25 |
-

|
| 26 |
-
|
| 27 |
|
| 28 |
|
| 29 |
## Quick Start
|
|
@@ -327,6 +327,21 @@ print(answer)
|
|
| 327 |
|
| 328 |
|
| 329 |
|
|
|
|
|
|
|
|
|
|
| 330 |
|
| 331 |
|
|
|
|
|
|
|
| 332 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
base_model:
|
| 6 |
+
- Qwen/Qwen2.5-VL-7B-Instruct
|
| 7 |
+
pipeline_tag: reinforcement-learning
|
| 8 |
+
tags:
|
| 9 |
+
- IQA
|
| 10 |
+
- Reasoning
|
| 11 |
+
- VLM
|
| 12 |
+
- Pytorch
|
| 13 |
+
- R1
|
| 14 |
+
- GRPO
|
| 15 |
+
- RL2R
|
| 16 |
+
---
|
| 17 |
|
| 18 |
# VisualQuality-R1-7B
|
| 19 |
+
This is the latest version of VisualQuality-R1, trained on a diverse combination of synthetic and realistic datasets.<br>
|
| 20 |
Paper link: [arXiv](https://arxiv.org/abs/2505.14460)<br>
|
| 21 |
Code link: [github](https://github.com/TianheWu/VisualQuality-R1)
|
| 22 |
|
| 23 |
> The first NR-IQA model enhanced by RL2R, capable of both quality description and rating through reasoning.
|
| 24 |
|
| 25 |
|
| 26 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/655de51982afda0fc479fb91/JZgVeMtAVASCCNYO5VCyn.png" width="600"/>
|
|
|
|
|
|
|
| 27 |
|
| 28 |
|
| 29 |
## Quick Start
|
|
|
|
| 327 |
|
| 328 |
|
| 329 |
|
| 330 |
+
## Related Projects
|
| 331 |
+
- [ECCV 2024] [A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment](https://arxiv.org/abs/2403.10854v2)
|
| 332 |
+
- [CVPR 2025] [Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption](https://www.arxiv.org/abs/2503.11221)
|
| 333 |
|
| 334 |
|
| 335 |
+
## 📧 Contact
|
| 336 |
+
If you have any question, please email `[email protected]` or `[email protected]`.
|
| 337 |
|
| 338 |
+
|
| 339 |
+
## BibTeX
|
| 340 |
+
```
|
| 341 |
+
@article{wu2025visualquality,
|
| 342 |
+
title={{VisualQuality-R1}: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank},
|
| 343 |
+
author={Wu, Tianhe and Zou, Jian and Liang, Jie and Zhang, Lei and Ma, Kede},
|
| 344 |
+
journal={arXiv preprint arXiv:2505.14460},
|
| 345 |
+
year={2025}
|
| 346 |
+
}
|
| 347 |
+
```
|