Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ library_name: transformers
 ---
 
 ## Model Card: GRPO-VI-Qwen2-7B-RAG
-
+
 **Model Description:**
 
 GRPO-VI-Qwen2-7B-RAG is a large language model fine-tuned from the base model Qwen2.5-7B-Instruct (https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) to serve Retrieval-Augmented Generation (RAG) tasks. The fine-tuning process involves Supervised Fine-Tuning combined with GRPO (Group Relative Policy Optimization).