AITeamVN committed on
Commit 79d0e66 · verified · 1 Parent(s): a6a2d99

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ library_name: transformers
 ---
 
 ## Model Card: GRPO-VI-Qwen2-7B-RAG
-
+
 **Model Description:**
 
 GRPO-VI-Qwen2-7B-RAG is a large language model fine-tuned from the base model Qwen2.5-7B-Instruct (https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) to serve Retrieval-Augmented Generation (RAG) tasks. The fine-tuning process involves Supervised Fine-Tuning combined with GRPO (Group Relative Policy Optimization).
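
Reviewer note: the model card names GRPO but does not describe it. As a rough illustration only, the "group relative" part can be sketched as scoring each sampled answer against the other answers drawn for the same prompt. The function name `group_relative_advantages`, the `eps` term, the use of the population standard deviation, and the reward values are all assumptions made for this sketch; they are not taken from this repository or its training code.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the mean and spread of its own group.

    `rewards` holds one scalar reward per answer sampled for a single prompt.
    `eps` (an assumption) avoids division by zero when all rewards are equal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four sampled answers to the same RAG prompt.
advantages = group_relative_advantages([1.0, 0.0, 0.5, 0.5])
```

Answers that score above their group's mean get a positive advantage and those below get a negative one, so no separately trained value model is needed to provide the baseline.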