AITeamVN/GRPO-VI-Qwen2-7B-RAG

Tags: Text Generation, retrieval-augmented-generation, text-generation-inference


AITeamVN committed on Jul 13

Commit 79d0e66 (verified) · 1 Parent(s): a6a2d99

Update README.md

Files changed (1)

README.md +1 -1


@@ -12,7 +12,7 @@ library_name: transformers
 ---
 ## Model Card: GRPO-VI-Qwen2-7B-RAG
 **Model Description:**
 GRPO-VI-Qwen2-7B-RAG is a large language model fine-tuned from the base model Qwen2.5-7B-Instruct (https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) to serve Retrieval-Augmented Generation (RAG) tasks. The fine-tuning process involves Supervised Fine-Tuning combined with GRPO (Group Relative Policy Optimization).
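
For readers unfamiliar with GRPO, the core idea is that several answers are sampled for the same prompt and each answer's reward is normalized against the others in its group. The following is a minimal sketch of that normalization step only, not the authors' training code. The function name, the reward values, the `eps` constant, and the choice of population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of completions sampled for the same prompt.

    Illustrative helper (hypothetical name). Uses the population standard
    deviation; real implementations may use the sample standard deviation.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps avoids division by zero when every completion gets the same reward
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four answers to one question plus retrieved context
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(rewards)
```

Answers scoring above the group mean receive a positive advantage and are reinforced, while those below the mean are discouraged. No separate value model is needed for this step.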
