AITeamVN/GRPO-VI-Qwen2-7B-RAG

Text Generation

retrieval-augmented-generation

text-generation-inference


AITeamVN committed on Apr 30

Commit 4ee808a · verified · 1 Parent(s): cdeac4f

Update README.md

Files changed (1)

README.md +1 -0

@@ -16,6 +16,7 @@ library_name: transformers
 **Model Description:**
 GRPO-VI-Qwen2-7B-RAG is a large language model fine-tuned from the base model Qwen2.5-7B-Instruct (https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) to serve Retrieval-Augmented Generation (RAG) tasks. The fine-tuning process involves Supervised Fine-Tuning combined with GRPO (Group Relative Policy Optimization).
 The model is trained on a Vietnamese-language dataset with the goal of improving Vietnamese language understanding and generation capabilities, while enhancing performance on tasks that require integrating information retrieved from external documents.
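As a rough illustration of the "group relative" idea behind GRPO mentioned above, the sketch below normalizes the rewards of several sampled answers to the same prompt against that group's own mean and standard deviation. This is not the authors' training code: the function name `group_relative_advantages`, the `eps` constant, and the example reward values are assumptions made only for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each sampled answer relative to its own group.

    rewards: one scalar reward per answer sampled for the same prompt.
    eps: small constant (an assumption here) that avoids division by zero
         when every answer in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation of the group
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four answers sampled for one RAG question.
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(rewards)
```

Answers scoring above the group mean get a positive advantage and those below get a negative one, so no separate value model is needed to produce a baseline.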
 **Purpose of Use:**
