Improve model card: Add pipeline tag, library name, paper link, and detailed description

by nielsr HF Staff - opened 28 days ago

←

nielsr

28 days ago

This PR enhances the model card for the CoReward-Qwen2.5-7B model by adding key metadata and enriching its content:

pipeline_tag: text-generation: This tag is added to correctly categorize the model and improve its discoverability for users seeking text generation capabilities on the Hugging Face Hub.
library_name: transformers: The model's config.json clearly indicates compatibility with the transformers library, as it defines Qwen2ForCausalLM as an architecture and specifies a transformers_version. Adding this metadata enables the automated "How to use" widget, providing users with predefined code snippets.
Paper Link: A direct link to the paper, Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models, is included for easy access to the research.
Model Description: A detailed description based on the paper's abstract is added to provide a clearer understanding of the Co-rewarding framework and its contributions.
GitHub Repository: The link to the official GitHub repository (https://github.com/tmlr-group/Co-rewarding) is made more prominent.
Framework Image: The visual framework image from the GitHub README is embedded to quickly convey the method's overview.
Citation Update: The BibTeX citation is updated to match the more accurate version provided in the project's GitHub README.

These changes significantly improve the model card's completeness and user-friendliness.

resistz changed pull request status to merged 28 days ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment