Improve model card: Add pipeline tag, library name, paper link, and detailed description

#1
by nielsr HF Staff - opened

This PR enhances the model card for the CoReward-Qwen2.5-7B model by adding key metadata and enriching its content:

  • pipeline_tag: text-generation: This tag is added to correctly categorize the model and improve its discoverability for users seeking text generation capabilities on the Hugging Face Hub.
  • library_name: transformers: The model's config.json clearly indicates compatibility with the transformers library, as it defines Qwen2ForCausalLM as an architecture and specifies a transformers_version. Adding this metadata enables the automated "How to use" widget, providing users with predefined code snippets.
  • Paper Link: A direct link to the paper, Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models, is included for easy access to the research.
  • Model Description: A detailed description based on the paper's abstract is added to provide a clearer understanding of the Co-rewarding framework and its contributions.
  • GitHub Repository: The link to the official GitHub repository (https://github.com/tmlr-group/Co-rewarding) is made more prominent.
  • Framework Image: The visual framework image from the GitHub README is embedded to quickly convey the method's overview.
  • Citation Update: The BibTeX citation is updated to match the more accurate version provided in the project's GitHub README.

These changes significantly improve the model card's completeness and user-friendliness.

resistz changed pull request status to merged

Sign up or log in to comment