Improve model card: Add pipeline tag, library, abstract, and comprehensive usage

by nielsr HF Staff - opened Oct 13

←

nielsr

Oct 13

This PR significantly enhances the model card for UniLIP-3B by:

Adding the pipeline_tag: any-to-any, accurately reflecting the model's unified multimodal capabilities in understanding, generation, and editing.
Specifying library_name: transformers as the model is compatible with the 🤗 Transformers library (evidence from config.json showing transformers_version). This enables automated code snippets for easy usage on the Hub.
Updating the paper link to point directly to the Hugging Face papers page: UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing.
Including the full paper abstract, providing essential context about the model's design and contributions.
Adding detailed "Main Results" tables to showcase the model's performance across various tasks.
Integrating a comprehensive "Quick Start" section, including installation instructions and "Simple Inference" code snippets for image generation, understanding, and omni-potent tasks, directly from the official GitHub repository. The model_path in the code has been updated to refer to kanashi6/UniLIP-3B for direct usability.
Including "Acknowledgements" and "Citation" sections for proper attribution.

These changes will make the model more discoverable, informative, and user-friendly on the Hugging Face Hub.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment