Improve model card: Add pipeline tag, library, abstract, and comprehensive usage
#1 by nielsr (HF Staff)
This PR significantly enhances the model card for UniLIP-3B by:
- Adding `pipeline_tag: any-to-any`, accurately reflecting the model's unified multimodal capabilities in understanding, generation, and editing.
- Specifying `library_name: transformers`, as the model is compatible with the 🤗 Transformers library (evidence from `config.json` showing `transformers_version`). This enables automated code snippets for easy usage on the Hub.
- Updating the paper link to point directly to the Hugging Face papers page: UniLIP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing.
- Including the full paper abstract, providing essential context about the model's design and contributions.
- Adding detailed "Main Results" tables to showcase the model's performance across various tasks.
- Integrating a comprehensive "Quick Start" section, including installation instructions and "Simple Inference" code snippets for image generation, understanding, and omni-potent tasks, directly from the official GitHub repository. The `model_path` in the code has been updated to refer to `kanashi6/UniLIP-3B` for direct usability.
- Including "Acknowledgements" and "Citation" sections for proper attribution.
These changes will make the model more discoverable, informative, and user-friendly on the Hugging Face Hub.
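For reviewers who want to see how the two metadata fields relate to the evidence cited above, here is a minimal sketch of the reasoning, not the PR's actual tooling. The helper `build_front_matter` and the example config contents (including the placeholder version string) are hypothetical; only `pipeline_tag: any-to-any`, `library_name: transformers`, and the `transformers_version` key come from the description.

```python
import json


def build_front_matter(config_text: str, pipeline_tag: str = "any-to-any") -> str:
    """Build the YAML front matter of a model card from a config.json string.

    Hypothetical helper for illustration only.
    """
    config = json.loads(config_text)
    lines = ["---", f"pipeline_tag: {pipeline_tag}"]
    # Declare the library only when config.json records a transformers_version,
    # mirroring the evidence mentioned in the PR description.
    if "transformers_version" in config:
        lines.append("library_name: transformers")
    lines.append("---")
    return "\n".join(lines)


# Placeholder config excerpt; the real config.json contains many more fields
# and the version string here is an assumption.
example_config = '{"transformers_version": "0.0.0"}'
front_matter = build_front_matter(example_config)
print(front_matter)
```

The resulting block is what would sit at the top of the model card's `README.md`; the Hub reads these fields to pick the task filter and the auto-generated usage snippet.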