Improve model card for Show-o2: Improved Native Unified Multimodal Models

by nielsr HF Staff - opened Jun 22

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

+150

-27

nielsr

Jun 22

This PR significantly improves the model card for showlab/show-o2 by replacing the generic template content with detailed information extracted from the paper abstract and the project's GitHub repository.

Key updates include:

A comprehensive model description and key capabilities, leveraging the paper's abstract and GitHub overview.
Updated and added external links, including the specific GitHub repository path (show-o/tree/main/show-o2) and a direct link to the Hugging Face Space demo.
Integration of the detailed "News" section from the GitHub README, providing a chronological overview of project updates and features.
Inclusion of visual examples (GIFs and images) directly from the GitHub repository to showcase the model's performance.
A detailed "How to Get Started" section with practical code snippets for inference (Multimodal Understanding, Text-to-Image Generation).
An overview of the multi-stage training pipeline.
Full BibTeX citations for both the Show-o2 and the original Show-o papers.
Populated Developed by and Language(s) fields.

This update transforms the model card into a much more informative and user-friendly resource for anyone interested in Show-o2.

Improve model card for Show-o2: Improved Native Unified Multimodal Models68e40464

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment