Update model card for Mono-InternVL-2B with Mono-InternVL-1.5 paper and comprehensive details

#11
by nielsr HF Staff - opened

This PR updates the model card for Mono-InternVL-2B to reflect information from the more recent paper Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models.

Specifically, it:

  • Updates the primary paper link and enriches the introduction with details from the Mono-InternVL-1.5 abstract and the project's introduction.
  • Adds visual charts (radar and architecture) from the GitHub repository for better illustration.
  • Includes the comprehensive performance benchmark table.
  • Expands the inference section to include both transformers and LMDeploy examples.
  • Adds a new section for supervised finetuning, including installation, dataset preparation, and training instructions.
  • Updates the citation section to include both Mono-InternVL V1 and V1.5 papers for complete attribution.
  • Removes the "File information" section, as it is internal context and not part of the public model card.

Please review and merge this PR.

Cannot merge
This branch has merge conflicts in the following files:
  • README.md

Sign up or log in to comment