VidDiff

viddiff
AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 8 months ago

MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

Paper • 2503.13399 • Published Mar 17 • 22

Video Action Differencing

Paper • 2503.07860 • Published Mar 10 • 33

upvoted a paper 9 months ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21 • 45

upvoted 2 papers 11 months ago

Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Paper • 2412.13180 • Published Dec 17, 2024 • 13

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 147