Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Heng-Jui Chang's picture
4 4 3

Heng-Jui Chang

vectominist
Player1444's profile picture shuyuej's profile picture 21world's profile picture
·
https://people.csail.mit.edu/hengjui/
  • hjchang87
  • vectominist

AI & ML interests

Speech Processing, Multimodal Learning, Self-supervised Learning

Recent Activity

authored a paper 11 days ago
Pushing the Frontier of Audiovisual Perception with Large-Scale Multimodal Correspondence Learning
upvoted a collection 13 days ago
perception-encoder-audio-visual
liked a model about 1 month ago
nvidia/audio-flamingo-3-hf
View all activity

Organizations

Massachusetts Institute of Technology's profile picture ESPnet's profile picture NTU Speech Processing & Machine Learning Lab's profile picture s3prl's profile picture Meta Llama's profile picture Spoken Language Systems's profile picture

upvoted a collection 13 days ago

perception-encoder-audio-visual

Collection
9 items • Updated 20 days ago • 23
upvoted a collection 3 months ago

Meta CLIP 1

Collection
Scaling CLIP data with transparent training distribution from an end-to-end pipeline. • 7 items • Updated Nov 24, 2025 • 21
upvoted a paper 7 months ago

USAD: Universal Speech and Audio Representation via Distillation

Paper • 2506.18843 • Published Jun 23, 2025 • 12
upvoted a paper 11 months ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13, 2025 • 37
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs