Audio Dataset
MLCommons/peoples_speech_v1.0 • Updated Aug 25, 2024 • 109 • 8
amphion/Emilia-Dataset • Viewer • Updated Feb 28, 2025 • 54.8M • 41k • 459
simon3000/genshin-voice • Viewer • Updated Apr 22, 2025 • 424k • 4.54k • 234
facebook/multilingual_librispeech • Viewer • Updated Aug 12, 2024 • 1.49M • 51k • 180

Omni model
A collection of omni-modal models.
inclusionAI/Ming-flash-omni-2.0 • Any-to-Any • 104B • Updated Feb 12 • 2.87k • 266
Qwen/Qwen3-Omni-30B-A3B-Instruct • Any-to-Any • 35B • Updated Sep 22, 2025 • 1.55M • 931
naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B • Text Generation • 11B • Updated Jan 6 • 723 • 188
meituan-longcat/LongCat-Flash-Omni • Any-to-Any • 561B • Updated Nov 11, 2025 • 73 • 112