Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2311.00430

multilingual STT and TTS

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 53.7k • 926

Knowledge Distillation

shayekh/aya8b-distillkit-hidden

Updated Aug 11, 2024 • 1
shayekh/aya8b-distillkit-logits

Updated Aug 11, 2024
AhmadMustafa/distAyaQwen

0.6B • Updated Aug 11, 2024 • 6 • 1
Less is More: Task-aware Layer-wise Distillation for Language Model Compression

Paper • 2210.01351 • Published Oct 4, 2022 • 3

whisper_related_papers

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

Paper • 2309.13876 • Published Sep 25, 2023 • 1
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

Paper • 2310.06434 • Published Oct 10, 2023 • 4

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Paper • 2312.03818 • Published Dec 6, 2023 • 34
Scaling Laws of Synthetic Images for Model Training ... for Now

Paper • 2312.04567 • Published Dec 7, 2023 • 9
Large Language Models for Mathematicians

Paper • 2312.04556 • Published Dec 7, 2023 • 13
LooseControl: Lifting ControlNet for Generalized Depth Conditioning

Paper • 2312.03079 • Published Dec 5, 2023 • 16

Speech Models 🎧

ICTNLP/Llama-3.1-8B-Omni

9B • Updated Nov 14, 2024 • 213 • 414
AudioPaLM: A Large Language Model That Can Speak and Listen

Paper • 2306.12925 • Published Jun 22, 2023 • 55
OpenMOSS-Team/SpeechGPT-7B-cm

Text Generation • Updated Sep 15, 2023 • 38 • 7
parler-tts/parler_tts_mini_v0.1

Text-to-Speech • 0.6B • Updated Apr 30, 2024 • 3.43k • 358

Speech to text.

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

Masked Autoencoders Are Scalable Vision Learners

Paper • 2111.06377 • Published Nov 11, 2021 • 5
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
distil-whisper/distil-large-v2

Automatic Speech Recognition • 0.8B • Updated Mar 6 • 7.79k • 511
Seven Failure Points When Engineering a Retrieval Augmented Generation System

Paper • 2401.05856 • Published Jan 11, 2024 • 2

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 90
Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83
Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models

Paper • 2311.00871 • Published Nov 1, 2023 • 3

multilingual STT and TTS

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 53.7k • 926

Speech Models 🎧

ICTNLP/Llama-3.1-8B-Omni

9B • Updated Nov 14, 2024 • 213 • 414
AudioPaLM: A Large Language Model That Can Speak and Listen

Paper • 2306.12925 • Published Jun 22, 2023 • 55
OpenMOSS-Team/SpeechGPT-7B-cm

Text Generation • Updated Sep 15, 2023 • 38 • 7
parler-tts/parler_tts_mini_v0.1

Text-to-Speech • 0.6B • Updated Apr 30, 2024 • 3.43k • 358

Knowledge Distillation

shayekh/aya8b-distillkit-hidden

Updated Aug 11, 2024 • 1
shayekh/aya8b-distillkit-logits

Updated Aug 11, 2024
AhmadMustafa/distAyaQwen

0.6B • Updated Aug 11, 2024 • 6 • 1
Less is More: Task-aware Layer-wise Distillation for Language Model Compression

Paper • 2210.01351 • Published Oct 4, 2022 • 3

Speech to text.

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

whisper_related_papers

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

Paper • 2309.13876 • Published Sep 25, 2023 • 1
Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

Paper • 2310.06434 • Published Oct 10, 2023 • 4

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57

Masked Autoencoders Are Scalable Vision Learners

Paper • 2111.06377 • Published Nov 11, 2021 • 5
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
distil-whisper/distil-large-v2

Automatic Speech Recognition • 0.8B • Updated Mar 6 • 7.79k • 511
Seven Failure Points When Engineering a Retrieval Augmented Generation System

Paper • 2401.05856 • Published Jan 11, 2024 • 2

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Paper • 2312.03818 • Published Dec 6, 2023 • 34
Scaling Laws of Synthetic Images for Model Training ... for Now

Paper • 2312.04567 • Published Dec 7, 2023 • 9
Large Language Models for Mathematicians

Paper • 2312.04556 • Published Dec 7, 2023 • 13
LooseControl: Lifting ControlNet for Generalized Depth Conditioning

Paper • 2312.03079 • Published Dec 5, 2023 • 16

Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling

Paper • 2311.00430 • Published Nov 1, 2023 • 57
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis

Paper • 2307.01952 • Published Jul 4, 2023 • 90
Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83
Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models

Paper • 2311.00871 • Published Nov 1, 2023 • 3

Previous
1
2
3
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs