mistralai
/

Voxtral-Small-24B-2507

Audio-Text-to-Text

Model card Files Files and versions

pandora-s commited on Jul 15

Commit

35ac9a5

·

verified ·

1 Parent(s): 44bc30e

Update README.md

Files changed (1) hide show

README.md +8 -4

README.md CHANGED Viewed

@@ -25,13 +25,17 @@ Voxtral Small is an enhancement of [Mistral Small 3](https://huggingface.co/mist
 Learn more about Voxtral in our blog post [here](https://mistral.ai/news/voxtral-2507).
 ## Key Features
 Voxtral builds upon Mistral Small 3 with powerful audio understanding capabilities.
-- Audio and text understanding
-- **32k context length** for more than **30 minutes of audio**
-- Audio understanding for **transcriptions**, **summaries**, **translations**, and much more
-- Function Calling
 ## Benchmark Results

 Learn more about Voxtral in our blog post [here](https://mistral.ai/news/voxtral-2507).
+Both Voxtral models go beyond transcription with capabilities that include:
 ## Key Features
 Voxtral builds upon Mistral Small 3 with powerful audio understanding capabilities.
+- **Long-form context**: with a 32k token context length, Voxtral handles audios up to 30 minutes for transcription, or 40 minutes for understanding
+- **Built-in Q&A and summarization**: Supports asking questions directly about the audio content or generating structured summaries, without the need to chain separate ASR and language models
+- **Natively multilingual**: Automatic language detection and state-of-the-art performance in the world’s most widely used languages (English, Spanish, French, Portuguese, Hindi, German, Dutch, Italian, to name a few), helping teams serve global audiences with a single system
+- **Function-calling straight from voice**: Enables direct triggering of backend functions, workflows, or API calls based on spoken user intents, turning voice interactions into actionable system commands without intermediate parsing steps.
+- **Highly capable at text**: Retains the text understanding capabilities of its language model backbone, Mistral Small 3
 ## Benchmark Results