Update README.md
README.md
CHANGED
@@ -25,13 +25,17 @@ Voxtral Small is an enhancement of [Mistral Small 3](https://huggingface.co/mist
 
 Learn more about Voxtral in our blog post [here](https://mistral.ai/news/voxtral-2507).
 
+Both Voxtral models go beyond transcription with capabilities that include:
+
+
 ## Key Features
 
 Voxtral builds upon Mistral Small 3 with powerful audio understanding capabilities.
--
-- **
--
-- Function
+- **Long-form context**: With a 32k token context length, Voxtral handles audio up to 30 minutes for transcription, or 40 minutes for understanding
+- **Built-in Q&A and summarization**: Supports asking questions directly about the audio content or generating structured summaries, without the need to chain separate ASR and language models
+- **Natively multilingual**: Automatic language detection and state-of-the-art performance in the world’s most widely used languages (English, Spanish, French, Portuguese, Hindi, German, Dutch, Italian, to name a few), helping teams serve global audiences with a single system
+- **Function-calling straight from voice**: Enables direct triggering of backend functions, workflows, or API calls based on spoken user intents, turning voice interactions into actionable system commands without intermediate parsing steps
+- **Highly capable at text**: Retains the text understanding capabilities of its language model backbone, Mistral Small 3
 
 ## Benchmark Results
 