π§ CoreML Silero VAD
A CoreML implementation of the Silero Voice Activity Detection (VAD) model, optimized for Apple platforms (iOS/macOS). This repository contains pre-converted CoreML models ready for use in Swift applications.
See FluidAudio Repo link at the top for more information
Model Description
Developed by: Silero Team (original), converted by FluidAudio
Model type: Voice Activity Detection
License: MIT
Parent Model: silero-vad
This is how the model performs against the silero-vad v6.0.0 basline Pytorch JIT version
Note that we tested the quantized versions, as the model is already tiny, theres no performance imporvement at all.
This is how the different models compare in terms of speed, the 256s takes in 8 chunks of 32ms and processes it in batches so its much faster

Conversion code is available here: FluidInference/mobius
Intended Use
Primary Use Cases
- Real-time voice activity detection in iOS/macOS applications
- Speech preprocessing for ASR systems
- Audio segmentation and filtering
How to Use
Citation
@misc{silero-vad-coreml, title={CoreML Silero VAD}, author={FluidAudio Team}, year={2024},
url={https://huggingface.co/alexwengg/coreml-silero-vad} }
@misc{silero-vad, title={Silero VAD}, author={Silero Team}, year={2021}, url={https://github.com/snakers4/silero-vad} }
- Downloads last month
- 3,871
Model tree for FluidInference/silero-vad-coreml
Base model
onnx-community/silero-vad
