🧃 CoreML Silero VAD


A CoreML implementation of the Silero Voice Activity Detection (VAD) model, optimized for Apple platforms (iOS/macOS). This repository contains pre-converted CoreML models ready for use in Swift applications.

See the FluidAudio repository for more information.

Model Description

Developed by: Silero Team (original), converted by FluidAudio

Model type: Voice Activity Detection

License: MIT

Parent Model: silero-vad

The graphs below show how the model performs against the silero-vad v6.0.0 baseline PyTorch JIT version:

graphs/yc_standard_comparison_20250915_205721_2c04b81.png graphs/yc_256ms_comparison_20250915_205721_2c04b81.png

Note that we also tested quantized versions. Because the model is already tiny, quantization gave no performance improvement at all.

This is how the different models compare in terms of speed. The 256 ms variant takes in 8 chunks of 32 ms and processes them as a batch, so it is much faster:

graphs/yc_performance_20250915_205721_2c04b81.png
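To make the chunk arithmetic concrete, here is a minimal Python sketch of how audio could be grouped into 256 ms windows of 8 chunks of 32 ms each. It assumes 16 kHz mono audio, so one 32 ms chunk is 512 samples. The sample rate, the helper name `split_into_batches`, and the zero-padding of a short final window are illustrative assumptions, not a description of how the converted model actually handles its input.

```python
SAMPLE_RATE = 16_000      # assumed sample rate in Hz (not stated in this card)
CHUNK_MS = 32             # chunk duration from the description above
CHUNKS_PER_BATCH = 8      # the 256 ms variant consumes 8 chunks at once

CHUNK_SAMPLES = SAMPLE_RATE * CHUNK_MS // 1000      # 512 samples per chunk
BATCH_SAMPLES = CHUNK_SAMPLES * CHUNKS_PER_BATCH    # 4096 samples = 256 ms


def split_into_batches(samples):
    """Group samples into batches of 8 chunks, zero-padding the last batch.

    Hypothetical helper for illustration only.
    """
    batches = []
    for start in range(0, len(samples), BATCH_SAMPLES):
        window = list(samples[start:start + BATCH_SAMPLES])
        # Pad a short final window with silence so every batch has 8 full chunks.
        window += [0.0] * (BATCH_SAMPLES - len(window))
        chunks = [
            window[i:i + CHUNK_SAMPLES]
            for i in range(0, BATCH_SAMPLES, CHUNK_SAMPLES)
        ]
        batches.append(chunks)
    return batches


audio = [0.0] * SAMPLE_RATE   # one second of silence as placeholder input
batches = split_into_batches(audio)
print(len(batches), len(batches[0]), len(batches[0][0]))  # 4 8 512
```

Under these assumptions, one second of audio needs 4 batched calls instead of 32 single-chunk calls (31.25 rounded up), which is consistent with the batched variant being faster per second of audio.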

Conversion code is available here: FluidInference/mobius

Intended Use

Primary Use Cases

  • Real-time voice activity detection in iOS/macOS applications
  • Speech preprocessing for ASR systems
  • Audio segmentation and filtering
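For the segmentation use case, a VAD model typically yields one speech probability per chunk, and the application merges those into speech segments. The Python sketch below shows one simple way to do that. The function name `probabilities_to_segments`, the 0.5 threshold, and the example probabilities are assumptions for illustration and are not part of this repository's API.

```python
def probabilities_to_segments(probs, threshold=0.5, chunk_ms=32):
    """Merge per-chunk speech probabilities into (start_ms, end_ms) segments.

    Hypothetical helper: a chunk counts as speech when its probability
    is at or above `threshold`; consecutive speech chunks form one segment.
    """
    segments = []
    start = None
    for i, p in enumerate(probs):
        if p >= threshold and start is None:
            start = i * chunk_ms                    # speech begins at this chunk
        elif p < threshold and start is not None:
            segments.append((start, i * chunk_ms))  # speech ended before this chunk
            start = None
    if start is not None:
        # Audio ended while still inside a speech segment.
        segments.append((start, len(probs) * chunk_ms))
    return segments


probs = [0.1, 0.8, 0.9, 0.7, 0.2, 0.1, 0.6, 0.9]   # made-up example values
print(probabilities_to_segments(probs))  # [(32, 128), (192, 256)]
```

A production pipeline would usually add hysteresis (separate on/off thresholds) and minimum speech and silence durations to avoid flickering on borderline chunks; those are left out here to keep the sketch short.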

How to Use

Citation

@misc{silero-vad-coreml,
  title={CoreML Silero VAD},
  author={FluidAudio Team},
  year={2024},
  url={https://huggingface.co/alexwengg/coreml-silero-vad}
}

@misc{silero-vad,
  title={Silero VAD},
  author={Silero Team},
  year={2021},
  url={https://github.com/snakers4/silero-vad}
}
