🧃 CoreML Silero VAD

A CoreML implementation of the Silero Voice Activity Detection (VAD) model, optimized for Apple platforms (iOS/macOS). This repository contains pre-converted CoreML models ready for use in Swift applications.

See FluidAudio Repo link at the top for more information

Model Description

Developed by: Silero Team (original), converted by FluidAudio

Model type: Voice Activity Detection

License: MIT

Parent Model: silero-vad

This is how the model performs against the silero-vad v6.0.0 basline Pytorch JIT version

Note that we tested the quantized versions, as the model is already tiny, theres no performance imporvement at all.

This is how the different models compare in terms of speed, the 256s takes in 8 chunks of 32ms and processes it in batches so its much faster

Conversion code is available here: FluidInference/mobius

Intended Use

Primary Use Cases

Real-time voice activity detection in iOS/macOS applications
Speech preprocessing for ASR systems
Audio segmentation and filtering

How to Use

Citation

@misc{silero-vad-coreml, title={CoreML Silero VAD}, author={FluidAudio Team}, year={2024},

url={https://huggingface.co/alexwengg/coreml-silero-vad} }

@misc{silero-vad, title={Silero VAD}, author={Silero Team}, year={2021}, url={https://github.com/snakers4/silero-vad} }

GitHub: https://github.com/FluidAudio/FluidAudioSwift

Downloads last month: 6,586

Inference Providers NEW

Voice Activity Detection

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for FluidInference/silero-vad-coreml

Base model

onnx-community/silero-vad

Quantized

(1)

this model

Datasets used to train FluidInference/silero-vad-coreml

Collection including FluidInference/silero-vad-coreml

CoreML

Collection

Models for Apple devices. See https://github.com/FluidInference/FluidAudio for usage details • 7 items • Updated about 8 hours ago • 4