---
license: mit
tags:
- coreml
- voice-activity-detection
- silero
- vad
- neural-engine
base_model: snakers4/silero-vad
pipeline_tag: voice-activity-detection
---

# Silero-VAD-v5 — CoreML

CoreML conversion of [Silero VAD v5](https://github.com/snakers4/silero-vad) for Apple Neural Engine.

## Model Details

| Detail | Value |
|--------|-------|
| Architecture | STFT → Conv1d encoder → LSTM → decoder |
| Parameters | ~309K |
| Input | 512 samples (32 ms @ 16 kHz) |
| Output | Speech probability (0.0–1.0) |
| Size | ~4.2 MB |

## Usage

```swift
let vad = try await SileroVADModel.fromPretrained(backend: .coreML)
let prob = vad.processChunk(samples)
```

## Variants

| Variant | Backend | Model ID |
|---------|---------|----------|
| MLX | GPU | [aufklarer/Silero-VAD-v5-MLX](https://huggingface.co/aufklarer/Silero-VAD-v5-MLX) |
| **CoreML** | **Neural Engine** | **aufklarer/Silero-VAD-v5-CoreML** |

## Links

- **Swift library**: [soniqo/speech-swift](https://github.com/soniqo/speech-swift)
- **Original model**: [snakers4/silero-vad](https://github.com/snakers4/silero-vad)
- **Blog**: [blog.ivan.digital](https://blog.ivan.digital)
- **Library Docs**: [soniqo.audio](https://soniqo.audio)
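
## Sketch: chunking audio and thresholding

The model takes 512 samples (32 ms at 16 kHz) per call and returns one speech probability, so longer audio has to be split into fixed-size chunks and the per-chunk probabilities turned into segments. The following is a minimal sketch of that bookkeeping only. The helpers `makeChunks` and `speechSegments`, the zero-padding of the final chunk, and the 0.5 threshold are assumptions made for illustration and are not part of speech-swift.

```swift
// Hypothetical helpers for illustration; not part of the speech-swift API.
let chunkSize = 512          // samples per call, per the Model Details table
let sampleRate = 16_000.0    // Hz

/// Splits audio into consecutive fixed-size chunks.
/// The last chunk is zero-padded (an assumption, not documented model behavior).
func makeChunks(_ samples: [Float], size: Int = chunkSize) -> [[Float]] {
    var chunks: [[Float]] = []
    var start = 0
    while start < samples.count {
        let end = min(start + size, samples.count)
        var chunk = Array(samples[start..<end])
        if chunk.count < size {
            chunk += [Float](repeating: 0, count: size - chunk.count)
        }
        chunks.append(chunk)
        start += size
    }
    return chunks
}

/// Converts per-chunk speech probabilities into (start, end) times in seconds.
/// Consecutive chunks at or above `threshold` are merged into one segment.
func speechSegments(
    probabilities: [Float],
    threshold: Float = 0.5   // assumed default; tune for your audio
) -> [(start: Double, end: Double)] {
    let chunkDuration = Double(chunkSize) / sampleRate
    var segments: [(start: Double, end: Double)] = []
    var segmentStart: Int? = nil

    for (index, probability) in probabilities.enumerated() {
        if probability >= threshold {
            if segmentStart == nil { segmentStart = index }
        } else if let first = segmentStart {
            segments.append((start: Double(first) * chunkDuration,
                             end: Double(index) * chunkDuration))
            segmentStart = nil
        }
    }
    // Close a segment that runs to the end of the audio.
    if let first = segmentStart {
        segments.append((start: Double(first) * chunkDuration,
                         end: Double(probabilities.count) * chunkDuration))
    }
    return segments
}
```

With a loaded model, the probabilities could come from calling `vad.processChunk` once per chunk in order, assuming it accepts a `[Float]` chunk as in the Usage snippet. Because the architecture includes an LSTM, chunks should presumably be fed sequentially rather than shuffled or processed in parallel.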