mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced 2025-10-06 09:07:10 +08:00
English | 简体中文
Silero VAD - pre-trained enterprise-grade Voice Activity Detector
The deployment model comes from silero-vad
Key Features
- Stellar accuracy
Silero VAD has excellent results on speech detection tasks.
- Fast
One audio chunk (30+ ms) takes less than 1ms to be processed on a single CPU thread. Using batching or GPU can also improve performance considerably.
- General
Silero VAD was trained on huge corpora that include over 100 languages and it performs well on audios from different domains with various background noise and quality levels.
- Flexible sampling rate
Silero VAD supports 8000 Hz and 16000 Hz sampling rates.
Download Pre-trained ONNX Model
For developers' testing, model exported by VAD are provided below. Developers can download them directly.
模型 | 大小 | 备注 |
---|---|---|
silero-vad | 1.8MB | This model file is sourced from snakers4/silero-vad,MIT License |