Files
FastDeploy/examples/audio/silero-vad
Qianhe Chen 09ec386e8d [Bug Fix] Fix speech and silence state transition in VAD (#1937)
* Fix speech and silence state transition

* Fix typo

* Fix speech and silence state transition

---------

Co-authored-by: DefTruth <31974251+DefTruth@users.noreply.github.com>
2023-05-16 18:50:04 +08:00
..

English | 简体中文

Silero VAD - pre-trained enterprise-grade Voice Activity Detector

The deployment model comes from silero-vad

Key Features

  • Stellar accuracy

Silero VAD has excellent results on speech detection tasks.

  • Fast

One audio chunk (30+ ms) takes less than 1ms to be processed on a single CPU thread. Using batching or GPU can also improve performance considerably.

  • General

Silero VAD was trained on huge corpora that include over 100 languages and it performs well on audios from different domains with various background noise and quality levels.

  • Flexible sampling rate

Silero VAD supports 8000 Hz and 16000 Hz sampling rates.

Download Pre-trained ONNX Model

For developers' testing, model exported by VAD are provided below. Developers can download them directly.

模型 大小 备注
silero-vad 1.8MB This model file is sourced from snakers4/silero-vadMIT License

Detailed Deployment Documents

Source

https://github.com/snakers4/silero-vad