FastDeploy/examples/audio/silero-vad/README.md
Qianhe Chen a4b94b2c93 [Model] Add Silero VAD example (#1107)
Co-authored-by: DefTruth <31974251+DefTruth@users.noreply.github.com>
2023-01-15 14:42:01 +08:00



Silero VAD - pre-trained enterprise-grade Voice Activity Detector

The deployment model comes from the silero-vad project.

Key Features

  • Stellar accuracy

Silero VAD has excellent results on speech detection tasks.

  • Fast

Processing one audio chunk (30+ ms) takes less than 1 ms on a single CPU thread. Batching or running on a GPU can also improve performance considerably.

  • General

Silero VAD was trained on huge corpora covering over 100 languages, and it performs well on audio from different domains with varying background noise and quality levels.

  • Flexible sampling rate

Silero VAD supports 8000 Hz and 16000 Hz sampling rates.
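Since the model works on short fixed-length chunks at one of the two supported sampling rates, input audio usually has to be cut into chunks before inference. The following is a minimal sketch of that step, not code taken from FastDeploy or silero-vad: the helper names `chunk_size_in_samples` and `split_into_chunks` are hypothetical, and the 32 ms chunk duration is an assumed value chosen only to match the "30+ ms" figure above.

```python
from typing import List, Sequence

# Sampling rates this README lists as supported by Silero VAD.
SUPPORTED_SAMPLE_RATES = (8000, 16000)


def chunk_size_in_samples(sample_rate: int, chunk_ms: int = 32) -> int:
    """Return the number of samples in one chunk of `chunk_ms` milliseconds.

    The 32 ms default is an assumption for illustration only.
    """
    if sample_rate not in SUPPORTED_SAMPLE_RATES:
        raise ValueError(f"unsupported sampling rate: {sample_rate} Hz")
    return sample_rate * chunk_ms // 1000


def split_into_chunks(samples: Sequence[float], sample_rate: int,
                      chunk_ms: int = 32) -> List[List[float]]:
    """Split audio into equal-length chunks, zero-padding the last one."""
    size = chunk_size_in_samples(sample_rate, chunk_ms)
    chunks = []
    for start in range(0, len(samples), size):
        chunk = list(samples[start:start + size])
        chunk.extend([0.0] * (size - len(chunk)))  # pad a short final chunk
        chunks.append(chunk)
    return chunks


if __name__ == "__main__":
    one_second = [0.0] * 16000  # one second of silence at 16000 Hz
    chunks = split_into_chunks(one_second, 16000)
    print(len(chunks), len(chunks[0]))  # prints "32 512"
```

Under these assumptions a chunk holds 512 samples at 16000 Hz and 256 samples at 8000 Hz. Audio recorded at any other rate would need resampling first.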

Download Pre-trained ONNX Model

For testing, the ONNX model exported from Silero VAD is provided below and can be downloaded directly.

| Model | Size | Notes |
|:--|:--|:--|
| silero-vad | 1.8MB | This model file is sourced from snakers4/silero-vad, MIT License |

Detailed Deployment Documents

Source

https://github.com/snakers4/silero-vad