mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-10-12 03:50:39 +08:00

Files

Qianhe Chen 09ec386e8d [Bug Fix] Fix speech and silence state transition in VAD (#1937 )

* Fix speech and silence state transition

* Fix typo

* Fix speech and silence state transition

---------

Co-authored-by: DefTruth <31974251+DefTruth@users.noreply.github.com>

2023-05-16 18:50:04 +08:00

cpp

[Bug Fix] Fix speech and silence state transition in VAD (#1937 )

2023-05-16 18:50:04 +08:00

README_CN.md

[Model] Add Silero VAD example (#1107 )

2023-01-15 14:42:01 +08:00

README.md

[Model] Add Silero VAD example (#1107 )

2023-01-15 14:42:01 +08:00

README.md

English | 简体中文

Silero VAD - pre-trained enterprise-grade Voice Activity Detector

The deployment model comes from silero-vad

Key Features

Stellar accuracy

Silero VAD has excellent results on speech detection tasks.

Fast

One audio chunk (30+ ms) takes less than 1ms to be processed on a single CPU thread. Using batching or GPU can also improve performance considerably.

General

Silero VAD was trained on huge corpora that include over 100 languages and it performs well on audios from different domains with various background noise and quality levels.

Flexible sampling rate

Silero VAD supports 8000 Hz and 16000 Hz sampling rates.

Download Pre-trained ONNX Model

For developers' testing, model exported by VAD are provided below. Developers can download them directly.

模型	大小	备注
silero-vad	1.8MB	This model file is sourced from snakers4/silero-vad，MIT License

Detailed Deployment Documents

C++ deployment

Source

https://github.com/snakers4/silero-vad

README.md Unescape Escape

Silero VAD - pre-trained enterprise-grade Voice Activity Detector

Key Features

Download Pre-trained ONNX Model

Detailed Deployment Documents

Source

README.md