[P] fast-vad: a very fast voice activity detector in Rust with Python bindings.

Repo: https://github.com/AtharvBhat/fast-vad

I needed something comparable to existing open-source VADs in quality, but with a strong emphasis on speed, simple integration, and streaming support. To my knowledge it’s the fastest open-source VAD out there.

Highlights: – Rust crate + Python package – batch and streaming/stateful APIs – built-in modes for sensible defaults – configurable lower-level knobs if you want to tune behavior yourself

It’s a simple logistic regression that operates on frame based features to keep it as fast as possible. It was trained using libriVAD dataset ( small version )

If anyone works on Audio, do try it out and let me know how it goes !

Feedback would be helpful 🙂

submitted by /u/AtharvBhat
[link] [comments]

Liked Liked