Skip to content Skip to main navigation Skip to footer

Voice Activity Detection (VAD)

Voice Activity Detection is a language-, domain- and channel-independent technology that identifies parts of audio recordings with speech content vs. non-speech content. It creates labels for speech and other signals in the recording; this can then serve as a decision point whether to process the recording by other technologies or not. VAD is usually part of rapid filtration process in deployment.

Typical use cases are:

  • detection of present or absent human speech for voice processing,
  • filtering non-speech parts of the recording,
  • filtering out recordings with not enough net speech to be processed by other technologies
  • voice activated process, etc.

The speed of Voice Activity Detection is 140 ftRT per one instance.

Privacy Preference Center

Necessary

Required cookies required for proper function of Word Press publication platform.

gdpr*, wordpress*,cf7*,wp-settings*,PHPSESSID

Analytics

We are using Google Analytic in Global Site Tag configuration for keeping site content optimized for great user experience. No personal data are sent.

_ga*,_gid