Skip to content Skip to main navigation Skip to footer

Basic package for desktop contains the following components, technologies & models

  • Speech Engine (SPE v3.45.4) – technologies included:
    • Speech to Text (STT) – model EN_US_6 (US English)
    • Keyword Spotting (KWS) – model EN_US_6 (US English)
    • Speaker Identification 4 (SID4) – model XL4
    • Speaker Diarization (DIAR) – model XL4
    • Language Identification (LID) – model L4
    • Gender Identification (GID) – model XL4
    • Age Estimation (AGE) ) – model L4
    • Voice Activity Detection (VAD) – model GENERIC_3
    • Speech Quality Estimator (SQE)
    • Phoneme Recogniser (PHNREC) – model EN_US_6 (US English)
    • Time Analysis Extraction (TAE)
    • Waveform Denoiser (DENOISER)
  • Phonexia Browser (BROWSER v3.45.1)
  • Reporting and Licensing Server (RLS v0.12.1)
  • example audio
    (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/)