Search: confusion network

6 results

STT: What is Words-To-Numbers feature and how to use it

…variants are provided), for both file and stream transcription. The reason for not having it available in the word-level outputs (One-best, Confusion Network) is that it would create difficulties in stream transcription – as new words keep coming, they may potentially change the previous output: two… → 2; two thousand… → 2000; two thousand twenty… → 2020; two thousand twenty one → 2021 And…

Releases and Changelogs (Browser)

…License system failure (1303) on Windows with NET or FLS-distributed licenses when expiration of the license was in year 2038 or later Phonexia Browser 3.45.1, BSAPI 3.45.4 (2021-12-15) Fixed: Error when loading STT confusion network from SPE Improved: Server information dialog now shows ‘unlimited’ value for file size limits instead of ‘0 B’ Phonexia Browser 3.45.0, BSAPI 3.45.0 (2021-10-08) New:…

STT: Results explained

…(-inf,+inf) interval. Confidence is a normalized probability from the [0,1] interval. Multiplying the value by 100 gives the confidence percentage. Confusion network output { "time_slot": 1, "start_time": 1500000, "end_time": 3600000, "word": "I", "posterior_probability": 0.986744345985676, "channel": } Confusion networks can be used for further processing in a similar way to the n-best output. A time slot is a period of time for which…
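
As a rough illustration of the snippet above, the following Python sketch groups confusion network entries by their time slot and expresses each posterior probability as a percentage. The field names are taken from the sample entry in the snippet; the sample data (apart from the first entry), the helper names `group_by_time_slot` and `confidence_percent`, and the assumption that the entries arrive as a flat list of dictionaries are all hypothetical and not confirmed by the source.

```python
from collections import defaultdict

# Hypothetical sample data: only the first entry mirrors the snippet above;
# the remaining entries are invented for illustration.
entries = [
    {"time_slot": 1, "word": "I", "posterior_probability": 0.986744345985676},
    {"time_slot": 1, "word": "eye", "posterior_probability": 0.013},
    {"time_slot": 2, "word": "see", "posterior_probability": 0.75},
    {"time_slot": 2, "word": "sea", "posterior_probability": 0.25},
]


def group_by_time_slot(entries):
    """Collect the word alternatives that share the same time slot."""
    slots = defaultdict(list)
    for entry in entries:
        slots[entry["time_slot"]].append(entry)
    return dict(slots)


def confidence_percent(probability):
    """Multiply a [0,1] probability by 100 to get a percentage."""
    return probability * 100


if __name__ == "__main__":
    for slot, alternatives in sorted(group_by_time_slot(entries).items()):
        for alt in alternatives:
            percent = confidence_percent(alt["posterior_probability"])
            print(f"slot {slot}: {alt['word']} ({percent:.1f} %)")
```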

Speech to Text (STT)

…the word correctly and when the technology evaluates the best result as not matching what was really said. The confusion network result type provides output similar to n-best, except that segments are returned word by word. A confusion network is used in the same way as n-best. Training of new models To create a new STT model, about 100…

Releases and Changelogs (SPE)

…to X-SAMPA) Fixed: Cannot create new output stream due to hanging unfinished tasks Fixed: Task is not removed from pool when result is delivered via Webhook Fixed: Some log messages contain format placeholder instead of numbers Fixed: Missing <silence/> label in STT confusion network output Fixed: STT confusion network contains <silence/> tags with confidence greater than 1.0 Fixed: Diarization crashes…

STT: How to properly convert Confusion Network results to One-best

Confusion Network output is the most detailed Speech Engine STT output, as it provides multiple word alternatives for individual time slots of the processed speech signal. Therefore, many applications want to use it as the main source of speech transcription and perform any conversion to less verbose output formats internally. This article provides the recommended way to do the conversion. Time slots and…
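
The snippet above is cut off before the recommended conversion is described, so the following Python sketch shows only a naive approach and should not be read as the article's method: for each time slot, keep the alternative with the highest posterior probability. The function name `to_one_best`, the sample data, and the choice to drop slots whose best alternative is a `<silence/>` label are assumptions made for illustration.

```python
def to_one_best(entries, skip_labels=("<silence/>",)):
    """Pick the most probable word per time slot (naive, assumed approach).

    entries: flat list of dicts with "time_slot", "word" and
             "posterior_probability" keys (assumed input shape).
    skip_labels: words to drop from the result when they win a slot
                 (assumption: silence should not appear in the transcript).
    """
    best = {}
    for entry in entries:
        slot = entry["time_slot"]
        current = best.get(slot)
        if current is None or (
            entry["posterior_probability"] > current["posterior_probability"]
        ):
            best[slot] = entry
    # Emit the winners in time slot order, leaving out skipped labels.
    return [
        best[slot]["word"]
        for slot in sorted(best)
        if best[slot]["word"] not in skip_labels
    ]


# Hypothetical sample data, invented for illustration.
sample = [
    {"time_slot": 1, "word": "I", "posterior_probability": 0.98},
    {"time_slot": 1, "word": "eye", "posterior_probability": 0.02},
    {"time_slot": 2, "word": "<silence/>", "posterior_probability": 0.9},
    {"time_slot": 2, "word": "a", "posterior_probability": 0.1},
    {"time_slot": 3, "word": "see", "posterior_probability": 0.6},
    {"time_slot": 3, "word": "sea", "posterior_probability": 0.4},
]

if __name__ == "__main__":
    print(" ".join(to_one_best(sample)))
```

Passing `skip_labels=()` keeps every winning label, which may be useful when the silence markers are needed for alignment.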