
Search: score transformation

22 results

Release Notes

…speech being detected in the audio. When estimating PESQ score using Speech Quality Estimation (SQE), non-speech parts of audio are ignored (since 3.59.0). This improvement helps to get more realistic PESQ scores for audio containing a significant amount of, e.g., silence or ringtones, as these get filtered out by internal voice activity detection. Phonexia Browser Transcriptions can be saved to a text file,…

Keyword Spotting (KWS)

…takes some time – the more keywords without pronunciations in the list, the longer the delay before processing starts. When the keyword list has pronunciations defined for each keyword, even thousands of defined keywords have no impact on performance. The technology searches the recording and returns the list of found keywords, together with a score and confidence for each found keyword. The score is…

Gender Identification (GID)

…generation of XL3 and L3 models) Output scoring: log-likelihood ratio (LLR) and score (0-1). The score can be interpreted as a percentage by multiplying it by 100. Typical use cases: filtering calls by gender, playing advertisements targeted at a specific gender, getting a quick demographic analysis of the recordings. The speed of Gender Identification is up to 150 FtRT (depending on the model)….

Speech Quality Estimation (SQE)

…linear coding), A-law or Mu-law, PCM, 8kHz+ sampling Output: global score – percentage expression of audio quality (range <0;100>); by default, the global score is calculated based on the waveform_n_bits and waveform_snr variables. pesq – value inspired by PESQ (Perceptual Evaluation of Speech Quality). The value ranges from -0.5 to 4.5; the higher the rating, the better the quality of the recording. Other important statistics…

STT: How to properly convert Confusion Network results to One-best

…word alternatives: The recommended algorithm for converting Confusion Network (CN) to One-best is as follows: loop through all CN timeslots from start to end; in each timeslot, get the input alternative with the highest score and, if it’s not <null/> or _DELETE_, add the input alternative at the end of your output; then, loop through all alternatives in your output for…
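
The first loop described in this snippet can be sketched roughly as follows. This is only an illustration under stated assumptions: the data layout (a list of timeslots, each a list of dicts with `word` and `score` keys) and the function name `cn_to_one_best` are hypothetical and do not reflect the actual result format. The second loop is cut off in the snippet, so it is not implemented here.

```python
from typing import Dict, List, Union

# Hypothetical representation of one alternative: {"word": str, "score": float}
Alternative = Dict[str, Union[str, float]]

# Tokens that the snippet says must not be copied to the output.
SKIP_TOKENS = {"<null/>", "_DELETE_"}


def cn_to_one_best(timeslots: List[List[Alternative]]) -> List[Alternative]:
    """Pick the highest-scoring alternative per timeslot, skipping null/delete tokens.

    Covers only the first loop of the described algorithm; the follow-up
    pass over the output is truncated in the source and is left out.
    """
    output: List[Alternative] = []
    for slot in timeslots:  # timeslots are assumed to be ordered start to end
        if not slot:
            continue  # assumption: an empty timeslot contributes nothing
        best = max(slot, key=lambda alt: alt["score"])
        if best["word"] not in SKIP_TOKENS:
            output.append(best)
    return output


# Made-up example data, for illustration only.
example_cn = [
    [{"word": "hello", "score": 0.9}, {"word": "<null/>", "score": 0.1}],
    [{"word": "<null/>", "score": 0.7}, {"word": "a", "score": 0.3}],
    [{"word": "world", "score": 0.6}, {"word": "word", "score": 0.4}],
]
one_best_words = [alt["word"] for alt in cn_to_one_best(example_cn)]
```

Note that when the best alternative in a timeslot is `<null/>` or `_DELETE_`, the timeslot is dropped entirely rather than falling back to the second-best alternative, which is how the snippet's wording reads.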

Understand SPE configuration file

…cache, if set to 0, cache is disabled server.audio_source_profiles_cache_size = 64 See Audio Source Profiles cache for more details. server.bsapi_comparator_fa_cache_size # Number of recently used score shifts for false acceptance calibration held in cache, if set to 0, cache is disabled. Default is 100 000. # Held per pair of Audio Source Profile with enabled false acceptance calibration and compared…