Skip to content Skip to main navigation Skip to footer

Search: score

22 results

Gender Identification (GID)

…generation of XL3 and L3 models) Output scoring: log-likelihood ratio (LLR) and score (0-1). Score can be interpreted as percentage by multiplying the score by 100. Typical use cases: filtering calls by gender, playing advertisement focused on specific gender, getting quick demographic analysis of the recordings. The speed of Gender Identification is up to 150 FtRT (depending on the model)….

Speech Quality Estimation (SQE)

…linear coding), A-law or Mu-law, PCM, 8kHz+ sampling Output global score – percentage expression of audio quality (range <0;100>), by default, the global score is calculated based on waveform_n_bits and waveform_snr variables. pesq – value inspired by PESQ (Perceptual Evaluation of Speech Quality). Value is -0.5 to 4.5, the higher rating, the better quality of the recording. Other important statistics…

FAQs (PSP)

…–spe-output PhxBrowser software will start with ” SPE output” tab which shows debug output of SPE in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: Why does the system show high score (>90%) even for non-targets? A: Threshold for score isn’t set up correctly. Adjust speaker score sharpness value to calibrate the recalculation. Please see Calibration in technology documentation. in…

FAQs (Browser)

…debug output of SPE Linux: Run PhxBrowser software in terminal with command: ./PhxBrowser –spe-debug –spe-output PhxBrowser software will start with ” SPE output” tab which shows debug output of SPE in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: Why does the system show high score (>90%) even for non-targets? A: Threshold for score isn’t set up correctly. Adjust speaker…

STT: How to properly convert Confusion Network results to One-best

…word alternatives: The recommended algorithm for converting Confusion Network (CN) to One-best is as follows: loop through all CN timeslots from start to end in each timeslot, get the input alternative with highest score and if it’s not <null/> or _DELETE_ add the input alternative at the end of your output then, loop through all alternatives in your output for…

Understand SPE configuration file

…cache, if set to 0, cache is disabled server.audio_source_profiles_cache_size = 64 See Audio Source Profiles cache for more detials. server.bsapi_comparator_fa_cache_size # Number of recently used score shifts for false acceptance calibration hold in cache, if set to 0, cache is disabled. Default is 100 000. # Hold per pair of Audio Source Profile with enabled false acceptance calibration and compared…