Skip to content Skip to main navigation Skip to footer

Search: score

21 results

STT: Results explained

…milliseconds. Score is logarithm of probability from {-inf,0} interval – the higher score, the higher probability that the word was spoken in that time interval. Confidence is a probability from {0,1} interval. It’s calculated from the score value using e score formula. Multiplying the value by 100 gives the confidence percentage. NOTE: Some ancient legacy models do not support confidence….

Q: What do LLR, LR and score mean?

…function of LR. LLR meets numbers in interval (-inf;+inf). Percentage (normalised) score – commonly used mathematical transformation of the LLR to percentage. This number is better for human readability but may bring some doubts if LLR numbers are too high (typically for some non-adapted installations). Interval <0;100> (or sometimes <0;1>), in %. The higher the score, the better the match….

KWS: Results explained

…threshold. … { “channel_id”: 0, “score“: 4.5108547, “confidence”: 0.9891304, “start”: 171400000, “end”: 175900000, “word”: “sale_0” }, { “channel_id”: 0, “score“: -1.5344038, “confidence”: 0.17735027, “start”: 246900000, “end”: 251700000, “word”: “sale_1” }, { “channel_id”: 0, “score“: 2.1896133, “confidence”: 0.89931285, “start”: 284100000, “end”: 291000000, “word”: “brazil_0” }, { “channel_id”: 0, “score“: 0.9341812, “confidence”: 0.7179228, “start”: 294900000, “end”: 300600000, “word”: “machine_0” } …  …

Speaker Identification (SID)

…existing enrollment voiceprints and the system returns a score for each comparison. The score is produced by comparing two voiceprints using Probabilistic Linear Discriminant Analysis (PLDA). Scoring and conversion to percentage Score produced by comparing two voiceprints is an estimate of the probability (P), that we get the given evidence (the compared voiceprints) if the speakers in the two voiceprints…

Language Identification (LID)

…LID score to percentage, use e score * 100 formula) LID adaptation (custom language packs) The scoring principle described above implies that score is distributed among all languages in a language pack. It means that every language has to score with non-zero value… i.e. that the scores may get diluted as they get spread among many languages. Additionally, if the…

Releases and Changelogs (VIN)

…Target score distribution Fixed: Population Set selected correctly even if renamed in the selection window Improved: Speech length display in the case view: added “Unlimited” option to display the speech length permanently Improved: SID Evidence score aligned with Speech Engine output of SID score Removed: Speech length compensation Voice Inspector 5.1 Voice Inspector 5.1.0, BSAPI 3.60.0 (2023-12-07) New: A generalized…

Age Estimation (AGE)

…age estimated by the system. { “result”: { “version”: 2, “name”: “AgeEstimationResult”, “file”: “/kelly_2.wav”, “model”: “L”, “channel_scores”: [ { “channel”: 0, “scores“: [ { “name”: “0”, “score“: }, { “name”: “1”, “score“: }, . . . { “name”: “41”, “score“: 1 }, { “name”: “42”, “score“: }, . . . In order to achieve the most representative results possible, a…

Releases and Changelogs (SPE)

…actual score for recordings containing less than 3 seconds of speech (it returned score -9999 in versions 3.55 to 3.57) NOTE: It is still strongly recommended to NOT rely on results from such very short recordings. Only longer recordings give results with appropriate confidence. Improved: Better Windows version detection Fixed: Generic model for SQE may fail when processing very short…

SID: Speaker Identification: Results Enhancement

…User Calibration enables the user to manually add custom variables “shift” and “scale” to the profile. The final score of any comparison using this profile is then adjusted according to this formula: adjusted_score = scale × (original_score + shift) To be able to determine proper shift and scale factors, SID Evaluation must be performed by a trained Phonexia Voice Biometrics…

Speech to Text (STT)

…segments including always one word. Every such segment provides information about start and end, the transcribed word and a score. n-best result provides several alternatives for sentences or bigger segments of speech with its score. It can be useful for analytics programs which can take more input and work on it. It can be used when speaker does not pronounce…

Releases and Changelogs (Browser)

…when an error occurs, but view all errors and continue creating the evaluation set Fixed: SID Evaluator – invalid GID score values Fixed: SID Evaluator – missing SQE information in report Fixed: SID Evaluator – don’t save disabled recordings to evaluation set Phonexia Browser 3.40.3, BSAPI 3.40.4 (2021-05-28) Fixed: Some minor bugs in licensing system Phonexia Browser 3.40.2, BSAPI 3.40.2…

Release Notes

…speech being detected in the audio. When estimating PESQ score using Speech Quality Estimation (SQE), non-speech parts of audio are ignored (since 3.59.0) This improvement helps getting more realistic PESQ scores with audio containing significant amount e.g. silence or ringtones as these get filtered out by internal voice activity detection. Phonexia Browser Transcriptions can be saved to a text file,…

Keyword Spotting (KWS)

…takes some time – the more pronunciationless keywords in the list, the longer delay occurs before the processing. When keyword list has pronunciations defined for each keyword, even thousands of defined keywords have no impact on performance. Technology searches the recording and returns the list of found keywords, together with score and confidence for each found keyword. The score is…