Search Results for: ASR

Results 1 - 6 of 6 Page 1 of 1
Results per-page: 10 | 20 | 50 | 100

Voice Inspector – Interpretation of results

     Posted on: 2019-06-24

Introduction Phonexia Voice Inspector (VIN) is a tool for forensic automatic speaker identification, compliant with the Methodological Guidelines for Best Practice in Forensic Semiautomatic and Automatic Speaker Recognition, published by the European Network of Forensic Science Institutes.  This post explains individual SID score types and ways to visualize the results in a speaker identification case implemented in Voice Inspector. Evidence In VIN, the term evidence has two meanings. In general, it refers to any SID score that the system calculates for any pair of recordings in the case. These scores are the output of the Phonexia SID technology which runs…

STT

     Posted on: 2018-02-01

Phonexia Speech To Text, sometime also as Speech Transcription Technology (LVCSR based ASR technology)

PHR

     Posted on: 2018-02-01

Phoneme recognizer – currently part of Keyword Spotting (Phonexia Keyword Spotting - acoustics based ASR, several tec...) technology in Phonexia Speech Engine  (REST Application Program Interface)

KWS

     Posted on: 2018-02-01

Phonexia Keyword Spotting - acoustics based ASR, several technologies possible, language dependent

ASR

     Posted on: 2018-02-01

Automatic Speech Recognition (several technologies possible see LVCSR, STT or KWS)

Difference between on-the-fly and off-line type of transcription (STT)

     Posted on: 2017-12-11

Similarly as human, the ASR (STT) engine is doing the adaptation to an acoustic channel, environment and speaker. Also the ASR (STT) engine is learning more information about the content during time, that is used to improve recognition. The dictate engine, also known as on-the-fly transciption, does not look to the future and has information about just a few seconds of speech at the beginning of recordings. As the output is requested immediately during processing of the audio, recording engine can't predict what will come in next seconds of the speech. When access to the whole recording is granted during off-line transcription…