Search Results for: stt

Results 11 - 20 of 25 Page 2 of 3
Results per-page: 10 | 20 | 50 | 100

WER

     Posted on: 2018-02-01

Word Error Rate – metrics for STT/LVCSR accuracy measurement

STT

     Posted on: 2018-02-01

Phonexia Speech To Text, sometime also as Speech Transcription Technology (LVCSR based ASR technology)

LM

     Posted on: 2018-02-01

Language Model (“vocabulary” in STT technology)

ASR

     Posted on: 2018-02-01

Automatic Speech Recognition (several technologies possible see LVCSR, STT or KWS)

Difference between on-the-fly and off-line type of transcription (STT)

     Posted on: 2017-12-11

Similarly as human, the ASR (STT) engine is doing the adaptation to an acoustic channel, environment and speaker. Also the ASR (STT) engine is learning more information about the content during time, that is used to improve recognition. The dictate engine, also known as on-the-fly transciption, does not look to the future and has information about just a few seconds of speech at the beginning of recordings. As the output is requested immediately during processing of the audio, recording engine can't predict what will come in next seconds of the speech. When access to the whole recording is granted during off-line transcription…

Q: What languages do you offer?

     Posted on: 2017-09-07

It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 30+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) for 10+ including English, French, German, Russian or American Spanish.

Terminology

     Posted on: 2017-06-15

Document which briefly describes processes and relations in Phonexia Technologies with consideration on correct word usage.   SID - Speaker Identification Technology (about SID technology) which recognize the speaker in the audio based on the input data (usually database of voiceprints). XL3, L3,L2,S2 - Technology models of SID. Speaker enrollment - Process, where the speaker model is created (usually new record in the voiceprint database). Speaker model: 1/ should reach recommended minimums (net speech, audio quality), 2/ should be made with more net speech and thus be more robust. The test recordings (payload) are then compared to the model (see…

Speech Analytics Course (technical training)

     Posted on: 2017-05-18

The Speech Analytics course consists of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part of the course is obligatory)   SAL course Required time [h] Block name Block description YES 0,5 Intro & Phonexia Portfolio Intro & Phonexia Portfolio YES 0,5 Project focus – Explain basic needs Discussion of partner project focused mainly on finalizing the training topics and agenda. YES 0,75 Application Design & Development – Licensing Presentation of types of licensing, and how to use the license file. YES 0,75 Technologies – Data gathering and Quality measurement – basic Description of…

Voice Biometrics Course (technical training)

     Posted on: 2017-05-18

The Voice Biometrics course consist of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part is mandatory for course)   VBS course Required time [h] Block name Block description YES 0,5 Intro & Phonexia Portfolio Intro & Phonexia Portfolio YES 0,5 Project focus - Explain basic needs Partner project related discussion focused mainly to finalizing training topics and agenda YES 0,75 Apps Designing and Developing - Licensing Gives trainee knowledge about type of licensing, and how to use the license file YES 0,75 Technologies - Data gathering and Quality measurement - basic Data gathering…