Skip to content Skip to main navigation Skip to footer

Search: adaptation

7 results

FAQs (PSP)

…Browser, FAQ Speech Platform Permalink Q: What is the difference between on-the-fly and off-line type of speech to text transcription (STT)? A: Similarly as human, the ASR (STT) engine is doing the adaptation to an acoustic channel, environment and speaker. Also the ASR (STT) engine is learning more information about the content during time, that is used to improve recognition….

Speech to Text (STT)

…data are similar to desired usage of resulting technology model, which is usually spontaneous speech. However as it is complicated to obtain such amount of data of this type, also other sources are used. Adaptation The technology can be adapted in two levels – in the Acoustic Model or the Language Model. Adapting the Acoustic Model to speakers from a…

Speaker Identification

…Mean normalization. Gathering an adaptation set (100-150 random recordings from the Customer’s environment, real production data; 60+ second of net speech for each recording) Preparing the Audio Source Profile (ASP) (done by Phonexia) Setting up a proper threshold – point 1 (done by Phonexia) Adaptation for False Acceptance Rate (FAR) The final step in the calibration. If the customer wants…

Language Identification (LID)

…actually expected in your use case. This process of tailoring the language pack for particular needs is called language pack adaptation and is described in LID: Terminology and adaptation article. Example usages of custom language packs Law enforcement agency monitoring a network of criminals using only a particular set of languages can use the approach of keeping only languages expected…

Q: What are the recommendations for LID adaptation set?

A: The following is recommended: For adding new language to language pack 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% of speech) Only 1 language per record For adapting the existing language model (discriminative training) 10+ hours of audio for each language May be done on customer site. May be done in…

LID: Terminology and adaptation

…to train a language using just a few and long audio files (like 5 files, 1 hour each) Acoustic channels should be as close as possible to channel of intended deployment Adaptation using REST API (SPE 3.38 or newer) SPE 3.38 and newer include LID adaptation tasks in REST API, which makes the adaptation significantly easier than in previous versions….