Search: adding packages

15 results

How to Update Orbis on KVM Without Data Loss

This guide assumes that the customer has a working Orbis virtual machine deployed using the steps described in the How to install Orbis on KVM article. It closely follows that article, except that the existing data disk is kept rather than replaced with a default blank one. Ubuntu 20.04 LTS with the default GUI is being used as…

Understand SPE metafiles

…to be stored in some sort of database managed by that application. But if you want to create just a very simple application, adding a database may be an undesired complication… and the simple option to handle metadata directly in SPE may come in handy. The …../metafile endpoint allows you to manage metadata directly in SPE – use POST, GET or…

Q: What are the requirements for SID evaluation dataset?

…recordings in order to meet the criteria of at least 3 recordings for each speaker is not the right way to proceed. This way you are not adding any details. You are essentially analyzing details of a single recording five times. In contrast, by using 5 unique recordings coming from different audio environments or even different times of the day,…

Q: What are the recommendations for LID adaptation set?

A: The following is recommended. For adding a new language to a language pack: 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% of speech); only 1 language per record. For adapting the existing language model (discriminative training): 10+ hours of audio for each language. May be done on customer site. May be done in…

Speech to Text (STT)

…about the usual contexts where the words occur. Since the 5th generation of STT, we have developed a tool that allows customers to customize language models by adding words specific to their domain or use case. Accuracy: To measure the accuracy of Phonexia Speech to Text, the following points should be taken into account: Reason for the accuracy measurement; What is the…

Q: Which authentication options are allowed by the server and how does it work?

A: The following options are supported: HTTP basic authorization – The client asks for a session via the resource “post /login” with HTTP basic authorization in the query header. If the server responds with error 405, it doesn’t support authorization by sessions and basic authorization must be used. Authorization by session – This is done by adding the parameter “X-SessionID” into HTTP…

Release Notes

…to 81 %. Phoneme Recognition technology is now included with Speech To Text. This should help users with defining pronunciations in preferred phrases, when adding words to a language model, or when creating a customized model. Added grammar rules for words-to-numbers conversion in 3.59.0. Pre-defined rules for English, Spanish and Polish were added. For other languages we provide empty definitions…

LID: Terminology and adaptation

…20 hours of audio is required, see requirements below. Enhancing an existing language model by adding your own audio files to an existing built-in language: at least 5 hours of audio is required, see requirements below. Creating a custom language pack consisting of your chosen set of languages, either pre-trained or created from your audio files. Audio recordings requirements: Format: WAV, FLAC, RAW…

STT: What is Words-To-Numbers feature and how to use it

…can add this functionality to other languages, or tune the existing one, yourself by adding/editing the conversion rules; see below for more details. What is the words-to-numbers feature: The words-to-numbers feature converts raw transcriptions of numbers, dates (or similar patterns like credit card numbers) to their native form: two thousand twenty one ⇒ 2021; fifteen hundred eighty six…

STT: Language Model Customization tutorial

…as a source and creates a new STT model with your customizations included as a target. To see the results of the customizations, you need to use the new STT model for the transcription. Currently supported language model customizations are: adding new words and/or pronunciations. This is intended for adding client-, domain- or product-specific words like company names, product names, component…

FAQs (PSP)

…may use them for both training a new language pack and testing/comparing against an existing language pack. The language-prints need to be compatible only with the model of LID used for language-print extraction. Q: What are the recommendations for LID adaptation set? A: The following is recommended. For adding a new language to a language pack: 20+…

STT: What is Preferred Phrases feature and how to use it

…details in the Adding words to STT language model article. Legacy preferred phrases (SPE 3.32 – 3.42) have a number of limitations: adding words to the dictionary is not supported; only words already known by the language model are allowed in preferred phrases. Phrases containing unknown words are ignored and a warning message is logged to the SPE log. Therefore, to be able…

STT: Adding words to language model on the fly

Adding words to the STT language model on the fly is possible in SPE 3.45 or newer as part of the preferred phrases feature. The POST /technologies/stt and POST /technologies/stt/input_stream API calls serve two purposes: specifying the actual preferred phrases (in the phrases part) and specifying words to be added to the STT language model (in the dictionary part). Each part can be used independently,…

Adding new language or technology model (Browser)

…our example, we are adding a new Spanish model (ES_6 technology model) of Speech to Text and Keyword Spotting (with Phoneme Recognizer). When you install new languages or models, they are turned off by default and need to be enabled in Phonexia Browser. To turn new models on, open Phonexia Browser: go to Settings; switch to the Speech Engine tab; open STT…

Releases and Changelogs (SPE)

…STT: Faster initialization of “dynamic adding of words” feature (now ~0.1 s, before ~0.5 s). New: phxadmin2 added to installation manual. New: Models for STT and KWS with features of CS_CZ_6 (new VAD generation, dynamic adding of words in preferred phrases, increased transcription precision via updated decoder): TR_TR_6, SK_SK_6, FA_6. Improved: Models for STT and KWS, updated and aligned with…