Search: training new language

13 results

Releases and Changelogs (SPE)

…need for command line tools… finally!) New: LID Language Packs allow to store meta-files New: New entity “LID Language Model” (equivalent of *.lpa LanguagePrint Archive) Improved: Updated STT model RU_RU_A to version 4.6.0 of (updated language model) Removed: Support for RLS-enforced licences in command line applications Removed: FeaturePasterRepeat warning on null/empty repeat vector Speech Engine 3.37 Speech Engine 3.37.1, DB…

LID: Terminology and adaptation

…you audio files using GET /technologies/languageid/extractlp endpoint Create new (yet empty) language model using POST /technologies/languageid/languagemodels/{name} endpoint Upload languageprint– or languageprint archive file to the language model using POST /technologies/languageid/languagemodels/{name}/file endpoint repeat this upload for all necessary files – e.g. when creating completely new language model from your own audio files, this would be hundreds or thousands of files (see…

Speech to Text (STT)

…specific region, or using a specific dialect, actually means creation of a new acoustic model. If there are not enough data to train a completely new model, the available new data can be added to the data used for training of the existing model. Based on these two data sets a new, more robust model can be created. However this…

FAQs (PSP)

…may use them for both training a new language pack and testing/comparing against an existing language pack. The language-prints need to be compatible only with the model of LID used for language-print extraction. in FAQ Speech Platform Permalink Q: What are the recommendations for LID adaptation set? A: The following is recommended: For adding new language to language pack 20+…

STT: What is Preferred Phrases feature and how to use it

…a decoder. The decoder uses the information from acoustic model, combines it with information from language model recognition network (which describes the statistics about word grouping and sentences of a given language) and provides the transcription output. (See the Speech To Text article for more details about speech transcription principles) When using preferred phrases, we build additional language model…

SPE and Browser installation: standalone SPE

…merging the contents of two packages into one. The additional languages are provided upon request by Phonexia sales representative. If you do not have the languages you want to test, contact our sales to arrange the cooperation. Download the files with additional languages locally and unzip them. Then copy the additional languages over to where you saved the default Evaluation…

Q: Do the language-prints (LPs) extracted from audio sources depend on the currently available language pack?

A: The language-prints do not depend on the current language pack used. You may use them for both training a new language pack and testing/comparing against an existing language pack. The language-prints need to be compatible only with the model of LID used for language-print extraction….

Arabic dialects in Phonexia LID and STT

…TEXT (used for STT language model training) MSA is used in all formal writing such as official correspondence, literature, newspapers, webpages so there is no problem to accumulate loads of texts, but it will be more formal and far from spontaneous speech Support for MSA in Phonexia products Name LID L4 STT Description Arabic (MSA) arb — Modern Standard Arabic,…

Speech Engine update

…technology models configuration usually introduces new features or major fixes, which may change communication between server and client, or other changes which may affect customer processes can also include new technology models; with such update you can add only the new technology, without SPE installation Upgrade changes the first version number (e.g. x.y.z to x+1) and is a major change…

SID: Speaker Identification: Results Enhancement

…language. We have never seen this data during SID training so it is a sensible thing to calibrate the system. Since there is only a single source of data (telephony) and only a single language (Wakandan), one can assume that it is enough to create a single profile and use it for both sides of the comparison. We are monitoring…

Q: What are the recommendations for LID adaptation set?

A: The following is recommended: For adding new language to language pack 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% of speech) Only 1 language per record For adapting the existing language model (discriminative training) 10+ hours of audio for each language May be done on customer site. May be done in…

SPE and Browser installation: embedded SPE

…the cooperation. 3. Optional: add additional languages If you are going to test additional languages besides the default English, present in the Phonexia Evaluation package, you need to perform a simple operation of merging the contents of two packages into one. The additional languages are provided upon request by Phonexia sales representative. If you do not have the languages you…

Speaker Diarization (DIAR)

Speaker Diarization labels segments of the same voice(s) in one mono-channel audio record based by the individual speaker´s voice. It is a language-, domain- and channel-independent technology. It performs not only the segmentation of speakers but of technical signals and silence as well. The outputs of the technology can be both log files with labels and/or split audio files/one new…