Search: Language models

34 results

Installation of Phonexia Browser

Some packages are distributed with only a limited set of speech technologies and languages, or without speech technologies. First installation: Our software is distributed as a ZIP file. The installation procedure is as simple as: unzip the archive; paste additional KWS, STT… models; paste the license.dat file to the root directory where you have the BROWSER folder and the run_browser(.exe) script; run the…

Age Estimation (AGE)

Phonexia Age Estimation (AGE) estimates the age of a speaker from an audio recording or voiceprint. Technology: Trained with emphasis on spontaneous telephony conversation. The technology is language-, accent-, text-, and channel-independent. Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input Audio: WAV or RAW (8 or 16 bits linear…

Gender Identification (GID)

Gender Identification is a language-, domain- and channel-independent technology that uses the acoustic characteristics of the recording to determine the gender of the speaker in question. This technology is able to distinguish between two genders: Male (M) and Female (F). Minimum of speech signal for identification: 7+ sec recommended (with XL4 and L4 model (9+ sec for previous generation of…

STT: Results explained

…milliseconds. Score is the logarithm of a probability, from the (-inf, 0] interval – the higher the score, the higher the probability that the word was spoken in that time interval. Confidence is a probability from the [0, 1] interval. It’s calculated from the score value using the formula e^score. Multiplying the value by 100 gives the confidence percentage. NOTE: Some ancient legacy models do not support confidence….
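The score-to-confidence relation described in this snippet can be illustrated with a minimal Python sketch. It assumes the score is a natural-logarithm probability (which the e^score formula implies); the function names `score_to_confidence` and `confidence_percent` are hypothetical helpers for illustration and are not part of any Phonexia API.

```python
import math


def score_to_confidence(score: float) -> float:
    """Convert a log-probability score (<= 0) to a confidence in [0, 1].

    Assumption: the score is a natural logarithm, so confidence = e^score.
    """
    if score > 0:
        raise ValueError("a log-probability score must not be positive")
    return math.exp(score)


def confidence_percent(score: float) -> float:
    """Express the confidence as a percentage (confidence * 100)."""
    return 100.0 * score_to_confidence(score)


if __name__ == "__main__":
    # Hypothetical example scores, not taken from real STT output.
    for s in (0.0, -0.1, -1.0, -5.0):
        print(f"score={s:6.2f}  confidence={confidence_percent(s):6.2f}%")
```

A score of 0 corresponds to 100 % confidence, and the confidence decays exponentially as the score becomes more negative.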

Understand SPE home directory

…Data: The data directory holds additional data files for entities created by that user – e.g. SID Speaker Models, or LID language packs. If no such entities exist for that user, this directory is empty. Unlike the storage, the content of this directory is intended to be manipulated by SPE only and should not be manipulated directly on the filesystem level….

Download Speech Platform

…only English models for Speech To Text and Keyword Spotting. Additional supported languages are available upon request. Speech Engine – technologies included: Speech To Text (STT) – model EN_US_6 (US English) Keyword Spotting (KWS) – model EN_US_6 (US English) Phoneme Recognizer (PHNREC) – model EN_US_6 (US English) Speaker Identification 4 (SID4) – model…

Phonexia Speech Engine

…main binary file itself, SPE requires a database, which might be SQLite (delivered inside the Phonexia package) or MySQL. No other components are needed. Structure of technologies and technology models: From the technical point of view, every technology can work with different technology modules. These are various languages for STT (CS_CZ4, EN_US4), or various sizes for SID (L3, XL3). Technology can work…

Releases and Changelogs (VIN)

…BSAPI 3.40.13 (2022-07-15): New: Added Romanian to the default set of available languages. New: Added the ability to define a custom language in the speaker metadata. Fixed: When discarding a changed photo, the confirmation dialog “Do you want to save…” popped up infinitely. Fixed: Missing file names when the SID Evaluator evaluates speakers from the workspace. Fixed: Unwanted extra comparisons when…

STT: What is Words-To-Numbers feature and how to use it

This article explains the details of the new STT feature for native transcription of numbers and dates in n-best output and gives some tips for fine-tuning the results. NOTE: The feature works out-of-the-box in the following STT languages and models: English – EN_US_6 and EN_US_A_6; Spanish – ES_6; Polish – PL_PL_6; Czech – CS_CZ_5 and CS_CZ_6; Slovak – SK_SK_5 and SK_SK_6. You…

Understand SPE metafiles

…separate files. Another example would be the information about the content of a created LID language pack – if the LID language pack is successfully created, SPE creates a metafile named report, which contains detailed information about the source files used for the language pack creation. See the LID language pack creation REST endpoint documentation for more details about the report metafile content….

Understand SPE user accounts

…not visible to SPE and to the account. Similar trickery can be done with the data directory, allowing you to share LID language models and language packs, or SID speaker models, etc. between accounts. User accounts management: SPE user accounts can be managed using the REST API (see the Administration section of the API documentation), or using the command line administration utilities phxadmin or…

Understand SPE benchmark

…if such a directory is found, audio files from that directory are used (expecting that the audio contains speech in the corresponding language). If not found, it falls back to the default directory. The reason for language-specific data is that processing audio in a different language than the language for which the model was trained negatively affects the processing speed (basically, the processing…

Arabic dialects in Phonexia LID and STT

The Arabic language has (a) one standardised variety, and (b) many non-standard varieties (dialects). In this article, our linguistic team explains the differences between Modern Standard Arabic and Arabic dialects in the context of Phonexia Arabic models. Standard variety: Modern Standard Arabic (MSA). All Arabs learn it at school (not from their parents, so we cannot say it is their native variety)…

FAQs (Browser)

…Browser. Q: What languages do you offer? It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) are available for 20+ languages including English, French, German, Russian, Spanish and many more. Q: What…

Speech Engine update

…technology models configuration; usually introduces new features or major fixes, which may change communication between server and client, or other changes which may affect customer processes; can also include new technology models; with such an update you can add only the new technology, without SPE installation. Upgrade: changes the first version number (e.g. x.y.z to x+1) and is a major change…