Search: age estimation

96 results

LID: Terminology and adaptation

…archives use lid instead of phxcmd lid for language pack training. Creating new language. STEP 1: Extract languageprints from recordings using the lpextract command. The example below demonstrates commands to extract languageprints from audio recordings in 2 languages, each located in a separate directory: recordings in the first language are located in the /path/to/my/audio/MyLanguage directory; recordings in the second language are located in /other/path/to/audio/MyOtherLanguage…

Releases and Changelogs (SPE)

…LID language pack, hash of the file contained in the custom language pack report is incorrectly calculated (occurs mainly in Windows). Fixed: Items builtin_language_models and custom_language_models in the body of POST /technologies/languageid/languagepacks/{name} are now optional; at least one of them must not be empty. Fixed: Better server response message when language model was not found during creation of new LID…

Language Identification (LID)

Phonexia Language Identification (LID) will help you distinguish the spoken language or dialect. It will enable your system to automatically route valuable calls to your experts in the given language or to send them to other software for analysis. Application areas: preselecting multilingual sources and routing audio files to language-dependent technologies (transcribing, indexing, etc.); analyzing network traffic media (language statistics)…

Release Notes

…Other technologies: New Gender Identification (GID) model XL5 (since 3.56.0). This enables GID to use voiceprints created by the brand new Speaker Identification 4 model XL5. New Age Estimation (AGE) models XL4 and XL5 (since 3.57.0). This enables AGE to use voiceprints created by the Speaker Identification 4 models XL4 and XL5. New Voice Activity Detection (VAD) model SID4_XL5 (since…

FAQs (PSP)

…may use them for both training a new language pack and testing/comparing against an existing language pack. The language-prints need to be compatible only with the model of LID used for language-print extraction. Q: What are the recommendations for an LID adaptation set? A: The following is recommended: For adding a new language to a language pack 20+…

Age Estimation (AGE)

Phonexia Age Estimation (AGE) estimates the age of a speaker from an audio recording or voiceprint. Technology: Trained with emphasis on spontaneous telephony conversation. The technology is language-, accent-, text-, and channel-independent. Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input: Audio: WAV or RAW (8 or 16 bits linear…

Understand SPE directory structure

…└── technologies │ └── tts ├── home │ └── admin │ ├── data │ └── storage ├── lib ├── log ├── settings └── shared. bsapi: The bsapi directory contains the BSAPI core subsystem, i.e. all files and data of the speech technologies themselves. This directory contains a separate subdirectory for each technology included in the distribution package. The number of subdirectories depends…

STT: Adding words to language model on the fly

…following internal linguistic rules for the given STT language. Still, the automatically generated pronunciation may not be in line with expectations, especially for foreign words (due to pronunciation differences between the word’s native language and the STT language). Therefore it is recommended to define the pronunciations explicitly, to help prevent mistranscriptions caused by incorrect generated default pronunciations. It is also…

STT: What is Preferred Phrases feature and how to use it

…a decoder. The decoder uses the information from the acoustic model, combines it with information from the language model recognition network (which describes the statistics about word grouping and sentences of a given language), and provides the transcription output. (See the Speech To Text article for more details about speech transcription principles.) When using preferred phrases, we build additional language model…

SPE and Browser installation: standalone SPE

…to start processing your recordings with Phonexia Speech Technologies. 1. Download Evaluation package: Download the Phonexia Evaluation package from https://partner.phonexia.com/kb/sp/speech-platform/evaluation-package/ and simply unzip the package to your desired location. Ideally, avoid C:/Program Files, as you may face issues with privileges later on. 2. Save license.dat file: Copy the license.dat file to the /SPE/ directory. Make sure the license.dat file is not…

Speech to Text (STT)

…language model to be used for transcription. As output, the transcription is provided in one of the formats. The technology extracts features from the voice and, using acoustic and language models together with pronunciations, all in a recognition network, creates hypotheses of transcribed words and "decodes" the most probable transcription. Based on requested output types one or more transcribed text…

STT: Language Model Customization tutorial

The Language Model Customization tool (LMC) provides a way to improve Speech To Text performance by creating a customized language model. The language model is an important part of Phonexia Speech To Text. Put simply, it can be imagined as a large dictionary with multiple statistics. The Speech To Text technology uses this dictionary and statistical model to convert audio…

Understand SPE executable files

…technologies configuration, diagnostic info collection, etc. Usage: phxadmin [OPTION | COMMAND] [subcommand] [suboption…] Options: --help - Show help information on command line parameters and exit. --version - Show SPE version and exit. Commands: user - Manage SPE users. Without sub-command, lists all users. technology - Manage technologies. Without sub-command, lists enabled technologies. language-pack - Manage LID language packs. Without sub-command,…

Q: Do the language-prints (LPs) extracted from audio sources depend on the currently available language pack?

A: The language-prints do not depend on the current language pack used. You may use them for both training a new language pack and testing/comparing against an existing language pack. The language-prints need to be compatible only with the model of LID used for language-print extraction….

Understand SPE database

…user), technology model used to create the profile, file with the profile content, hash. rest_profile_sid4_metafiles: list of files used as SID4 Audio Source Profiles metafiles. rest_model_lid: list of LID language packs: name, owner (SPE user), technology model to which the language pack belongs (i.e. technology model used to create source languageprints/language models). rest_model_lid_metafiles: list of LID language packs metafiles…