Skip to content Skip to main navigation Skip to footer

Search: phoneme

22 results

Phoneme Recogniser (PHNREC)

Phonexia Phoneme Recogniser (PHNREC) converts speech signals into pronunciation characters (so called phonemes). After the conversion, the pronunciation (text) can be easily indexed and searched by third party text data mining tools. The technology is optimized for noisy recordings and colloquial speech, can process audio files as well as audio streams and can provide results in several output formats. Phoneme

Releases and Changelogs (SPE)

…(the first model of 5th generation) [BSAPI] Added more accurate G2P (5th generation only) [BSAPI#72] Fixed phoneme recognizer doesn’t make phonemes for phnrec_ru_ru.bs [BSAPI#99] Fixed phoneme recognizer with configuration phnrec_cs_cz.bs doesn’t transcript short recordings [BSAPI#82] Fixed missing configuration of phnrec for HR_HR4 [BSAPI#78] Fixed STT segmentation – a segment doesnt break on a long silence, creates false crosstalks [BSAPI#148] Phoneme

Keyword Spotting (KWS)

…to reveal (or “transcribe”) pronunciation directly from actual audio recording. Phoneme Recognizer Phoneme Recognizer (PHNREC) reveals the phoneme transcription of a specified audio recording, or its part. This can be used to get the actual pronunciation of a keyword or phrase as is actually spoken in the audio recording. This pronunciation can be then used in a keyword list for…

Releases and Changelogs (VIN)

…distribution allows 1:1 comparison Fixed: Various bug fixes Improved: Reworked dialog for population set management Changed: Population sets structure changed Removed: Speaker Identification models S2, L2, L3, XL3 are no longer supported Voice Inspector 3.2 Voice Inspector v3.2.2, BSAPI 3.15.0 (2018-06-05) Fixed possible application crashes on Windows Added phoneme type ‘affricate’ and fixed phoneme types: phoneme ‘C’ changed from ‘fricative’…

STT: Adding words to language model on the fly

…different alphabet (e.g. German word like “grüßen” in Czech transcription) or different writing script (like Cyrillic or Japanese Kana). In that case, the word pronunciation MUST be explicitly specified. The pronunciation must use only phonemes supported by the STT language (use GET /technologies/stt/phonemes to get allowed phonemes list). Specifying a word using disallowed characters without also specifying pronunciation causes that…

Releases and Changelogs (Browser)

…didn’t work Phonexia Browser v3.13.1, BSAPI 3.17.0 – Nov 19 2018 [G#79] Dropdown buttons in SID Evaluation wizard results page can get hidden without user knowing about them [G#90] Fixed synchronize properly items in View menu when layout is reseted [G#88] Fixed graphs in LID detail view may show incorrect values [G#36] Added Phoneme Recognizer [G#62] Use Qt 5 [G#85]…

STT: Language Model Customization tutorial

…containing list if words to be added to the STT language model, one word per line. Note: LMC v3.30.0 (March 2020) or older requires the text file without Byte-Order-Mark (BOM) Each word can be optionally followed by its pronunciation, separated from the word by SPACE or TAB character. Pronunciations must use only phonemes allowed by the corresponding language – see…

STT: What is Preferred Phrases feature and how to use it

…German word like “grüßen” in Czech transcription – or even using different writing script like Cyrillic or Japanese Kana. Such words MUST be accompanied by a pronunciation definition, and that definition must use only phonemes supported by the STT model (i.e. the German word from the previous example would need to have pronunciation defined using Czech phonemes). See also more…

Understand SPE directory structure

…at https://download.phonexia.com/docs/spe/ INSTALL.html, INSTALL.txt Quick installation guide in HTML and TXT format UPDATE.txt Quick update instructions and SPE configuration file changes between SPE versions result_versions.txt List of REST API result versions Phonemes_for_STT_and_KWS.pdf List of STT/KWS phonemes, useful e.g. for keyword pronunciations definitions Technology_LID_L4_Language_tags.pdf List of LID L4 language tags and more details about languages they refer to EULA EULA directory…

Key Features (VIN)

…Speaker Identification, Speaker Diarization, Phoneme Recognizer, Voice Activity Detection, Speech Quality Estimation A search for repetitive sound patterns across all recordings in audio due to the automatic phonemic transcription Input: Questioned recordings (a minimum of 1 recording) Suspected speaker recordings (a minimum of 1 recording) The Population set (a technical minimum of 10 speakers, and a recommended minimum of 50…

Key Features (PSP)

…– detects the audio part that contains voice, Speech Quality Estimation (SQE) – measures the quality of speech, Phoneme Recognizer (PHNREC) – several languages supported – converts speech into phonemes (written characters representing pronunciation), Waveform Denoiser (DENOISER) – automatically improves the audibility of speech for human listeners. Supported Languages The LID, STT and KWS technologies support various languages as listed…

Adding new language or technology model (Browser)

…on the SPE server -> Server Info. You should see the output similar to this: Please share this SPE version number with your Phonexia contact/or through support ticket (in the above example 3.50.5) . Installation of new models In our example, we will install Spanish (ES_6) model of Speech to Text and Keyword Spotting (with Phoneme Recognizer) into existing installation…

SPE and Browser installation: standalone SPE

…line type: phxadmin.exe /configure-tech SPE on Linux Open the Terminal window in  /SPE/ directory Type in the terminal: ./phxadmin –configure-tech This will open the list of technologies (and language models) available for you to chose from 1) Age Estimation [disabled] 2) Denoiser Technology [disabled] 3) Diarization [disabled] 4) Gender Identification [disabled] 5) Keyword Spotting [disabled] 6) Phoneme Recognition [disabled] 7)…

Download Speech Platform

…only English models for Speech To Text and Keyword Spotting. Additional supported languages are available upon request. ⓘ Click to show/hide the package content Speech Engine – technologies included: Speech To Text (STT) – model EN_US_6 (US English) Keyword Spotting (KWS) – model EN_US_6 (US English) Phoneme Recognizer (PHNREC) – model EN_US_6 (US English) Speaker Identification 4 (SID4) – model…