
Search: languages

35 results

Download Speech Platform

…only English models for Speech To Text and Keyword Spotting. Additional supported languages are available upon request. Speech Engine – technologies included: Speech To Text (STT) – model EN_US_6 (US English); Keyword Spotting (KWS) – model EN_US_6 (US English); Phoneme Recognizer (PHNREC) – model EN_US_6 (US English); Speaker Identification 4 (SID4) – model…

Installation of Phonexia Browser

Some packages are distributed with only a limited set of speech technologies and languages, or without speech technologies. First installation: Our software is distributed as a ZIP file. The installation procedure is as simple as: unzip the archive; paste additional KWS, STT… models; paste the license.dat file to the root directory where you have the BROWSER folder and the run_browser(.exe) script; run the…

Phonexia Speech Engine

…main binary file itself, SPE requires a database, which might be SQLite (delivered inside the Phonexia package) or MySQL. No other components are needed. Structure of technologies and technology models: From the technical point of view, every technology can work with different technology modules. These are various languages for STT (CS_CZ4, EN_US4), or various sizes for SID (L3, XL3). Technology can work…

Understand SPE metafiles

…i.e. should be handled by the application built on top of the SPE API. This includes handling of any metadata associated with the processed audio files, like phone numbers, source of the recording, date/time the audio was recorded, references to the persons speaking in the recording (names, photos, …), languages spoken in the recording, etc. – all this data is expected…

Understand SPE connectors for external TTS

…expected to provide information about actual TTS service capabilities: list of voice names, supported languages and audio quality (sampling frequencies). This info is used during the SPE startup sequence: TTS connectors enabled in the SPE configuration file are started with the –info parameter and SPE reads the connector output. Connectors failing to provide the info won’t be available for use with SPE….

Q: What are the requirements for SID evaluation dataset?

…in each recording (i.e. usually 2+ minutes recording length); only one speaker in each recording; wide variety of gender and age is recommended; recordings should be as similar to the target use case as possible (device, channel, distance from mic, languages distribution); audio files should be mono, lin16 format, 8 kHz+ sample rate. *Note: splitting single recording into multiple shorter…

Understand SPE directory structure

…at https://download.phonexia.com/docs/spe/ INSTALL.html, INSTALL.txt: Quick installation guide in HTML and TXT format; UPDATE.txt: Quick update instructions and SPE configuration file changes between SPE versions; result_versions.txt: List of REST API result versions; Phonemes_for_STT_and_KWS.pdf: List of STT/KWS phonemes, useful e.g. for keyword pronunciations definitions; Technology_LID_L4_Language_tags.pdf: List of LID L4 language tags and more details about languages they refer to; EULA: EULA directory…

Understand SPE technologies configuration file

This article explains the purpose and structure of the SPE technologies configuration file technologies.xml, or technologies.json created by Phonexia Browser. An SPE installation usually includes multiple speech technologies (e.g. Speaker Identification, Speech To Text, etc.) in various technology models (e.g. L4, XL4, etc.) or supporting various languages (e.g. 6th generation of EN_US, CS_CZ, etc.). You can select from these technologies/models those…

Phoneme Recogniser (PHNREC)

…output) Note: The outputs can contain the following special tokens: sil – silent part (or no speech detected). The list of phonemes is available in the document phonemes_for_stt_and_kws.pdf (delivered as part of manuals in SPE or STT or KWS). Languages Supported: The list of supported languages in Phoneme Recogniser is the same as in Keyword Spotting. Link to API reference: https://download.phonexia.com/docs/spe/#%2Ftechnologies%2Fphnrec…

Releases and Changelogs (VIN)

languages – improved report layout, easier CSS styling via style.css, added new report variables for Expert and Organization Voice Inspector 5.0 Voice Inspector 5.0.0, BSAPI 3.57.0 (2023-06-20) New: Speaker Identification XL5 technology model New: Data in lists/tables are now sorted alphabetically New: Enlarge the initial set of speakers included in examples; some of the speakers are multilingual ❗❗❗ Voice Inspector…

Adding new language or technology model (Browser)

…our example, we are adding a new Spanish model (ES_6 technology model) of Speech to Text and Keyword Spotting (with Phoneme Recognizer). When you install new languages or models, they are turned off by default and need to be enabled in Phonexia Browser. To turn new models on, open Phonexia Browser: go to Settings; switch to the Speech Engine tab; open STT…

STT: What is Words-To-Numbers feature and how to use it

This article explains the details of the new STT feature for native transcription of numbers and dates in n-best output and gives some tips for fine-tuning the results. NOTE: The feature works out-of-the-box in the following STT languages and models: English – EN_US_6 and EN_US_A_6; Spanish – ES_6; Polish – PL_PL_6; Czech – CS_CZ_5 and CS_CZ_6; Slovak – SK_SK_5 and SK_SK_6. You…

Download Semantic Search demo

…an Ubuntu-based Linux operating system with a GUI. Supported languages (ISO code, name): af Afrikaans; ht Haitian_Creole; pt Portuguese; am Amharic; hu Hungarian; ro Romanian; ar Arabic; hy Armenian; ru Russian; as Assamese; id Indonesian; rw Kinyarwanda; az Azerbaijani; ig Igbo; si Sinhalese; be Belarusian; is Icelandic; sk Slovak; bg Bulgarian; it Italian; sl Slovenian…

Phonexia technology models EoL

…SV_SE_6 2021-05 8th gen. Standard; SV_SE_5 2020-04 7th gen. Standard; Turkish (Turkey) TR_TR_6 2022-01 8th gen. Standard; Ukrainian (Ukraine) UK_UA_6 2023-04 8th gen. Standard; Vietnamese (Vietnam) VI_VN_6 2021-10 8th gen. Standard. Deprecated languages/models (not supported, after end-of-life): Older/other languages or models not listed in the above table are no longer supported and have reached end-of-life. These are 1st, 2nd, 3rd or…