…now load transcription files that contain spaces in a word instead of ‘+’ signs Fixed: Wrong file suffix when saving transcription on Windows Phonexia Browser 3.59 (Public release) Phonexia Browser 3.59.0, BSAPI 3.59.0 (2023-06-20) New: Transcription can be saved in text formats supported by the transcription widget Improved: SPE Output widget is now visible by default and gets focused when…
Search: supported%20languages
42 results
…distribution allows 1:1 comparison Fixed: Various bug fixes Improved: Reworked dialog for population set management Changed: Population sets structure changed Removed: Speaker Identification models S2, L2, L3, XL3 are no longer supported Voice Inspector 3.2 Voice Inspector v3.2.2, BSAPI 3.15.0 (2018-06-05) Fixed possible application crashes on Windows Added phoneme type ‘affricate’ and fixed phoneme types: phoneme ‘C’ changed from ‘fricative’…
…technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis Extraction (TAE) 14:22 Speech Platform architecture; Speech Engine, Phonexia Browser, Phonexia Voice Inspector brief 18:52 HW and SW requirements, typical deployment topologies 21:34 Supported file- and stream formats, typical implementations and data flows 27:29 Licensing technical options 32:24 Summary, recommended next steps https://youtu.be/DDu0Y1rgQ6k…
…as a source and creates new STT model with your customizations included as a target. To see results of the customizations, you need to use the new STT model for the transcription. Currently supported language model customizations are: adding new words and/or pronunciations This is intended for adding client-, domain- or product specific words like company names, product names, component…
…w/o speech context) Methodology SID4 performance was measured on a virtual machine, Ubuntu 18.04 installed as host OS. SID4 v 3.21.3 command line was used, supported by VAD 3.22.1 command line used for collecting statistical metadata. The Virtual Machine was reserved only for this measurement experiment. Technical details: Driven by bash script in terminal emulator Measuring script was run 50…
This article explains details and usage of SQL database scripts stored in SPE installation directory in /data/database subdirectory. These scripts are intended for setup and maintenance of SPE database for supported database types, currently SQLite and MariaDB (from SPE 3.46) / MySQL (up to SPE 3.45). Script types For each database type, there are two directories with two types of…
…kept in the database at all. Supported databases SPE supports SQLite and MariaDB 10.x (SPE 3.46+) MySQL 5.x (SPE up to 3.45) database engine. The database engine is configured in phxspe.properties SPE configuration file – see the Database section of SPE configuration file article for more details. SQLite SQLite is the out-of-the-box SPE default database type. By its nature, SQLite…
…speakers) Supported audio format: MS Wave or RAW with linear coding (8 or 16 bits), A-law, Mu-law; Sampling frequency 8kHz or higher Output: A scoring table with the results of comparisons in a Likelihood Ratio, Log-Likelihood Ratio (decimal or natural logarithm), and Verbal Ratio The graphical presentation of results in the form of a Probability Density Function plot and a…
Recognized languages Languages pre-trained in the default language pack are listed in the table below, each LID generation is a separate column (in the 4th generation we switched to using language tags instead of names): L4 L3, XL3 S2, L2 (deprecated sq-AL Albanian Albanian Albanian am-ET Amharic Amharic Amharic ar-EG Arabic (Egypt) Arabic ar-KW Arabic (Gulf, Kuwait) Arabic_Gulf …
…Speech Engine documentation); stream not supported, technology model name to be used for processing. Output: audio file (WAV or RAW), together with xml/json report (in SPE only). Fig.: Comparison of original recording (david_noisy.wav, top half of image) and same recording processed by Denoiser (david_denoised.wav, bottom half of the image). Typical Questions Q: What do you recommend for deploying this technology?…
…the technology and model. However, this feature should be used only in special cases, e.g. if suggested by Phonexia experts. SPE users should normally not fiddle around with BSAPI configuration files… and if some technology config customization is needed, the user configuration file is the right method. Technology names supported in technologies configuration file: AGE Age Estimation DENOISER Denoiser DIAR…
…different alphabet (e.g. German word like “grüßen” in Czech transcription) or different writing script (like Cyrillic or Japanese Kana). In that case, the word pronunciation MUST be explicitly specified. The pronunciation must use only phonemes supported by the STT language (use GET /technologies/stt/phonemes to get allowed phonemes list). Specifying a word using disallowed characters without also specifying pronunciation causes that…