59 results

Speech To Text / Keyword Spotting supported languages

Languages supported by Speech To Text and Keyword Spotting. Standard = maintained until a newer generation is released or end of support is reached. The language generation is specified by the number in "Model name".

Language (region) | Model name | Released | End of support | Maintenance
Arabic (Gulf, Kuwait) | AR_KW_6 | 2022-04 | 8th gen. | Standard
Arabic (Levantine) | AR_XL_6 | 2021-05 | 8th gen. | Standard
Arabic (Levantine) | AR_XL_5 | 2020-08 | 7th…

Phonexia Ethical Code

…misuse their position in order to benefit us or our business partner. As we highly value free market competition, we do not give or accept bribes, and we strictly apply this principle to our business partners as well. Export Control: We and our business partners must comply with all applicable export control rules (including but not limited to the relevant EU…

Installation of Phonexia Browser

Some packages are distributed with only a limited set of speech technologies and languages, or without speech technologies.

First installation

Our software is distributed as a ZIP file. The installation procedure is as simple as:
- unzip the archive
- paste additional KWS, STT… models
- paste the license.dat file to the root directory where you have the BROWSER folder and the run_browser(.exe) script
- run the…

Age Estimation (AGE)

…time

Representation of the results:

For the CMD version: Name_of_the_file.wav Age [integer, limited to 99]
example/david_1.wav 41
example/david_2.wav 40

For the SPE version:
name: representing the age
score: representing the score for the age [1/0]

In order to get a result, each age receives a score; when the score equals "1", it represents the value of the…
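The CMD output shown above (one file name and one integer age per line) can be read back with a few lines of Python. This is only a sketch: the function name `parse_age_results` is hypothetical, and it assumes that each non-empty line holds exactly a file name followed by whitespace and the age, as in the two example lines.

```python
def parse_age_results(text: str) -> dict[str, int]:
    """Parse lines of the form '<file name> <age>' into a mapping.

    Assumption: the age is the last whitespace-separated token on each line.
    """
    results: dict[str, int] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on the last whitespace run so file names containing spaces survive.
        name, age = line.rsplit(None, 1)
        results[name] = int(age)
    return results


sample = "example/david_1.wav 41\nexample/david_2.wav 40\n"
print(parse_age_results(sample))
```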

Arabic dialects in Phonexia LID and STT

…Dialects are used for more personal communication: Facebook, Twitter, forums. There is not much material available, since most written texts are in MSA. Facebook, Twitter and forums can be used, but they need to be classified, corrected and unified manually; at Phonexia we do not do this. The above are the reasons for the limited out-of-the-box support of Arabic…

Understand SPE audio converter

SPE directly supports a limited list of audio formats (codecs and containers); see the Supported audio formats FAQ. Other audio formats must be converted using external tools. This conversion can be done either completely outside of SPE, before passing the files to SPE, or you can set up SPE to convert the files automatically. Then, depending on the capabilities of the conversion…

Contact

Visit Us at
Address: Chaloupkova 3002/1a, CZ 612 00 Brno, Czech Republic, European Union
GPS: N 49° 13.426′, E 016° 35.898′

General Queries and Sales
[email protected]
landline: +420 511 205 265

Company registration details
Identification number (ICO): 27680258
VAT identification (DIC): CZ27680258
Registered in the Business Register kept at the District Court in Brno, File C, Inset 51524….

Releases and Changelogs (VIN)

…1.3 2015-06-04 2016-12-04 2016-12-04 Public

Changelogs

Voice Inspector 5.2

Voice Inspector 5.2.0, BSAPI 3.61.0 (2024-04-04)
- New: New Case wizard checks for presence of Questioned and Reference recordings
- New: Number of audio channels is displayed in Case view, Recording details view, Score table view, Report
- Fixed: Application crash with phoneme search
- Fixed: Generalized logistic distribution for Suspected speaker vs. Suspected speaker…

Q: What are the supported audio formats?

…Linux and Apple OS X. Example of usage:

FFmpeg
    ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav
This command converts an audio file in any supported format/codec to WAV with 16-bit little-endian PCM, which is the default encoding for WAV output. For more parameters, please check the FFmpeg manual pages.

SoX
    sox <source_audio_file_name> -b 16 <output_audio_base_name>.wav
The number of bits, defined by the -b parameter, must be specified….
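When many files need converting before they are passed on, the FFmpeg call above can be wrapped in a small script. The following is a minimal sketch, assuming `ffmpeg` is available on the PATH; the function names are hypothetical, and the `-acodec pcm_s16le` option is added here only to make the 16-bit little-endian PCM output explicit rather than relying on the default.

```python
import subprocess


def build_ffmpeg_command(source: str, output_base: str) -> list[str]:
    """Build the ffmpeg argument list for a conversion to 16-bit PCM WAV."""
    return [
        "ffmpeg",
        "-i", source,            # input file in any format ffmpeg can decode
        "-acodec", "pcm_s16le",  # explicit 16-bit little-endian PCM
        f"{output_base}.wav",
    ]


def convert_to_wav(source: str, output_base: str) -> None:
    """Run the conversion; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(build_ffmpeg_command(source, output_base), check=True)


# Show the command without running it.
print(" ".join(build_ffmpeg_command("recording.mp3", "recording")))
```

Passing the arguments as a list, rather than as one shell string, avoids quoting problems with file names that contain spaces.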

KWS: Results explained

…file article for more details. Example of user configuration file:

    [score_calib:SKeywordScoreCalibrationI]
    confidence_shift=0.0
    confidence_sharpness=0.3

Results

Keyword Spotting results contain a list of detected keywords, each with the start and end time of the time slot where the keyword was detected, and a score and confidence. Each keyword is listed in the results with a numeric suffix. This number is a 0-based index of…
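To check which calibration values a given user configuration file sets, the example above can be read programmatically. The sketch below assumes the file is compatible with INI syntax as understood by Python's `configparser`, which the snippet suggests but does not confirm; `read_calibration` is a hypothetical helper name.

```python
import configparser

# The example configuration from the text, reproduced verbatim.
USER_CONFIG = """\
[score_calib:SKeywordScoreCalibrationI]
confidence_shift=0.0
confidence_sharpness=0.3
"""

SECTION = "score_calib:SKeywordScoreCalibrationI"


def read_calibration(config_text: str) -> dict[str, float]:
    """Return the calibration parameters of the score_calib section as floats."""
    parser = configparser.ConfigParser()
    parser.read_string(config_text)
    section = parser[SECTION]
    return {key: section.getfloat(key) for key in section}


print(read_calibration(USER_CONFIG))
```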

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…noise, technical signals like ringing, DTMF tones, etc.). This metric is useful for assessing performance on the actual audio data coming into an audio processing pipeline.

[Figure: Regular recording with Voice and Silence segments in waveform]

Net Speech based FtRT is a conservative, purely technical number. It is calculated from spoken speech data only, i.e. with all non-speech parts (silence, noise, DTMF tones, etc.)…
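The difference between the two variants can be illustrated numerically. The sketch below assumes the common definition of FtRT as the duration of the processed audio divided by the processing time; this formula is not shown in the excerpt above, and the durations used are made-up illustration values, not measurements.

```python
def ftrt(audio_seconds: float, processing_seconds: float) -> float:
    """Faster-than-Real-Time factor: seconds of audio handled per second of processing.

    Assumed definition: audio duration / processing time.
    """
    if processing_seconds <= 0:
        raise ValueError("processing time must be positive")
    return audio_seconds / processing_seconds


# Hypothetical recording: 600 s long, of which 240 s is net speech, processed in 30 s.
recording_based = ftrt(600.0, 30.0)    # counts silence, noise, tones, etc.
net_speech_based = ftrt(240.0, 30.0)   # counts spoken speech only

print(recording_based, net_speech_based)
```

Under this assumption the net-speech-based figure is never higher than the recording-based one for the same run, which is consistent with the text calling it the conservative number.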

Understand SPE connectors for external TTS

…capabilities of the TTS service is not a good idea, as it might become incorrect over time, leading to obscure issues in the application relying on the info.

Required capabilities information JSON structure:

    {
      "apiVersion": 2,
      "vendor": string,
      "author": string,
      "version": string,
      "voices": [
        {
          "name": string,
          "languageCodes": [string, string, …],
          "naturalSampleRateHertz": number
        },
        . . .
      ]…
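A connector author may want to sanity-check the capabilities document before returning it. The following sketch validates only the fields visible in the structure above; the structure is cut off in this excerpt, so further fields may exist that are not checked here. The function name `validate_capabilities` and the sample values are hypothetical.

```python
def validate_capabilities(caps: dict) -> list[str]:
    """Return a list of problems found; an empty list means the visible fields look valid."""
    problems: list[str] = []
    if caps.get("apiVersion") != 2:
        problems.append("apiVersion must be 2")
    for key in ("vendor", "author", "version"):
        if not isinstance(caps.get(key), str):
            problems.append(f"{key} must be a string")
    voices = caps.get("voices")
    if not isinstance(voices, list):
        problems.append("voices must be a list")
        return problems
    for i, voice in enumerate(voices):
        if not isinstance(voice, dict):
            problems.append(f"voices[{i}] must be an object")
            continue
        if not isinstance(voice.get("name"), str):
            problems.append(f"voices[{i}].name must be a string")
        codes = voice.get("languageCodes")
        if not (isinstance(codes, list) and all(isinstance(c, str) for c in codes)):
            problems.append(f"voices[{i}].languageCodes must be a list of strings")
        rate = voice.get("naturalSampleRateHertz")
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(rate, bool) or not isinstance(rate, (int, float)):
            problems.append(f"voices[{i}].naturalSampleRateHertz must be a number")
    return problems


# Made-up sample values for illustration only.
example = {
    "apiVersion": 2,
    "vendor": "ExampleVendor",
    "author": "Example Author",
    "version": "1.0.0",
    "voices": [
        {"name": "voice-a", "languageCodes": ["en-US"], "naturalSampleRateHertz": 22050}
    ],
}
print(validate_capabilities(example))
```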

Understand SPE benchmark

…as follows (the version number 1.0 is present only for historical reasons and is ignored):

    benchmark
    └── 1.0
        ├── default
        │   ├── 030.wav
        │   ├── 060.wav
        │   ├── 090.wav
        │   ├── 120.wav
        │   ├── 150.wav
        │   ├── 180.wav
        │   ├── 210.wav
        │   ├── 240.wav
        │   ├── 270.wav
        │   └── 300.wav
        └── czech
            ├── 030.wav
            ├── 060.wav
            ├── 090.wav
            ├──…

Adding new language or technology model (Browser)

…on the SPE server -> Server Info. You should see output similar to this: [screenshot not shown] Please share this SPE version number (3.50.5 in the above example) with your Phonexia contact or through a support ticket. Installation of new models: In our example, we will install the Spanish (ES_6) model of Speech to Text and Keyword Spotting (with Phoneme Recognizer) into an existing installation…