Skip to content Skip to main navigation Skip to footer

Search: escore%20*100%20formula

53 results

Understand SPE database scripts

This article explains details and usage of SQL database scripts stored in SPE installation directory in /data/database subdirectory. These scripts are intended for setup and maintenance of SPE database for supported database types, currently SQLite and MariaDB (from SPE 3.46) / MySQL (up to SPE 3.45). Script types For each database type, there are two directories with two types of…

Speech to Text (STT)

…the word correctly and when technology evaluates the best result as not matching to what was really said. Confusion network result type provides similar output as n-best, only with the exception that segments are returned word by word. Usage of confusion network is the same as of n-best. Training of new models To create new model of STT about 100

Contact

Visit Us at Address: Chaloupkova 3002/1a, CZ 612 00 Brno, Czech Republic, European Union GPS: N 49° 13.426′, E 016° 35.898 General Queries and Sales [email protected] landline: +420 511 205 265 Company registration details Identification number (ICO): 27680258 VAT identification (DIC): CZ27680258 Registered in the Business Register kept at the District Court in Brno, File C, Inset 51524….

Installation of Phonexia Browser

Some packages are distributed with only a limited set of speech technologies and languages or without speech technologies. First installation Our software is distributed as a ZIP file. Installation procedure is as simple as: unzip the archive paste additional KWS, STT… models paste the license.dat file to the root directory where you have BROWSER folder and run_browser(.exe) script run the…

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

…to download for commercial/research purposes under a Creative Commons 4.0 license. Data originates from OXFORD VGG VoxCeleb Dataset which detailed license can be found here. SpeakerID Example Data Set v1.0 83.89 MB Download Publications: S. Chung, A. Nagrani, A. Zisserman VoxCeleb2: Deep Speaker Recognition INTERSPEECH, 2018. Nagrani, J. S. Chung, A. Zisserman VoxCeleb: a large-scale speaker identification dataset INTERSPEECH, 2017….

Understand SPE benchmark

The SPE benchmark feature is great tool for quick and simple evaluation of processing speed directly on your hardware and using your audio files – simply call the …/benchmark endpoint corresponding to the technology you want to benchmark and wait for the result. The benchmark result summarizes the length of the processed speech, the processing time and the resulting Faster-than-Realtime…

Q: What do LLR, LR and score mean?

A: These abbreviations mean the following: LR – likelihood ratio, result from statistical test for two models comparison. It returns a number which expresses how many times more likely the data are under one model than the other. LR meets numbers in interval <0;+inf). LLR – abbreviation for log-likelihood ratio statistic, logarithmic function of LR. LLR meets numbers in interval…

Gender Identification (GID)

…generation of XL3 and L3 models) Output scoring: log-likelihood ratio (LLR) and score (0-1). Score can be interpreted as percentage by multiplying the score by 100. Typical use cases: filtering calls by gender, playing advertisement focused on specific gender, getting quick demographic analysis of the recordings. The speed of Gender Identification is up to 150 FtRT (depending on the model)….

Terms of Service

Description of the Services provided by Phonexia s.r.o. 1. Acceptance of Terms of Service (Terms as a Contract) 1.1. PHONEXIA-User Relationship. These Terms of Service (hereinafter referred to as “Agreement” or „Terms of Service“) and the PHONEXIA Privacy Policy govern the relationship between Phonexia s.r.o. (ID No.: 27680258, VAT No.: CZ27680258, registred seat at: Chaloupkova 3002/1a, 61200 Brno, registred by…

Understand SPE workers configuration

…process data faster than realtime, which allows them to utilize 100% of a physical CPU core. This means that for file processing technologies the number of workers should be set to a number of physical CPU cores in the server and there is no point configuring more workers. Stream processing can process data at real time speed at maximum –…

Understand SPE processing queue

This article explains details about SPE asynchronous requests processing queue, the processing task lifecycle and its handling states. When SPE receives an asynchronous API request, a task is created and put in a queue. Size of the queue is defined by server.n_task_limit setting in SPE configuration file, default value is 1000 tasks. Tasks in queue are then handled according to…

Q: What are the recommendations for LID adaptation set?

A: The following is recommended: For adding new language to language pack 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% of speech) Only 1 language per record For adapting the existing language model (discriminative training) 10+ hours of audio for each language May be done on customer site. May be done in…

Q: What languages do you offer?

It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) for 20+ languages including English, French, German, Russian, Spanish and many more….