A: Yes. Documentation is here: https://download.phonexia.com/docs/spe….
Search: le
145 results
A: These abbreviations mean the following: LR – likelihood ratio, result from statistical test for two models comparison. It returns a number which expresses how many times more likely the data are under one model than the other. LR meets numbers in interval <0;+inf). LLR – abbreviation for log-likelihood ratio statistic, logarithmic function of LR. LLR meets numbers in interval…
A: The following is recommended: For adding new language to language pack 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% of speech) Only 1 language per record For adapting the existing language model (discriminative training) 10+ hours of audio for each language May be done on customer site. May be done in…
A: Via HTTP header “Accept” parameter (application/json; application/xml) Via request query “format=json/xml” If the format is not defined (or the HTTP header “Accept” parameter has one of these values: application/*,*/*,*), server will return json….
A: The following options are supported: HTTP basic authorization – Client asks for session by resource “post /login” with HTTP basic authorization in query header. If server responds with error 405, server doesn’t support authorization by sessions and it is necessary to use basic authorization. Authorization by session – Authorization by session is done by adding parameter “X-SessionID“ into HTTP…
A: You can receive the list of running/configured technologies by running query get /technologies or using the phxadmin utility with parameter configure-tech…
Our technologies are prepared to run on both Windows and Linux OS. For more details of the supported operating systems as well as recommended HW setup, see Recommended OS and HW…
A: Yes, the system comes as an API (for the production license)….
It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) for 20+ languages including English, French, German, Russian, Spanish and many more….
Voice Activity Detection is a language-, domain- and channel-independent technology that identifies parts of audio recordings with speech content vs. non-speech content. It creates labels for speech and other signals in the recording; this can then serve as a decision point whether to process the recording by other technologies or not. VAD is usually part of rapid filtration process in…