Search: SID

53 results

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

Language Identification – Languages

Recognized languages: Languages pre-trained in the default language pack are listed in the table below; each LID generation is a separate column (in the 4th generation we switched to using language tags instead of names):
L4 | L3, XL3 | S2, L2 (deprecated)
sq-AL Albanian | Albanian | Albanian
am-ET Amharic | Amharic | Amharic
ar-EG Arabic (Egypt) | Arabic |
ar-KW Arabic (Gulf, Kuwait) | Arabic_Gulf | …

Get better support

Any errors should be tested on the latest version of the product. Please ask your Phonexia contact for a link to download the latest version. Before submitting issue/ticket… For each issue, please collect the following…

Language Identification (LID)

Phonexia Language Identification (LID) will help you distinguish the spoken language or dialect. It will enable your system to automatically route valuable calls to your experts in the given language or to send them to other software for analysis. Application areas Preselecting multilingual sources and routing audio files to language-dependent technologies (transcribing, indexing, etc.) Analyzing network traffic media (language statistics)…

KWS: Results explained

This article aims to give more details about Keyword Spotting outputs and hints on how to tailor Keyword Spotting to best suit your needs. Scoring Keyword Spotting works by calculating likelihood ratios (LR) that a keyword, or just any other speech, occurs at a given spot, and comparing those two likelihood ratios. The following scheme shows Background model for anything…

Contact

Visit Us at Address: Chaloupkova 3002/1a, CZ 612 00 Brno, Czech Republic, European Union GPS: N 49° 13.426′, E 016° 35.898′ General Queries and Sales [email protected] landline: +420 511 205 265 Company registration details Identification number (ICO): 27680258 VAT identification (DIC): CZ27680258 Registered in the Business Register kept at the District Court in Brno, File C, Inset 51524….

Video – Speech Analytics technologies

MODULE 4: Speech Analytics technologies (23 min) Common generic rules for CLI, REST and GUI Speech To Text (STT) in CLI, REST and GUI Keyword Spotting (KWS) in CLI, REST and GUI Phoneme Recognizer (PHNREC) in CLI, REST and GUI Time Analysis Extraction (TAE) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=-FAoRywqv7U…

Support

Support is available 5 business days a week (Monday – Friday) / 8 business hours (09:00 – 17:00 CET) in English. If you have an issue with Speech Engine, please include a report in the ticket to help the support staff resolve your issue faster: Go to the Speech Engine installation directory Open command line/terminal (in Ubuntu Linux Right…

Video – Filtering and supporting technologies

MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…

Q: What do LLR, LR and score mean?

A: These abbreviations mean the following: LR – likelihood ratio, the result of a statistical test comparing two models. It returns a number which expresses how many times more likely the data are under one model than under the other. LR takes values in the interval [0, +inf). LLR – abbreviation for the log-likelihood ratio statistic, a logarithmic function of LR. LLR takes values in the interval…
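
The relation between LR and LLR described above can be sketched in a few lines of Python. This is only an illustration, not Phonexia's implementation: the helper names lr_to_llr and llr_to_lr are hypothetical, and the natural logarithm is an assumption, since the snippet does not state which base is used.

```python
import math


def lr_to_llr(lr: float) -> float:
    """Convert a likelihood ratio to a log-likelihood ratio.

    Assumption: natural logarithm; the source does not state the base.
    """
    if lr < 0:
        raise ValueError("a likelihood ratio cannot be negative")
    if lr == 0:
        # log(0) is undefined; treat it as the limit, minus infinity.
        return float("-inf")
    return math.log(lr)


def llr_to_lr(llr: float) -> float:
    """Inverse conversion: recover the likelihood ratio from an LLR."""
    return math.exp(llr)


# LR below 1 favours the second model (negative LLR),
# LR of exactly 1 favours neither (LLR of 0),
# LR above 1 favours the first model (positive LLR).
for lr in (0.5, 1.0, 20.0):
    print(f"LR = {lr:g} -> LLR = {lr_to_llr(lr):+.3f}")
```

Under this assumption, an LR of 1 maps to an LLR of 0, so the sign of the LLR alone tells which of the two models the data support.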

Speech to Text (STT)

About STT Phonexia Speech to Text (STT) converts speech in audio signals into plain text. The technology works with acoustics as well as a dictionary of words, an acoustic model, and pronunciation. This makes it dependent on language and dictionary: only a certain set of words can be transcribed. As input, an audio file or stream is needed, together with selection of…