Skip to content Skip to main navigation Skip to footer

Search: pt-br

24 results

SID4 performance on Intel® Xeon® Platinum 8124M

…w/o speech context) Methodology SID4 performance was measured on a virtual machine, Ubuntu 18.04 installed as host OS. SID4 v 3.21.3 command line was used, supported by VAD 3.22.1 command line used for collecting statistical metadata. The Virtual Machine was reserved only for this measurement experiment. Technical details: Driven by bash script in terminal emulator Measuring script was run 50…

Arabic dialects in Phonexia LID and STT

…a bit, but you won’t understand Moroccan Data acquisition AUDIO (used for LID and STT training) MSA is used in formal speaking situations such as sermons, lectures, news broadcasts, and speeches so it is pretty difficult/impossible to find recordings of spontaneous phone conversations in MSA available MSA recordings are usually from broadcasting (microphone) or rather formal scripted speeches (also microphone)…

Language Identification – Languages

Recognized languages Languages pre-trained in the default language pack are listed in the table below, each LID generation is a separate column (in the 4th generation we switched to using language tags instead of names): L4 L3, XL3 S2, L2 (deprecated sq-AL Albanian Albanian Albanian am-ET Amharic Amharic Amharic ar-EG Arabic (Egypt) Arabic   ar-KW Arabic (Gulf, Kuwait) Arabic_Gulf  …

Phonexia Partner Program for Government Partners

…X 2-day Technical Response X X Presentation Data X X Phonexia News X X X Starter Kit Whether you are a new Silver partner looking for help with your first Phonexia project, or an experienced Gold partner that wants to ensure their top-class proof of concept will fulfill the customer’s expectations, the Starter Kit is an excellent way to secure…

STT: What is Preferred Phrases feature and how to use it

…may come handy, allowing to prompt the speech transcription with phrases or words which are expected to appear in the utterance, thus increasing the chances of correctly transcribed words, increasing the overall transcription accuracy. The intended application of this feature is mainly voicebots, i.e. in questions-driven dialogues, where the probable answers to each individual question are predictable and expected. But…

Contact

Visit Us at Address: Chaloupkova 3002/1a, CZ 612 00 Brno, Czech Republic, European Union GPS: N 49° 13.426′, E 016° 35.898 General Queries and Sales [email protected] landline: +420 511 205 265 Company registration details Identification number (ICO): 27680258 VAT identification (DIC): CZ27680258 Registered in the Business Register kept at the District Court in Brno, File C, Inset 51524….

Key Features (PSP)

Phonexia Speech Platform is provided as a set of several components: The Speech Engine (SPE) component is a REST API that includes technologies for the automated processing of audio files and audio streams. This component is usually provided in a specific configuration that meets the customer’s use case. The Phonexia Browser component is an expert-level application (on the top of…

Speech to Text (STT)

…including discriminative training and neural network-based features Output One-best transcription – i.e. a file with a time-aligned speech transcript (time of word’s start and end) Variants for transcriptions – i.e. hypotheses for words at each moment (confusion network) or hypotheses for utterances at each slot (n-best transcription) Processing speed – several versions available: from 8x faster than real-time processing on…

Keyword Spotting (KWS)

…experts. Typical use cases Call centers increase operator and supervisor efficiency by searching calls identify inappropriate expressions from operators check marketing campaigns with automatic script-compliance control Mass media and web search servers index and search multimedia by keyword route multimedia files and streams according to their content Security/defense maintain fast reaction times by routing calls with specific content to human…