Skip to content Skip to main navigation Skip to footer

Search: stt

46 results

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

Video – Filtering and supporting technologies

MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…

Video – Voice Biometrics technologies

MODULE 3: Voice Biometrics technologies (23 min) Common generic rules for CLI, REST and GUI Speaker Identification (SID) in CLI, REST and GUI Language Identification (LID) in CLI, REST and GUI Gender Identification (GID) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=AyEoPfYVel8…

Speech To Text / Keyword Spotting supported languages

Languages supported by Speech To Text and Keyword Spotting Standard = Maintained until newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) Model name Released End of support Maintenance Arabic (Gulf, Kuwait) AR_KW_6 2022-04 8th gen. Standard Arabic (Levantine) AR_XL_6 2021-05 8th gen. Standard AR_XL_5 2020-08 7th…

Download Orbis Trial

…Phonexia Orbis 1.4.0 Investigator trial 4.7 GB Download Phonexia Orbis 1.4.0 Investigator EN_US6 STT trial 5.6 GB Download Step #5 – Installation For installation instructions, please visit the Installation Guide page. We hope that you will like Phonexia Orbis. If you have any questions, don’t hesitate to contact your assigned Phonexia contact or write us a message on Phonexia Orbis…

Download Semantic Search demo

…/home/data/document_0003.txt /home/data/document_0004.txt Each document may also have metadata associated with it, these are textual and specified after the space symbol in <document_list> Example content of document list with metadata: /home/data/meeting_ashari_bago.txt STT transcript of meeting between bosses Ashari and Bago from 17.3.2021 /home/data/doc_twitter.1234.txt Twitter posts related to eventful event In this case metadata (e.g. STT transcript of meeting between bosses Ashari…

Download Speech Platform

…only English models for Speech To Text and Keyword Spotting. Additional supported languages are available upon request. ⓘ Click to show/hide the package content Speech Engine – technologies included: Speech To Text (STT) – model EN_US_6 (US English) Keyword Spotting (KWS) – model EN_US_6 (US English) Phoneme Recognizer (PHNREC) – model EN_US_6 (US English) Speaker Identification 4 (SID4) – model…

Waveform Denoiser (DENOISER)

…It is advisable to use the technology after the acoustic quality check of recordings. If some technical information indicates, for example, low values of signal-to-noise ration (SNR), it is advisable to divert the recording directly into the Denoiser technology to automatic noise reduction. On the other hand, it is not appropriate to send an automatically reconstructed recording subsequently to STT

Installation of Phonexia Browser

Some packages are distributed with only a limited set of speech technologies and languages or without speech technologies. First installation Our software is distributed as a ZIP file. Installation procedure is as simple as: unzip the archive paste additional KWS, STT… models paste the license.dat file to the root directory where you have BROWSER folder and run_browser(.exe) script run the…

Phonexia technology models EoL

Speech to Text (STT) and Keyword Spotting (KWS) models Languages supported by Speech To Text and Keyword Spotting Standard = Maintained until newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) Model name Released End of support Maintenance Arabic (Gulf, Kuwait) AR_KW_6 2022-04 8th gen. Standard Arabic…

Understand SPE multithreaded technologies initialization

…the other hand, parallel threads may cause very intensive disk activity when the system reads source data for multiple technologies at the same time. This is notable especially by technologies like STT, where initialization of each model typically needs to read approx. 1 GB of data from disk. Depending on the disk subsystem performance, fragmentation, etc., this high disk activity…

Phonexia technologies introduction

…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…

FAQs (Browser)

…details, see KWS technology documentation. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: What languages are supported by STT? A: Please see List of supported STT Languages. For more details, see STT technology documentation. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: I am getting SPE related error after starting the Browser (e.g. SPE server crashed, Error Downloading…,…

Phoneme Recogniser (PHNREC)

…output) Note: The outputs can contain the following special tokens: sil silent part (or no speech detected) The list of phonemes is available in the document phonemes_for_stt_and_kws.pdf (delivered as part of manuals in SPE or STT or KWS).   Languages Supported List of supported languages in Phoneme Recogniser is same as in Keyword Spotting.   Link to API reference https://download.phonexia.com/docs/spe/#%2Ftechnologies%2Fphnrec…

Understand SPE database

…results – file, used technology model, used speaker model, used FAR calibration set, max. FAR, results JSON data rest_result_sid4 SID4 processing results – file, used technology model, used speaker model, used file- and speaker model Audio Source Profile, results JSON data rest_result_sqe SQE processing results – file, used technology model, results JSON data rest_result_stt STT processing results – file, used…