Search Results for: language pack

Results 41 - 50 of 53 Page 5 of 6
Results per-page: 10 | 20 | 50 | 100

LM

Relevance: 1%      Posted on: 2018-02-01

Language Model (“vocabulary” in STT technology)

Phonexia technologies introduction

Relevance: 1%      Posted on: 2019-01-25

Core objective: Basic understanding of Phonexia speech technologies and products; typical use cases, implementations and deployment topologies Duration: 35 minutes intended for idea makers and product designers assumes generic knowledge of Phonexia and speech technologies in general Content 00:00 Introduction What information can we get from speech? Overview of basic use cases Phonexia Speech Platform brief 4:21 Phonexia technologies overview and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender…

LP

Relevance: 1%      Posted on: 2018-02-01

Language Print - output data from LID technology

Technical Training Essentials

Relevance: 1%      Posted on: 2019-09-27

Core objective: Understanding technical essentials of using Phonexia technologies and products Duration: ~94 minutes (7 + 19 + 22 + 23 + 23 min chapters) intended for product architects or developers assumes you have already watched Phonexia technologies introduction video assumes understanding of working in command line REST API principles processing JSON or XML Introduction (7 min) technologies recap CLI, REST and GUI interfaces overview https://youtu.be/xzrHyyIl01s MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY MODULE 2: Filtering and supporting…

Ph

Relevance: 1%      Posted on: 2018-02-01

Phoneme – The smallest phonetic unit in a language that is capable of conveying a distinction in meaning, as the m of mat and the b of bat in English.

How to configure Speech Engine workers

Relevance: 1%      Posted on: 2020-03-28

Worker is a working thread performing the actual files- or realtime streams processing in Speech Engine. This article helps to understand the Speech Engine workers and provides information how to configure workers for optimal performance and server utilization. The default workers configuration in settings/phxspe.properties is as shown below – 8 workers for files processing and 8 workers for realtime streams processing. These numbers mean the maximum number of simultaneously running tasks. # Multithread settings server.n_workers = 8 server.n_realtime_workers = 8 Requests for additional file processing tasks are put in a queue and processed according their order and priorities. Requests for…

Gender Identification

Relevance: 1%      Posted on: 2018-04-16

Gender Identification is a language-, domain- and channel-independent technology that uses the acoustic characteristics of the recording to determine the gender of the speaker in question. This technology is able to distinguish between two genders: Male (M) and Female (F). Minimum of speech signal for identification: 9+ sec recommended Output scoring: likelihood ratio and percentage metric (0-100%) Typical use cases: filtering calls by gender, playing advertisement focused on specific gender, getting quick demographic analysis of the recordings. The speed of Gender Identification is up to 150 FtRT (depending on the model).

How to configure STT realtime stream word detection parameters

Relevance: 1%      Posted on: 2020-03-28

One of the improvements implemented since Speech Engine 3.24 is neural-network based VAD, used for word- and segment detection. This article describes the segmenter configuration parameters and how they are affecting the realtime stream STT results. The default segmenter parametrs are as shown below: [vad.online_segmenter:SOnlineVoiceActivitySegmenterI] backward_extensions_length_ms=150 forward_extensions_length_ms=750 speech_threshold=0.5 Backward- and forward extension are intervals in miliseconds, which extend the part of the signal going to the decoder. Decoder is a component, which determines what a particular part of the signal contains (speech, silence, etc.). Based on that, decoder also decides whether segment has finished or not. Unlike in file processing…

Phonexia Speech Platform

Relevance: 1%      Posted on: 2017-05-18

  Phonexia Speech Platform (Speech Platform) provides partners a complete portfolio of speech technologies with an easy-to-use design. The platform allows users to design and deploy a wide range of speech processing systems in a short time and without extensive knowledge of the technologies background. Products On top of Speech Platform, several products provided: for commercial market Phonexia Speech Analytics Phonexia Voice Biometrics for government market Phonexia Speech Analytics GOV Phonexia Voice Biometrics GOV Characteristics Completeness – all speech technologies in one place Simple to use – RESTfull API for rapid development Modularity – build your own specific process workflow…

Speech Quality Estimation

Relevance: 1%      Posted on: 2018-04-02

Speech Quality Estimation (SQE) is a language-, domain- and channel-independent technology that quantifies the quality of an audio recording. 2 most important statistics used in the calculation of the SQE score are SNR (signal-to-noise ratio) and the bitrate of the recording. SQE is usually part of the rapid filtration process in deployments. SQE also measures over 20 other properties of the recording, all of which can be found in the output file and further processed. See description in SPE documentation. Typical use cases are: verification of recording quality on the input, searching based on quality of the recording, noise of…