Search Results for: speech to text

Results 31 - 40 of 103 Page 4 of 11
Results per-page: 10 | 20 | 50 | 100

Browser3 – Releases and Changelogs

Relevance: 8%      Posted on: 2020-10-23

Phonexia Browser v3 (Browser3) is developed as client on top of Phonexia Speech Engine v3. Phonexia Browser is a successor of Phonexia Speech Intelligence Resolver v1 (SIR1). This page lists changes in Browser releases. Releases Changelogs Phonexia Browser v3.35.2, BSAPI 3.35.2 - Oct 21 2020 Public release Fixed: Speaker identification dialog in WaveEditor which did not work for SID4 Fixed detection of certain USB license tokens Phonexia Browser v3.35.0, BSAPI 3.35.0 - Oct 02 2020 Public release New: Compatibility with SPE 3.35 Phonexia Browser v3.30.12, BSAPI 3.30.11 - Aug 20 2020 Public release Fixed: Transcription results intermittently displays words in wrong…

Save Your Time

Relevance: 8%      Posted on: 2017-06-22

If you start, the following posts might be interesting for you:   Phonexia Speech Platform is defined as an umbrella concept for all our products and services related to speech technologies. Main packages are Voice Biometrics and Speech Analytics.   Phonexia Browser PhxBrowser - application for quick tests and visualization of speech technologies results.   Speech Engine SPE3 - RESTfull API - it is adjustable server component which houses all speech technologies.   Other "good to start" pages: Academy is to help partners to understand the market, Phonexia’s products and technologies. Manuals Glossary

Speaker Identification (SID)

Relevance: 8%      Posted on: 2019-06-13

Phonexia Speaker Identification uses the power of voice biometry to recognize speakers by their voice... i.e. to decide whether the voice in two recordings belongs to the same person or two different people. High accuracy of Speaker Identification, the Phonexia's flagship technology, has been validated in a NIST Speaker Recognition Evaluations. Basic use cases and application areas The technology can be used for various speaker recognition tasks. One basic distinction is based on the kind of question we want to answer. Speaker Identification is the case when we are asking "Whose voice is this?", such as in fake emergency calls.…

Performance of the Speaker Identification 4th generation (SID4): Intel® Xeon® Platinum 8124M

Relevance: 7%      Posted on: 2019-10-30

Benchmark goals Find realistic performance using total recording length Find FTRT based exactly on net_speech (engineering sizing data) Find system performance using all physical cores Find system performance using all logical cores Infrastructure setup Intel® Xeon® Platinum 8124M is used in virtual machine with 8 physical cores reserved exclusively for this VM, Hyper Threading is enabled [16 logical cores available], 32GB RAM, 30GB SSD based storage, 1000 I/O.s-1  reserved per core Benchmark data setup Data set statistic: Number of files: 32 [300 seconds each] RAW recordings length ∑: 9600 [sec] Net speech length ∑: 4224.77 [sec] In the data set…

How to convert STT confusion network results to one-best

Relevance: 6%      Posted on: 2020-04-06

Confusion Network output is the most detailed Speech Engine STT output as it provides multiple word alternatives for individual timeslots of processed speech signal. Therefore many applications want use it as the main source of speech transcription and perform eventual conversion to less verbose output formats internally. This article provides the recommended way to do the conversion. Time slots and word alternatives: The recommended algorithm for converting Confusion Network (CN) to One-best is as follows: loop through all CN timeslots from start to end in each timeslot, get the input alternative with highest score and if it's not <null/> or…

Site Map

Relevance: 6%      Posted on: 2017-06-23

Phonexia Speech Platform Phonexia Speech Platform for Enterprise Phonexia Speech Analytics (SAL) Phonexia Voice Biometrics (VBS) Phonexia Speech Platform for Government Phonexia Speech Analytics GOV (SAL.gov) Phonexia Voice Biometrics GOV (VBS.gov) Components and Tools Phonexia Speech Engine v3 Speech technologies available Phonexia Browser v3 Phonexia Voice Inspector v3 Speech Intelligence Resolver v1 End of Life Components & Tools Phonexia Voice Inspector v1 Knowledge Base Blog Case Studies Demos Frequently Asked Questions (FAQ) How To… Lifetime Support Policies Manuals Presale Whitepapers and Presentations Product Briefs Developer Corner Code Examples Hints for App Design Hints for App Development List of Resources Phonexia…

Phonexia technologies introduction

Relevance: 6%      Posted on: 2019-01-25

Core objective: Basic understanding of Phonexia speech technologies and products; typical use cases, implementations and deployment topologies Duration: 35 minutes intended for idea makers and product designers assumes generic knowledge of Phonexia and speech technologies in general Content 00:00 Introduction What information can we get from speech? Overview of basic use cases Phonexia Speech Platform brief 4:21 Phonexia technologies overview and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender…

Components and Tools

Relevance: 6%      Posted on: 2017-05-18

This section collect information about specific components and tools of our Speech Platform.   API RESTfull API - Phonexia Speech Engine v3 (SPE3) - recommended   Apps and Tools Phonexia Browser v3 (Browser3) Voice Inspector v4 (VIN4) Voice Inspector v3 (VIN3)   You might be interested to see also Product Portfolio or End of Life Components & Tools. You might also browse our product support lifecycle policy to see which of our versions are supported and maintained.

Voice Biometrics

Relevance: 6%      Posted on: 2018-04-07

Overview Phonexia Voice Biometrics is a special edition of Phonexia Speech Platform which allows you to understand the nature of audio without having to listen to it. The product helps people to utilize the power of voice biometrics to verify speaker or identify crimes. The technologies reveals automatically WHO, what GENDER, what LANGUAGE is speaking, and many other metadata. Voice Biometrics - Typical Use-Cases Use case Speaker Verification is tailored to banks/insurance companies/money lending companies and others, where is needed to confirm if caller/voice in audio file is the same person who is known to the customer. For this use…

Phonexia Workflow

Relevance: 6%      Posted on: 2019-08-06

Phonexia Workflow is a set of tools complementing Phonexia Speech Engine (SPE), which allow users to chain speech technologies into scenarios and process audio recordings automatically using these scenarios. Scenarios are programmed using uniform API which provides an abstraction over Phonexia Speech Engine application. Provided Phonexia Workflow scenarios: SalEssentials - Speech Analytics Essentials filters out low quality audio files, provides demographic information, age estimation and speech to text processing VbsEssentials - Voice Biometrics Essentials filters out low quality audio files, provides gender identification, age estimation and speaker identification  The scenario is a tiny Java application which interacts with Phonexia technologies…