Search Results for: VAD full form


VAD

Relevance: 100%      Posted on: 2018-02-01

Phonexia Voice Activity Detection

SPE3 – Releases and Changelogs

Relevance: 52%      Posted on: 2020-12-01

Speech Engine (SPE) is developed as a RESTful API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x). This page lists changes in SPE releases. Releases Changelogs Speech Engine 3.36.0, DB v1601, BSAPI 3.35.3 (2020-12-01) Non-public Feature Preview release New: Added some useful information to log messages: Stream ID in task-related log messages Audio length in debug log messages Workers and streams info in debug log messages New: Possibility to obtain information about input RTP connection (see GET /input_stream/rtp/info) New: Endpoint to get languageprint information (see POST /technologies/languageid/lpinfo) Improved:…

Voice Inspector – supporting technologies

Relevance: 39%      Posted on: 2019-06-28

Automatic Speaker Identification (SID) is the most important but not the only Phonexia technology that is implemented in Voice Inspector (VIN). Apart from SID, forensic experts, users of VIN, can benefit from automatic Signal-to-Noise Ratio calculation, Voice Activity Detection, Phoneme search, and a Wave editor which incorporates the waveform, spectrum and power panel. Let's have a look at how to use the individual technologies. Signal-to-Noise Ratio Recording quality can strongly influence the reliability of SID results and so the outcome of a forensic case. Therefore, VIN uses a module of Phonexia Speech Quality Estimation (SQE) to calculate the Signal-to-Noise Ratio (SNR)…

Speaker Identification (SID)

Relevance: 24%      Posted on: 2019-06-13

Phonexia Speaker Identification uses the power of voice biometry to recognize speakers by their voice, i.e. to decide whether the voice in two recordings belongs to the same person or two different people. The high accuracy of Speaker Identification, Phonexia's flagship technology, has been validated in NIST Speaker Recognition Evaluations. Basic use cases and application areas The technology can be used for various speaker recognition tasks. One basic distinction is based on the kind of question we want to answer. Speaker Identification is the case when we are asking "Whose voice is this?", such as in fake emergency calls.…

Terms of Service

Relevance: 17%      Posted on: 2018-03-24

Description of the Services provided by Phonexia s.r.o. 1. Acceptance of Terms of Service (Terms as a Contract) 1.1. PHONEXIA-User Relationship. These Terms of Service (hereinafter referred to as "Agreement" or "Terms of Service") and the PHONEXIA Privacy Policy govern the relationship between Phonexia s.r.o. (ID No.: 27680258, VAT No.: CZ27680258, registered seat at: Chaloupkova 3002/1a, 61200 Brno, registered by the County Court in Brno under file C, insert 5124), provider of the PHONEXIA technology (hereinafter referred to as "PHONEXIA") and you ("you", "your", "user" or "Member"), and your use of and access to the website, PHONEXIA services or any…

Voice Inspector – Interpretation of results

Relevance: 17%      Posted on: 2019-06-24

Introduction Phonexia Voice Inspector (VIN) is a tool for forensic automatic speaker identification, compliant with the Methodological Guidelines for Best Practice in Forensic Semiautomatic and Automatic Speaker Recognition, published by the European Network of Forensic Science Institutes. This post explains individual SID score types and ways to visualize the results in a speaker identification case implemented in Voice Inspector. Evidence In VIN, the term evidence has two meanings. In general, it refers to any SID score that the system calculates for any pair of recordings in the case. These scores are the output of the Phonexia SID technology which runs…

How to configure STT realtime stream word detection parameters

Relevance: 15%      Posted on: 2020-03-28

One of the improvements implemented since Speech Engine 3.24 is a neural-network-based VAD, used for word and segment detection. This article describes the segmenter configuration parameters and how they affect the realtime stream STT results. The default segmenter parameters are as shown below: [vad.online_segmenter:SOnlineVoiceActivitySegmenterI] backward_extensions_length_ms=150 forward_extensions_length_ms=750 speech_threshold=0.5 The backward and forward extensions are intervals in milliseconds that extend the part of the signal going to the decoder. The decoder is a component that determines what a particular part of the signal contains (speech, silence, etc.). Based on that, the decoder also decides whether a segment has finished or not. Unlike in file processing…
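To make the effect of these three parameters concrete, here is a minimal offline sketch in Python. It is not Phonexia's implementation. It assumes the VAD yields one speech probability per fixed-length frame, that a frame counts as speech when its probability is at or above `speech_threshold`, and that each detected segment is widened by the backward and forward extensions. The function name `extended_segments`, the 10 ms frame length, and the merging of overlapping segments are all illustrative assumptions.

```python
from typing import List, Tuple

# Defaults copied from the segmenter configuration quoted in the snippet above.
BACKWARD_EXTENSION_MS = 150
FORWARD_EXTENSION_MS = 750
SPEECH_THRESHOLD = 0.5


def extended_segments(
    speech_probs: List[float],
    frame_ms: int = 10,  # assumed frame length, not taken from the source
    threshold: float = SPEECH_THRESHOLD,
    backward_ms: int = BACKWARD_EXTENSION_MS,
    forward_ms: int = FORWARD_EXTENSION_MS,
) -> List[Tuple[int, int]]:
    """Return (start_ms, end_ms) speech segments, extended and merged."""
    total_ms = len(speech_probs) * frame_ms

    # 1. Find raw runs of frames whose probability reaches the threshold.
    raw: List[Tuple[int, int]] = []
    start = None
    for i, p in enumerate(speech_probs):
        if p >= threshold and start is None:
            start = i
        elif p < threshold and start is not None:
            raw.append((start * frame_ms, i * frame_ms))
            start = None
    if start is not None:
        raw.append((start * frame_ms, total_ms))

    # 2. Extend each run backward and forward, clamp to the signal,
    #    and merge runs that overlap after the extension.
    merged: List[Tuple[int, int]] = []
    for seg_start, seg_end in raw:
        seg_start = max(0, seg_start - backward_ms)
        seg_end = min(total_ms, seg_end + forward_ms)
        if merged and seg_start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], seg_end))
        else:
            merged.append((seg_start, seg_end))
    return merged
```

With these defaults, a 200 ms burst of speech starting at 500 ms is passed on as the interval 350 ms to 1450 ms. A larger forward extension keeps more trailing audio with the segment, while a smaller one cuts it closer to the last speech frame.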

Performance of the Speaker Identification 4th generation (SID4): Intel® Xeon® Platinum 8124M

Relevance: 12%      Posted on: 2019-10-30

Benchmark goals Find realistic performance using total recording length Find FTRT based exactly on net_speech (engineering sizing data) Find system performance using all physical cores Find system performance using all logical cores Infrastructure setup Intel® Xeon® Platinum 8124M is used in a virtual machine with 8 physical cores reserved exclusively for this VM, Hyper-Threading is enabled [16 logical cores available], 32GB RAM, 30GB SSD based storage, 1000 I/O·s⁻¹ reserved per core Benchmark data setup Data set statistics: Number of files: 32 [300 seconds each] RAW recordings length ∑: 9600 [sec] Net speech length ∑: 4224.77 [sec] In the data set…
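The data set figures above can be checked with a few lines of Python. The sketch below recomputes the total raw length from the file count and relates it to the net speech length. It also assumes that FTRT stands for a "faster than real time" factor, defined as audio seconds divided by processing seconds; the snippet does not define the term, and the helper `ftrt` is hypothetical. No processing times are given in the snippet, so none are filled in here.

```python
# Data set statistics quoted in the snippet above.
NUM_FILES = 32
FILE_LENGTH_S = 300
NET_SPEECH_S = 4224.77

# Total raw recording length: 32 files of 300 seconds each.
raw_length_s = NUM_FILES * FILE_LENGTH_S

# Share of the raw audio that is net speech.
net_speech_ratio = NET_SPEECH_S / raw_length_s


def ftrt(audio_seconds: float, processing_seconds: float) -> float:
    """Assumed definition: seconds of audio processed per second of wall-clock time."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


print(f"raw length: {raw_length_s} s")
print(f"net speech share: {net_speech_ratio:.1%}")
```

Because only about 44% of the raw audio is net speech, an FTRT figure computed on net speech is correspondingly lower than one computed on total recording length for the same processing time, which is why the benchmark lists the two as separate goals.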

SPE configuration

Relevance: 11%      Posted on: 2018-02-02

Basic explanation of configuration directives for SPE with hints & tips. Overview of phxspe.properties for beginners.

Phonexia – introduction

Relevance: 10%      Posted on: 2018-03-14

What we believe in At Phonexia, we find joy in pushing the boundaries of innovation in the field of speech technology by automating and simplifying solutions for many of today’s complex communication and security-strategic challenges. By providing our partners and customers with state-of-the-art speech-technology software, we leverage the power, and data, in their voices. Who we are Phonexia is the only speech technology software manufacturer that reveals and leverages the most data in speech for enterprising trailblazers across the globe who want to discover and develop powerful new skills in a knowledge-based economy. We have more than 19 years…