Search Results for: `diarization

Results 1 - 10 of 15 Page 1 of 2
Results per-page: 10 | 20 | 50 | 100

Speaker Diarization

Relevance: 100%      Posted on: 2018-04-02

Speaker Diarization labels segments of the same voice(s) in one mono channel audio record based by the individual speaker´s voice. It is a language-, domain- and channel-independent technology. It performs not only the segmentation of speakers, but of technical signals and silence as well. The outputs of the technology can be both log file with labels and/or split audio files/one new multichannel audio file. The correct speaker diarization is still research task nowadays. Typical use cases: Preprocessing for other speech recognition technologies, labeling the parts of the utterance according to the speakers, splitting telephone conversation recorded in mono into several…

Speaker Diarization (DIAR)

Relevance: 91%      Posted on: 2017-06-26

About DIAR Phonexia Speaker Diarization (DIAR) enables segmentation of voices in one monochannel audio record. Technology Trained with emphasis on spontaneous telephony conversation The technology is language-, accent-, text-, and channel- independent Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input Input format for processing: WAV or RAW (8 or 16 bits linear coding), A-law or Mu-law, PCM, 8kHz+ sampling Output Log file with processed information (segmentation of speech, silence, and technical signals – ie. elimination of phone lines beeps, DTMF tones, music, pauses, etc.) Audio file extracted for each…

SPE3 – Releases and Changelogs

Relevance: 33%      Posted on: 2021-06-11

Speech Engine (SPE) is developed as RESTfull API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x). Releases Changelogs Speech Engine 3.40.5, DB v1700, BSAPI 3.40.4 (2021-05-09) Public release Fixed: When trying to register webhook over existing webhook for any stream technology, SPE returns HTTP 400 (1069) error instead of HTTP 500 Fixed: Invalid SQL syntax when overwriting voiceprint in a database Speech Engine 3.35.7, DB v1601, BSAPI 3.35.5 (2021-05-09) Public release Fixed: Invalid SQL syntax when overwriting voiceprint in a database Speech Engine 3.40.4, DB v1700, BSAPI…

Speech Intelligence Resolver v1

Relevance: 28%      Posted on: 2017-05-18

About Phonexia Speech Intelligence Resolver v1 (SIR1) combines the power of speech technologies within a single application. The application automatically performs visualization of the record as well as filtering the speech metadata uncovered from your records effectively. Speech technologies implemented: Phonexia Speaker Identification (SID2) Phonexia Language Identification (LID2) Phonexia Gender identification (GID) Phonexia Voice Activity Detection (VAD) Phonexia Speaker Diarization (DIAR) Phonexia Keyword Spotting (KWS) Phonexia Speech Quality Estimator (SQE) Phonexia Speech Transcription (STT) SIR is a client application cooperating with REST servers. It can be used as a standalone application due to the integrated local REST server. It was…

Browser3 – Releases and Changelogs

Relevance: 19%      Posted on: 2021-06-10

Phonexia Browser v3 (Browser3) is developed as client on top of Phonexia Speech Engine v3. Phonexia Browser is a successor of Phonexia Speech Intelligence Resolver v1 (SIR1). This page lists changes in Browser releases. Releases Changelogs Phonexia Browser 3.40.4, BSAPI 3.40.4 (2021-06-10) Public release Changed: SID Evaluator - do not interrupt processing when an error occurs, but view all errors and continue creating the evaluation set Fixed: SID Evaluator - invalid GID score values Fixed: SID Evaluator - missing SQE information in report Fixed: SID Evaluator - don't save disabled recordings to evaluation set Phonexia Browser 3.40.3, BSAPI 3.40.4 (2021-05-28)…

Phonexia Speech Engine

Relevance: 9%      Posted on: 2021-05-05

Phonexia Speech Engine (SPE) is main part of Phonexia Speech Platform. SPE is a server application for 64-bit Linux or Windows, providing REST API to entire portfolio of Phonexia speech technologies. SPE capabilities overview: Audio files and stream processing   Audio files   RTP / HTTP streams Speaker Identification (SID) ✓   ✓ Speech To Text (STT) ✓   ✓ Keyword Spotting (KWS) ✓   ✓ Voice Activity Detection (VAD) ✓   ✓ Time Analysis Extraction (TAE) ✓   ✓ Language Identification (LID) ✓     Gender Identification (GID) ✓     Age Estimation (AGE) ✓     Speech Quality…

Voice Biometrics

Relevance: 9%      Posted on: 2018-04-07

Overview Phonexia Voice Biometrics is a special edition of Phonexia Speech Platform which allows you to understand the nature of audio without having to listen to it. The product helps people to utilize the power of voice biometrics to verify speaker or identify crimes. The technologies reveals automatically WHO, what GENDER, what LANGUAGE is speaking, and many other metadata. Voice Biometrics - Typical Use-Cases Use case Speaker Verification is tailored to banks/insurance companies/money lending companies and others, where is needed to confirm if caller/voice in audio file is the same person who is known to the customer. For this use…

Speech Analytics

Relevance: 9%      Posted on: 2018-04-06

Overview Phonexia Speech Analytics allows you to understand the  content of audio without having to listen to it. The results help both commercial entities and security/defense forces for immediate precise decision and response. The technologies reveal automatically WHAT content, TOPIC and KEY PHRASES are spoken, and many other metadata.   Speech Analytics - Typical Use-Cases Speech transcription is used in various applications. Knowledge of content of whole call is bringing business value to the customer, comparing to listening to the audio files by analytic or supervisor. Reading the text is also faster than listening to the audio. Speech Analytics output…

Phonexia Speech Platform for Government

Relevance: 5%      Posted on: 2017-05-18

Phonexia Voice Biometrics GOV is a special edition of Phonexia Speech Platform for Government which allows you to understand the nature of audio without having to listen to it. The product helps people to utilize the power of voice biometrics to filter audio and prevent or identify crimes. The technologies reveal automatically WHO, what GENDER, what LANGUAGE is speaking, and many other metadata. The product can be used typically for investigation support, SIGINT or other types of operations. It serves 4 main use-cases: Voice Biometrics - Speaker Search in Archive (Investigation) Voice Biometrics - Speaker Spotting Tactical Voice Biometrics -…

Phonexia Browser

Relevance: 5%      Posted on: 2017-05-18

About Phonexia Browser v3 (Browser v3) software that combines the power of speech technologies in a single desktop application. The application automatically  performs visualization of records as well as effective filtration of speech metadata uncovered from the user´s records. Speech technologies implemented: Speaker Identification (SID) Language Identification (LID) Gender identification (GID) Voice Activity Detection (VAD) Speaker Diarization (DIAR) Keyword Spotting (KWS, 10+ languages available) Speech Quality Estimator (SQE) Speech to Text (STT, 10+ languages available) Age Estimation (AGE) Browser v3 is a client application cooperating with Speech Engine v3 (SPE3). It is possible to use it as a client -…