Search: SID API

20 results

Releases and Changelogs (SPE)

…(the first model of 5th generation) [BSAPI] Added more accurate G2P (5th generation only) [BSAPI#72] Fixed phoneme recognizer doesn’t make phonemes for phnrec_ru_ru.bs [BSAPI#99] Fixed phoneme recognizer with configuration phnrec_cs_cz.bs doesn’t transcribe short recordings [BSAPI#82] Fixed missing configuration of phnrec for HR_HR4 [BSAPI#78] Fixed STT segmentation – a segment doesn’t break on a long silence, creates false crosstalks [BSAPI#148] Phoneme…

Releases and Changelogs (Browser)

…doesn’t contain any speech Updated: Synchronize versioning with BSAPI Phonexia Browser 3.26 Phonexia Browser 3.26.0, BSAPI 3.26.0 (2020-02-28) Updated: Updated BSAPI Phonexia Browser 3.25 Phonexia Browser 3.25.1, BSAPI 3.25.0 (2020-02-03) Fixed: missing library on Windows Phonexia Browser 3.25.0, BSAPI 3.25.0 (2020-01-30) Updated: Updated BSAPI Phonexia Browser 3.24 Phonexia Browser 3.24.0, BSAPI 3.24.0 (2019-12-17) Fixed: Sorting of columns is not working…

Understand SPE database

…by SPE users: rest_model_sid list of SID speaker models – name, owner (SPE user), modification timestamp rest_model_sid_sources list of files used as sources for SID speaker models creation rest_model_sid_metafiles list of files used as SID speaker models metafiles rest_group_sid list of SID speaker groups – name, owner (SPE user) rest_group_sid_models associations between SID speaker groups and speaker models rest_voiceprint SID…

Release Notes

…how to use it. Deprecated Features BSAPI (C++ API) discontinued – ANNOUNCEMENT We set the End of Life for BSAPI (our C++ API) for 2023-03-31 after discussion with partners/customers who actively gave us feedback on the C++ API. What does it mean for partners/customers? Partners/customers with an installed BSAPI version and valid Maintenance & Support can update to BSAPI v3.40.x (March…

Releases and Changelogs (VIN)

…displays the names of compared recordings added Spanish localization updated BSAPI fixed labels and minor bugfixes Voice Inspector 3.1 Voice Inspector v3.1.1, BSAPI 3.9.1 (2016-12-14) fixed bug in Speaker Identification Evaluator – Evaluation from a directory Voice Inspector v3.1.0, BSAPI 3.9.1 (2016-10-24) VIN is available with the SID_L3 technology model Voice Inspector 3.0 Voice Inspector v3.0.0, BSAPI 3.7.0 – Aug…

Understand SPE directory structure

…built-in benchmark functionality – see more details in Understanding SPE benchmark article and …/benchmark REST API endpoint of each technology in SPE REST API documentation. The database directory contains SQL scripts for setup, maintenance and updates of supported databases. See more details in Understanding SPE database scripts. doc doc directory contains various documentation api_reference.html REST API documentation; also available online…

SID: Speaker Identification: Results Enhancement

…speaker voiceprints. In general, both sides (enroll and test) can be calibrated either by the same profile (if they come from the same source) or by two different profiles (if they come from different sources). Examples: We are selling SID to Wakanda’s Ministry of Defense. They would like to monitor the telephone network of their happy people speaking the Wakandan…

Understand SPE executable files

…SID4C (SID4 extractor and SID4 comparator) with both L4 and XL4 models, depending on actual availability of the technologies/models in that SPE installation. Due to the “…single character” pattern definition, the list won’t include SID4E_STREAM, SID4C_STREAM and SID4CALIB technologies. phxadmin2: example 3 ./phxadmin2 technology enable sid?_stream:*l?=3 sid4?_stream:*l?=1 enable 3 instances of technologies with names matching “sid followed by single character,…
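The "single character" behaviour described in this snippet can be illustrated with a minimal sketch. It assumes the wildcards behave like shell-style globbing (`?` for exactly one character, `*` for any run of characters) and that matching is case-insensitive; this is not phxadmin2's actual implementation, and the pattern `sid4?` is a hypothetical stand-in for the pattern elided from the snippet.

```python
from fnmatch import fnmatchcase

# Technology names mentioned in the snippet above.
TECHNOLOGIES = ["SID4E", "SID4C", "SID4E_STREAM", "SID4C_STREAM", "SID4CALIB"]


def matching(pattern, names):
    """Return the names that match a glob pattern, ignoring case (an assumption)."""
    return [n for n in names if fnmatchcase(n.lower(), pattern.lower())]


# "?" stands for exactly one character, so longer names are excluded.
print(matching("sid4?", TECHNOLOGIES))

# A pattern with an explicit suffix selects only the stream variants.
print(matching("sid4?_stream", TECHNOLOGIES))

# "*l?" accepts any prefix followed by "l" and one character (hypothetical model names).
print(matching("*l?", ["L4", "XL4"]))
```

Under these assumptions, the first call selects only SID4E and SID4C, which agrees with the snippet's remark that SID4E_STREAM, SID4C_STREAM and SID4CALIB are not included.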

Understand SPE technologies configuration file

…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

FAQs (PSP)

…license contains records for all required modules. See Licensing article for additional information in FAQ Phonexia Browser, FAQ Speech Platform, FAQ Voice Inspector Q: What are the requirements for a SID evaluation dataset? To evaluate a real-life scenario of Phonexia Speaker Identification technology, the system needs to be calibrated with a SID dataset. SID dataset (minimum requirements): To measure SID…

Phonexia Speech Engine

Phonexia Speech Engine (SPE) is the main part of Phonexia Speech Platform. SPE is a server application for 64-bit Linux or Windows, providing a REST API to the entire portfolio of Phonexia speech technologies. SPE capabilities overview: Audio files and stream processing Audio files RTP / HTTP streams Speaker Identification (SID) ✓ ✓ Speech To Text (STT) ✓ ✓ Keyword Spotting (KWS) ✓…

Understand SPE configuration

…with the help of environment variables) you can set up a more efficient deployment method. Simply un-comment this directive and set it up correctly. Btw, did I mention before that a hash sign (#) at the beginning of a line means “this is a comment”? # Set path to bsapi directory # bsapi.path = ${application.dir}bsapi ### # (c) 2013-2018 by Phonexia s.r.o….
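The effect of un-commenting a directive can be sketched as follows. This is a minimal illustration, not SPE's own configuration parser: it assumes `key = value` lines, treats lines starting with `#` as comments, and expands `${name}` placeholders from a supplied dictionary. The `/opt/spe/` value for `application.dir` is hypothetical.

```python
import re


def parse_config(text, variables):
    """Parse 'key = value' lines, skipping blank lines and '#' comments."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # comment or empty line: no directive here
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a 'key = value' line
        # Expand ${name} placeholders; unknown names are left untouched.
        value = re.sub(
            r"\$\{([^}]+)\}",
            lambda m: variables.get(m.group(1), m.group(0)),
            value.strip(),
        )
        settings[key.strip()] = value
    return settings


# The two lines quoted in the snippet, with the directive still commented out.
SAMPLE = """\
# Set path to bsapi directory
# bsapi.path = ${application.dir}bsapi
"""

# The same text with the directive un-commented.
UNCOMMENTED = SAMPLE.replace("# bsapi.path", "bsapi.path")

VARIABLES = {"application.dir": "/opt/spe/"}  # hypothetical install path

print(parse_config(SAMPLE, VARIABLES))
print(parse_config(UNCOMMENTED, VARIABLES))
```

While the directive is commented out, the sketch yields no settings; once the leading hash sign is removed, `bsapi.path` appears with the placeholder expanded.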

Download Speech Platform

…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…

Key Features (PSP)

…in the Languages Available section. Speech To Text (STT) and Keyword Spotting (KWS) languages Language Identification (LID) languages Supported Audio input The Speech Engine server supports various audio formats as listed in API reference > Audio requirements. It also supports the RTP/HTTP stream processing as listed in API reference > RTP/HTTP streams. The Speech Engine allows the usage of some…

Manuals

This section collects links or locations of manuals for specific Phonexia Speech Platform components. API Phonexia Speech Engine REST API – SPE – latest version manual online (api_reference.html for your version is located in doc subdirectory in SPE folder or distribution ZIP) Brno Speech Application Interface v3 – BSAPI3 – latest version manual online Applications and Tools Phonexia Browser –…