Search: speech%20%20%20%20%20ytics

127 results

Releases and Changelogs (SPE)

…recordings Fixed: Unable to initialize technologies when SPE is launched using UNC path on Windows Speech Engine 3.57 (Public release) Speech Engine 3.57.0, DB v1901, BSAPI 3.57.0 (2023-02-01) New: AGE XL4 and XL5 models (for compatibility with SID4 XL4 and XL5 voiceprints) Speech Engine 3.56 (Public release) Speech Engine 3.56.0, DB v1901, BSAPI 3.56.0 (2022-12-15) New: GID XL5 model (for…

Support Lifecycle Policy (PSP)

…N/A On project basis S1 (Beta) 2014-10 N/A On project basis O1 (Beta) 2014-10 N/A On project basis DENOISER EN_US1 (Beta) 2015-08 N/A On project basis CS_CZ1 (Beta) 2018-03 N/A On project basis Deprecated tech. models (not supported, after end-of-life). Technology Tech. model name Release Date End of Support GID O1 / GENERIC AGE L1 …

Releases and Changelogs (Browser)

Phonexia Browser is a tool for testing Phonexia speech technologies available via Speech Engine API. Releases Version Release Date End of Support Maintained Until Release type 3.60 2023-12-05 2025-06-01 n/a Public 3.59 2023-06-20 2025-01-01 n/a Public 3.58 2023-04-03 2024-10-01 n/a Public 3.57 2023-02-02 2024-08-01 n/a Public 3.56 2022-12-15 2024-06-01 n/a Public 3.55 2022-10-03 2024-04-01 3.60 Public 3.52 2021-07-01 2021-09-30 3.55…

Phonexia technology models EoL

…2020-10 6th gen. DIAR 5th gen. DIAR L1 (Beta) 2015-08 N/A On project basis S1 (Beta) 2014-10 N/A On project basis O1 (Beta) 2014-10 N/A On project basis DENOISER EN_US1 (Beta) 2015-08 N/A On project basis CS_CZ1 (Beta) 2018-03 N/A On project basis Deprecated tech. models (not supported, after end-of-life). Technology Tech. model name Release Date End of Support GID…

Speech To Text / Keyword Spotting supported languages

Languages supported by Speech To Text and Keyword Spotting Standard = Maintained until newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) Model name Released End of support Maintenance Arabic (Gulf, Kuwait) AR_KW_6 2022-04 8th gen. Standard Arabic (Levantine) AR_XL_6 2021-05 8th gen. Standard AR_XL_5 2020-08 7th…

Release Notes

Table of Contents Toggle Speech Platform release 3.60 New features and fixes Previous Releases Speech Platform Public Release Fall 2022 (SPE v3.55) Speech Platform public release Spring 2022 (SPE v3.50) Speech Platform public release Fall 2021 (SPE v3.45) Speech Platform release 3.60 Here is a summary of most important new features and fixes since last Public Release 3.55. New features…

Releases and Changelogs (VIN)

…Target score distribution Fixed: Population Set selected correctly even if renamed in the selection window Improved: Speech length display in the case view: added “Unlimited” option to display the speech length permanently Improved: SID Evidence score aligned with Speech Engine output of SID score Removed: Speech length compensation Voice Inspector 5.1 Voice Inspector 5.1.0, BSAPI 3.60.0 (2023-12-07) New: A generalized…

Understand SPE audio converter

…file format ‘C:\TMP\tmp9408aaaaaa’: BsapiException: SWaveFileI(1751): Corrupted WAVE file format: ‘C:\TMP\tmp9408aaaaaa’. 2021-01-30 20:49:26 [Trace] ConverterSubsystem: Converting C:\TMP\tmp9408aaaaaa -> C:\TMP\tmp9408baaaaa.wav 2021-01-30 20:49:27 [Debug] ConverterSubsystem: File C:\TMP\tmp9408aaaaaa has been converted. 2021-01-30 20:49:27 [Trace] ConverterSubsystem: Removed temporary file: C:\TMP\tmp9408aaaaaa 2021-01-30 20:49:27 [Trace] Data: Moving: ‘C:\TMP\tmp9408baaaaa.wav’ -> ‘D:\SPE\home\admin\storage\test1.wav’ 2021-01-30 20:49:27 [Trace] Data: Moved: ‘C:\TMP\tmp9408baaaaa.wav’ -> ‘D:\SPE\home\admin\storage\test1.wav’ 2021-01-30 20:49:27 [Trace] Data: File ‘/test1.wav’ registered in database…

Open Source Acknowledgement

…license speexdsp BSD stdlibc++, libgcc, libwinpthread (Windows only) GNU GPL with GCC Runtime Library Exception: License utfcpp BSL-1.0 xxhash-cpp https://github.com/RedSpah/xxhash_cpp/blob/master/LICENSE. – Connect your Github account BSD-2 Copyright: 2012-2020: Yann Collet, 2017-2020: Red Gavin zlib Zlib Phonexia BROWSER and Voice Inspector dependencies Library License ADVobfuscator GitHub – andrivet/ADVobfuscator BaseMatrixOps Apache License blaze BSD-3-Clause boost BSL-1.0 botan BSD-2-Clause bzip2 bzip2-1.0.8 cpp-httplib MIT…

STT: Results explained

This article aims on giving more details about Speech To Text outputs and hints on how to tailor Speech To Text to suit best your needs. In the process of transcribing speech, the Speech To Text technology usually identifies multiple alternatives for individual speech segments, as multiple phrases can have similar pronunciations, possibly with different word boundaries, e.g. “eight tea…

FAQs (PSP)

…for 20+ languages including English, French, German, Russian, Spanish and many more. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: What languages are supported by LID? A: Please see List of supported LID Languages. For more details, see LID technology documentation. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: How to fix the Error 1013: Unsupported: Server does…

Phonexia Voice Inspector EoL

Version Release Date End of Support Maintained Until Release type 5.2 2024-04-15 2027-12-31 2027-12-31 Public 5.1 2023-12-07 2027-12-31 2027-12-31 Public 5.0 2023-06-29 2027-12-31 v.5.1 Public 4.0 2019-12-12 2023-12-31 2023-12-31 Public 3.2 2018-03-16 2020-12-16 2020-12-16 Public 3.1 2016-10-24 2018-04-24 v.3.2 Public 3.0 2016-08-05 2018-02-05 v.3.1 Public 1.3 2015-06-04 2016-12-04 2016-12-04 Public…

SPE and Browser installation: standalone SPE

…Quality Estimation Stream [disabled] 17) Speech To Text [disabled] 18) Speech To Text Input Stream [disabled] 19) Time Analysis [disabled] 20) Time Analysis Stream [disabled] 21) Voice Activity Detection [disabled] 22) Voice Activity Detector Stream Technology [disabled] 23) Enable all 24) Disable all 0) Quit Choose technology to configure [0]:23 Select the option to Enable all technologies (usually the option…

Speaker Identification (SID)

…of data and 1:1 comparisons to evaluate evidence and to establish probability of the identity of a speaker and use it in court. How does it work? The technology is based on the fact that the speech organs and the speaking habits of every person are more or less unique. As a result, the characteristics (or features) of the speech…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…noise, technical signals like ringing, DTMF tones, etc). This metric is useful for finding performance on actual audio data coming into audio processing pipeline. Regular recording with Voice and Silence segments in waveform Net Speech based FtRT is conservative, purely technical number. It is calculated from only spoken speech data, i.e. with all non-speech parts (silence, noise, DTMF tones, etc.)…