Search: XL4

16 results

Download Speech Platform

…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…

Releases and Changelogs (VIN)

…5.0 requires a new license. To upgrade from version 4 or 3, please contact the Phonexia sales representative. Phonexia Voice Inspector 5.0 brings a Speaker Identification model XL5, that provides more accurate results for telephony data in comparison with previous generations of Speaker Identification models such as SID4 XL4. Users can observe that the SID4 XL5 model returns different values…

Speaker Identification

…There is always only one value, where FAR=FRR. This value is called the Equal Error Rate (EER) and is another standard metric of accuracy of biometrics technologies. In the next graph you can see how EER changes in relation to seconds of net speech used for different versions of our Speaker Identification. The current version is XL4. In different use…
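
The FAR=FRR operating point mentioned in the snippet above can be illustrated with a minimal sketch. It assumes two hypothetical lists of comparison scores (same-speaker "genuine" trials and different-speaker "impostor" trials) and scans candidate thresholds for the point where the false acceptance rate and the false rejection rate are closest. The function name and the sample scores are illustrative only and are not part of any Phonexia API.

```python
def equal_error_rate(genuine, impostor):
    """Return (threshold, eer) at the threshold where FAR and FRR are closest.

    genuine  -- scores of same-speaker comparisons (hypothetical data)
    impostor -- scores of different-speaker comparisons (hypothetical data)
    A trial is accepted when its score is >= threshold.
    """
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        # FAR: fraction of impostor trials wrongly accepted at this threshold
        far = sum(s >= t for s in impostor) / len(impostor)
        # FRR: fraction of genuine trials wrongly rejected at this threshold
        frr = sum(s < t for s in genuine) / len(genuine)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            # With finite data FAR and FRR may not meet exactly,
            # so the EER is approximated by their mean at the closest point.
            best = (gap, t, (far + frr) / 2)
    return best[1], best[2]


if __name__ == "__main__":
    genuine_scores = [0.9, 0.8, 0.7, 0.4]
    impostor_scores = [0.6, 0.3, 0.2, 0.1]
    print(equal_error_rate(genuine_scores, impostor_scores))
```

With real evaluation data the score lists would be much longer, and the threshold scan is typically replaced by interpolation along the error curve; the sketch only shows the definition.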

Speaker Identification (SID)

…-1.152915 1.393920 0.003471 0.708449 -0.201536 0.857028 -1.229550 2.583504 -0.342451 -1.118439 -0.415567 -1.564529 0.807927 -0.276171 -2.637402 -1.691306 -1.307633 2.275870 -0.847365 … Voiceprint comparison Any voiceprint created from at least 10 seconds of speech – latest generation of Phonexia SID lowers this requirement to 7, or even 3 seconds for XL4 model – of an unknown speaker can then be compared with…

Gender Identification (GID)

Gender Identification is a language-, domain- and channel-independent technology that uses the acoustic characteristics of the recording to determine the gender of the speaker in question. This technology is able to distinguish between two genders: Male (M) and Female (F). Minimum of speech signal for identification: 7+ sec recommended (with XL4 and L4 model (9+ sec for previous generation of…

Diarization tool for Orbis

…multichannel WAV audio, where each speaker speaks only in their own channel. Tool will automatically convert audio files to WAV format. For example, this recording: audio.wav channel_1 […111..222..11….22222..] Will be converted to audio.wav channel_1 […111…….11………..] channel_2 [……..222……..22222..] IMPORTANT: Tool doesn’t process any metadata. Resulting files should be uploaded into Orbis without metadata file! Tool uses Phonexia Diarization technology model XL4….

Support Lifecycle Policy (PSP)

…3 or 4 in the model name.   Other technology models (SID, LID, GID, DIAR, AGE, SQE, VAD, DENOISE) Tech. models supported (generation specified by number in “Tech. model name”). Technology Tech. model name Released End of support Maintenance SID4 XL5 2022-09 6th gen. SID 5th gen. SID XL4 2020-03 6th gen. SID 5th gen. SID L4 2019-02 6th gen….

What is User configuration file and how to use it

…name User configuration file name stt_cs_cz_5_online.bs stt_cs_cz_5_online.bs.usr kws_nl_nl_5.bs kws_nl_nl_5.bs.usr phnrec_pashto.bs phnrec_pashto.bs.usr vpextract4_xl4.bs vpextract4_xl4.bs.usr During technology initialization (e.g. during Speech Engine startup), the initialization routine checks for the existence of such a user config file. If found, it’s automatically loaded after loading the main configuration file, and the settings from the user config are automatically applied over the settings from the main configuration file. Usage…
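
The naming convention and load order described in the snippet above can be sketched as follows. This is only an illustration: the helper names are hypothetical, and settings are modeled as plain dictionaries because the snippet does not show the actual file syntax.

```python
from pathlib import Path


def user_config_path(main_config):
    """Derive the user config file name by appending '.usr' to the main name."""
    p = Path(main_config)
    return p.with_name(p.name + ".usr")


def effective_settings(main_settings, user_settings):
    """Apply user settings over main settings (user values win on conflict).

    Both arguments are plain dicts here; this is an assumption made
    for the sketch, not the real configuration format.
    """
    merged = dict(main_settings)
    merged.update(user_settings)
    return merged


if __name__ == "__main__":
    print(user_config_path("vpextract4_xl4.bs").name)
    print(effective_settings({"a": 1, "b": 2}, {"b": 3}))
```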

Orbis 1.4.0 Release Notes

Newest generation of Speaker Identification technology added Speaker identification technology verifies and authenticates speakers in seconds. The new generation has increased accuracy by 1 percentage point (a relative improvement of 33 %) – XL5 model vs. XL4 model that was previously in Orbis. The processing speed of the XL5 model is the same or faster than that of the XL4

SPE and Browser installation: standalone SPE

…nr. 23) 1) Age Estimation [active model: XL5(1x)] 2) Denoiser Technology [active model: EN_US(1x)] 3) Diarization [active model: XL4(1x)] 4) Gender Identification [active model: XL5(1x)] 5) Keyword Spotting [active model: EN_US_6(1x)] 6) Phoneme Recognition [active model: EN_US_6(1x)] 7) Keyword Spotting Stream [active model: EN_US_6(1x)] 8) Language Identification LanguagePrint Comparator [active model: L4(1x)] 9) Language Identification LanguagePrint Extractor [active model: L4(1x)]…

Understand SPE executable files

…that all the technologies/models are available in that SPE installation, this command adds(*) the following to the technologies configuration file: SIDE_STREAM for both L3 and XL3 model, 3 instances of each SIDC_STREAM for both L3 and XL3 model, 3 instances of each SID4E_STREAM for both L4 and XL4 model, 1 instance of each SID4C_STREAM for both L4 and XL4 model,…

Phonexia technology models EoL

…4th generation models, typically marked with a number 1, 2, 3 or 4 in the model name.   Other technology models (SID, LID, GID, DIAR, AGE, SQE, VAD, DENOISE) Tech. models supported (generation specified by number in “Tech. model name”). Technology Tech. model name Released End of support Maintenance SID4 XL5 2022-09 6th gen. SID 5th gen. SID XL4 2020-03…

Recommended OS and HW (PSP)

…or 10th Gen Intel® Core Processor RAM: 16 GB Storage: 100 GB (depends on audio retention policy) SSD strongly recommended for superior performance over HDD Configuration includes: SID4 XL4, GID XL4, LID L4, AGE L4, VAD, SQE Transcription System, basic 100 hours/day package (***) files processing CPU: 8 physical cores, 1x Intel® Xeon E5-2640 v4 or similar or 10th Gen…

Understand SPE technologies configuration file

…of XL4 model <?xml version="1.0"?> <technology_subsystem_settings> <technologies> <item> <name>STT</name> <models> <item> <name>SK_SK_5</name> <n_instances>8</n_instances> <config_file /> </item> </models> </item> <item> <name>STT_STREAM</name> <models> <item> <name>CS_CZ_6</name> <n_instances>2</n_instances> <config_file /> </item> </models> </item> <item> <name>SID4E</name> <models> <item> <name>L4</name> <n_instances>2</n_instances> <config_file /> </item> <item> <name>XL4</name> <n_instances>3</n_instances> <config_file /> </item> </models> </item> <item> <name>SID4C</name> <models> <item> <name>L4</name> <n_instances>2</n_instances> <config_file /> </item> <item> <name>XL4</name> <n_instances>3</n_instances> <config_file />…
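
To make the structure of the truncated XML above easier to read, here is a minimal sketch that lists the configured instance count per technology and model. The sample document reuses only the SID4E entry from the snippet; because the original is cut off, the closing tags are assumed to mirror the opening ones. The function name is hypothetical.

```python
import xml.etree.ElementTree as ET

# Well-formed fragment modeled on the snippet; closing tags are an assumption.
SAMPLE = """<?xml version="1.0"?>
<technology_subsystem_settings>
  <technologies>
    <item>
      <name>SID4E</name>
      <models>
        <item><name>L4</name><n_instances>2</n_instances><config_file /></item>
        <item><name>XL4</name><n_instances>3</n_instances><config_file /></item>
      </models>
    </item>
  </technologies>
</technology_subsystem_settings>
"""


def instances_by_model(xml_text):
    """Map (technology name, model name) to the configured number of instances."""
    root = ET.fromstring(xml_text)
    result = {}
    for tech in root.findall("./technologies/item"):
        tech_name = tech.findtext("name")
        for model in tech.findall("./models/item"):
            key = (tech_name, model.findtext("name"))
            result[key] = int(model.findtext("n_instances"))
    return result


if __name__ == "__main__":
    for (tech, model), count in instances_by_model(SAMPLE).items():
        print(f"{tech} {model}: {count} instance(s)")
```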

Releases and Changelogs (SPE)

…destination address on some OSs Speech Engine 3.45.0, DB v1800, BSAPI 3.45.0 (2021-10-06) New: Added 6th generation of EN_US and EN_US_A STT (KWS/PHNREC will be added in one of the upcoming updates) New: Added XL4 model for GID (for compatibility with SID4 XL4 voiceprints) New: STT preferred phrases v2 with ability to dynamically add words to language model (currently in…