Skip to content Skip to main navigation Skip to footer

Search: Audio%20Source%20Profile

91 results

Download Speech Platform

…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) ) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…

Testing possibilities

…(GSM, VoIP,…) Microphone placement (close-field vs. far-field) Audio quality Formats Codecs Background noise Geological locations Age distribution Style of speech Monolog vs. dialog Reading a text vs. live conversation In some of the scenarios mentioned above, it is quite difficult to assure all of these requirements, that is the reason why the best option for accuracy testing is definitely in…

STT: Language Model Customization tutorial

Language Model Customization tool (LMC) provides a way to improve the Speech To Text performance by creating customized language model. Language model is an important part of Phonexia Speech To Text. In a simplified way it can be imagined as a large dictionary with multiple statistics. The Speech To Text technology uses this dictionary and statistical model to convert audio

Time Analysis Extraction (TAE)

…dialogue. This can be used to improve calls between operators and callers or to indicate potential stress points in phone calls, for example, change of speech speed during the conversation). Input TAE can process both audio files and streams (for format details see Speech Engine documentation). By its nature, TAE is usable mainly on two channel phone calls recordings, where…

Orbis 1.1.0 Release Notes

…case number, suspects etc. Maximal speech length for the voiceprint extraction To optimize performance, we used a constraint on the total length of speech captured during voiceprint extraction. This allows the Orbis system to perform more extractions per minute, especially for the longer audio recordings. Only one channel processing To optimize performance, the option of processing only one channel (out…

What is User configuration file and how to use it

Advanced users with appropriate knowledge (gained e.g. by taking the Phonexia Academy Advanced Training) may want to finetune behavior of the technologies to adapt to the nature of their audio data. Modifying original BSAPI configuration files directly can be dangerous – inappropriate changes may cause unpredicatble behavior and without having a backup of the unmodified file it’s difficult to restore…

Releases and Changelogs (VIN)

…can more accurately model a skewed distribution. Such skewed distributions are being produced by the modern highly accurate speaker identification system of Phonexia. Fixed: Audio files with Unicode characters in their name can be also opened on Windows. Changed: The histogram bins in the probability density function plot are now normalized. Changed: The case description (field in the information table…

Proof of Concept

…client or vendor of the PBX) with Phonexia consultations (two hours) Setting up PBX to provide an audio source for Phonexia Voice Verify Three days – the estimated time to be spent by the partner on this step PBX administrator, based on Phonexia instructions (two hours) CRM/contact center software integration Understand the Voice Verify API, including sandbox tests Two days…

Download Voice Inspector 5.1

…models VIN application (graphical user interface, GUI) with the following technologies in-build Speaker Identification (SID4_XL5) Speaker Diarization (DIAR) Voice Activity Detection (VAD) Speech Quality Estimator (SQE) Phoneme Recogniser (PHNREC) example population sets and audio (in ./examples/) and example report templates (in ./templates/) Hardware requirements minimum – CPU: Intel® Core™ i5, RAM: 4 GB, Required HDD space: 0.5 GB for software…

Documentation (SPE)

…files in [SPE]/doc in standard software package and installation. You can also find REST API reference (Speech Engine) documentation online. You might be interested in reading the following information in manual: REST API reference Structure of API queries Asynchronous request Task prioritization Authentication Audio requirements RTP/HTTP streams Error responses API Commands Usage examples API Requirements Installation guide And much more…

Orbis Hardware Requirements

…for Orbis VM Memory 32 GB for Orbis VM Disk space 80 GB SSD typical installation of the Orbis does not exceed 20 GB depending on the size, length, and retention policy of your files, you may want to allocate more space* *1 MB of uploaded audio = 2.7 MB of storage needed Virtual Platform (hypervisor) minimal: VirtualBox 6.1.30 64-bit…

Understand SPE workers configuration

…no one can really speak faster than realtime 😉 – so a single physical CPU core can actually process multiple realtime tasks simultaneously, depending on how much faster than realtime a particular technology is (and also how much speech the audio contains). This means that for stream processing technologies it makes sense to configure higher number of workers than physical…

Q: How can we test Phonexia technologies?

We can prepare a testing package for you with full functionality of all technologies. The license validity is 90 days to allow you to test the technologies. Note: by default a NET license is provided for testing. This license needs to have active Internet connection to a phonexia licensing server in order to function. Rest assured no data – audio,…