Skip to content Skip to main navigation Skip to footer

Search: process

63 results

Understand SPE configuration

…0022 Data storage and multithread settings The home directory of SPE contains all user data including audio recordings and metadata files from speech processing (speaker models, description etc.). This is another good example of using environment variables if your topology design requires multiple instances of SPE processing the same payload. This is great for sharing raw data between multiple physical…

Privacy Policy

…personal data, to request blocking, correction, supplementing, or erasure of your personal data; further, you have the right to appeal directly to the relevant Data Protection Authority. You can at any time request information about the processing of your personal data, and we will provide such information to you. In addition, you may revoke your consent allowing us to process

Releases and Changelogs (Browser)

…wizard for creating SID calibration set [#5047] Added tracers to SID evaluation wizards which are synchronized across all graphs [#5112] Added FAR an FRR to tracer labels to Error rate and PDF plots in SID evaluation graphs [#5050] Presentation mode in graphs in SID evaluation [#5141] Added limitation of processing time or processed files in SID evaluation/calibration wizards [#5131] Calibration…

Designing and Developing Application

Before designing and developing the application, we encourage Partner to find clear answer for the following questions: Customer requirements: Do my customers need file processing (audio) or stream processing in real time? What is the human power of the customer that can analyze the results? How many minutes per day or streams in parallel do my customer need to process?…

Recommended OS and HW (PSP)

…or 10th Gen Intel® Core Processor RAM: 16 GB Storage: 100 GB (depends on audio retention policy) SSD strongly recommended for superior performance over HDD Configuration includes: SID4 XL4, GID XL4, LID L4, AGE L4, VAD, SQE Transcription System, basic 100 hours/day package (***) files processing CPU: 8 physical cores, 1x Intel® Xeon E5-2640 v4 or similar or 10th Gen…

SPE and Browser installation: embedded SPE

In this post, we break down the complexities of the initial installation process. By the end of the guide, you will be able to start processing your recordings with Phonexia Speech Technologies. 1. Download Evaluation package Download the Phonexia Evaluation package from https://partner.phonexia.com/kb/sp/speech-platform/evaluation-package/ Create a new directory and unzip the package into it in your desired location, for ex. C:/Phonexia/…

SID4 performance on Intel® Xeon® Platinum 8124M

…SID4 instances, shows how many parallel SID4 processes were initiated in parallel. Description: X-axis shows how many SID4 instances were activated in parallel processing Blue bar shows total performance based on RAW recordings length in data set Orange bar shows recalculated performance based on “Net_Speech” length calculated from original recordings in data set. How to understand the results context As…

Speaker Identification (SID)

…of acoustic features from a recording of a known speaker. The process continues with the creation of a speaker model which is then transformed into a small but highly representative numerical representation called a voiceprint. During this process, the SID technology applies state-of-the-art channel compensation techniques. The voiceprint is a fixed-length matrix which captures the most unique characteristics of a…

Voice Activity Detection (VAD)

…VAD is usually part of rapid filtration process in deployment. Typical use cases are: detection of present or absent human speech for voice processing, filtering non-speech parts of the recording, filtering out recordings with not enough net speech to be processed by other technologies voice activated process, etc. The speed of Voice Activity Detection is 140 ftRT per one instance….

Speech Quality Estimation (SQE)

…of bits used by the waveform absolute value if less than 8, the signal has insufficient quality wfilter_technical_signal_length – the length of technical signals (tones, wide-band noise, etc.), measured in seconds Processing speed Approx. 2,000x faster than real-time processing on 1 CPU core i.e. standard 8 CPU core server processes 384,000 hours of audio in 1 day of computing time…

STT: Configuring word detection parameters for stream transcription

…of the signal going to the decoder. Decoder is a component, which determines what a particular part of the signal contains (speech, silence, etc.). Based on that, decoder also decides whether segment has finished or not. Unlike in file processing (where it’s possible to analyze any part of the file), in realtime processing we do not see into the future,…

Age Estimation (AGE)

…coding), A-law or Mu-law, PCM, 8kHz+ sampling Voiceprints: AGE L4 model supports SID4 L4 voiceprints; legacy AGE models support voiceprints created by AGE itself Output Log file with processed information (age estimate) Processing speed Approx. 20x faster than real-time processing on 1 CPU core i.e. standard 8 CPU core server processes 3,840 hours of audio in 1 day of computing…

Phonexia End User License Agreement

…adjust it to fit the technical requirements of processing the Provided demo data. Phonexia will encrypt all Provided demo data before transmitting or distributing for processing purposes. The Client using Web demo license explicitly permits Phonexia to take these actions. 4.4 The Client agrees that the Provided demo data is not endorsed by Phonexia. All Provided demo data content is…

Understand SPE home directory

…that access rights are configured appropriately – e.g. the external process putting the files to the SPE storage might be running under different user context than the SPE process, making the files inaccessible for the SPE process… which might lead to obscure errors. Data The data directory holds additional data files for entities created by that user – e.g. SID…