Phonexia’s Speech Quality Estimation quantifies the acoustic quality of recordings. This helps the user to quickly determine whether the acoustic quality of a recording is good for processing with other speech technologies or not. As an answer for SQE, the SPE returns a json/xml file. This file includes general information about the technology and statistics of all (one or two)…
Search: sqe
23 results
…set source port for output stream New: Added SQE technology on stream New: Added Perceptual Evaluation of Speech Quality (PESQ) score estimation to SQE results New: Following word classes are transcribed more accurately in 6th generation of CS_CZ STT male/female first name and surname municipality street Fixed: LMC may use wrong paths on Windows platform Improved: Removed + symbol from…
…this new generation in built-in VAD in STT CS_CZ_6 (Czech language), we see increased accuracy (WAcc) by approx. 2% absolute. The implementation into other tech. models of STT (i.e., languages) will follow. It does not influence the *ID technologies. SQE: Added Perceptual Evaluation of Speech Quality (PESQ) score estimation The PESQ estimation was added as another available metric of SQE….
…or 10th Gen Intel® Core Processor RAM: 16 GB Storage: 100 GB (depends on audio retention policy) SSD strongly recommended for superior performance over HDD Configuration includes: SID4 XL4, GID XL4, LID L4, AGE L4, VAD, SQE Transcription System, basic 100 hours/day package (***) files processing CPU: 8 physical cores, 1x Intel® Xeon E5-2640 v4 or similar or 10th Gen…
…when an error occurs, but view all errors and continue creating the evaluation set Fixed: SID Evaluator – invalid GID score values Fixed: SID Evaluator – missing SQE information in report Fixed: SID Evaluator – don’t save disabled recordings to evaluation set Phonexia Browser 3.40.3, BSAPI 3.40.4 (2021-05-28) Fixed: Some minor bugs in licensing system Phonexia Browser 3.40.2, BSAPI 3.40.2…
…4th generation models, typically marked with a number 1, 2, 3 or 4 in the model name. Other technology models (SID, LID, GID, DIAR, AGE, SQE, VAD, DENOISE) Tech. models supported (generation specified by number in “Tech. model name”). Technology Tech. model name Released End of support Maintenance SID4 XL5 2022-09 6th gen. SID 5th gen. SID XL4 2020-03…
This part requires higher (and non-anonymous) access level.
How to solve this situation:
- Log in here if you are not logged in.
- Register here. It takes just a few clicks and it’s free.
…results – file, used technology model, used speaker model, used FAR calibration set, max. FAR, results JSON data rest_result_sid4 SID4 processing results – file, used technology model, used speaker model, used file- and speaker model Audio Source Profile, results JSON data rest_result_sqe SQE processing results – file, used technology model, results JSON data rest_result_stt STT processing results – file, used…
…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…
MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…
…✓ Voice Activity Detection (VAD) ✓ ✓ Time Analysis Extraction (TAE) ✓ ✓ Speech Quality Estimation (SQE) ✓ ✓ Language Identification (LID) ✓ Gender Identification (GID) ✓ Age Estimation (AGE) ✓ Speaker Diarization (DIAR) ✓ Results caching Processing results can be optionally stored in results cache database to speed up eventual re-processing of the same recordings by the same technology…
…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) ) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…
…technologies setup. If we assume that the whole machine is dedicated as a “speech computing unit” then, in general, we can calculate it as follows: file: phxspe.properties server.n_workers = <#_of_core> file: technologies.xml (no. of threads per technology, can be also set up by the phxadmin tool) SQE: <#_of_cores>/4 VAD: <#_of_cores>/2 other technologies: <#_of_cores> RAM: 8 cores = 32 GB 16…
…physical server, configure your technologies.xml to the following number of instances: SQE: <#_cpu_cores>/4 VAD: <#_cpu_cores>/2 any other technology: <#_cpu_cores> (Note: your license should also be configured properly. Ask our Sales department for cooperation in case of hot-load evaluation tests. The production license will be configured with our assistance, of course) Optimal RAM recommendation: 4 cores: 16 GB RAM 8 cores:…
…3 or 4 in the model name. Other technology models (SID, LID, GID, DIAR, AGE, SQE, VAD, DENOISE) Tech. models supported (generation specified by number in “Tech. model name”). Technology Tech. model name Released End of support Maintenance SID4 XL5 2022-09 6th gen. SID 5th gen. SID XL4 2020-03 6th gen. SID 5th gen. SID L4 2019-02 6th gen….