Skip to content Skip to main navigation Skip to footer

Search: process

84 results

Support

…the Product partially functional, the use of which in a production environment is substantially reduced. The Issue contains an error that impairs the ability of the system to process a majority of audio files or audio streams, or that renders the setup and maintenance of the system inoperable. Permalink Critical Issue The system is inoperative, and it has a critical…

FAQs (Browser)

…= ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 Important note: By design and saving computing resources ‘audio converter’ is not used if INPUT file ends with the extension .wav. In that case you must pre-process the audio recording before uploading it to the Phonexia SPE or using it in the Phonexia…

Q: How to fix Error 1007: Unsupported audio format?

…%2 is for output file ffmpeg example: audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 Important note: By design and saving computing resources ‘audio converter’ is not used if INPUT file ends with the extension .wav. In that case you must pre-process the audio recording before uploading it to the…

Understand SPE audio converter

…following: it’s one of the natively supported formats – then SPE simply continues the processing it’s one of “internally recognized”, but not natively supported formats (e.g. MP3 audio) – then if converter is enabled, SPE tries to convert the file if converter is disabled, upload ends up with error it’s some “internally unrecognized” format – this causes error during format…

What is User configuration file and how to use it

…working state. User configuration files provide a way to override processing parameters without modifying original BSAPI configuration files. WARNING: Inappropriate configuration changes may cause serious issues! Make sure you really know what you are doing. User configuration file is a plain text file with the same name as main configuration file, with additional extension .usr. For example: Main configuration file…

Language Identification (LID)

…actually expected in your use case. This process of tailoring the language pack for particular needs is called language pack adaptation and is described in LID: Terminology and adaptation article. Example usages of custom language packs Law enforcement agency monitoring a network of criminals using only a particular set of languages can use the approach of keeping only languages expected…

STT: What is Preferred Phrases feature and how to use it

…it can help in other applications, too – e.g. when transcribing domain-specific audios, the frequently used domain-specific phrases can be boosted. How preferred phrases work The picture below shows a simplified standard speech transcription process – the digitized speech signal spectrum is analyzed in the neural network acoustic model (which describes the pronunciations of a given language) and goes into…

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

…Evaluation Package Evaluation package (download page) is consisting of Phonexia Browser and Phonexia Speech Engine including all necessary technologies. 2. Data We prepared the dataset for your testing. Package contains data for speaker model creation and speaker spotting too. The process of testing is the same for the data set collected by the user himself. Dataset is available to download…

About Phonexia Orbis

…of the audio and many others. Hit Feature Users can specify the rules for highlighting audio based on desired criteria. The solution marks such recordings as a Hit, and a user can process these as a priority. Speaker clustering While working with loads of audio files, user may use clustering feature to analyse the files automatically and group the individual…

SID: Speaker Identification: Results Enhancement

Speaker Identification (SID) Results Enhancement is a process that adjusts the score threshold for detecting/rejecting speakers by removing the effect of speech length and audio quality. This is achieved by use of Audio Source Profiles, that represent as closely as possible the source of the speech recording (device, acoustic channel, distance from microphone, language, gender, etc.). Although the out-of-the-box system…

Orbis 1.0.0 Release Notes

…performance issues. Solution: This limitation will be removed in the Orbis v1.2.0. Recording upload and processing After upload the “In progress” status remains until the page is refreshed. This is a known UI bug. Solution: Please, refresh the list by changing the page or refresh (F5) the whole page (after the Upload progress is 100%). Recording metadata formats Orbis doesn’t…

Orbis 1.4.0 Release Notes

Newest generation of Speaker Identification technology added Speaker identification technology verifies and authenticates speakers in seconds. The new generation has increased accuracy by 1 percentage point (a relative improvement of 33 %) – XL5 model vs. XL4 model that was previously in Orbis. The processing speed of the XL5 model is the same or faster than that of the XL4…

Video – Filtering and supporting technologies

MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…