Search: channels

9 results

Speech Quality Estimation (SQE)

…channels. The statistics of all channels include the numbers for many aspects of recording quality, and the overall global score. Technology The technology is language-, accent-, text-, and channel-independent Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input Input format for processing: WAV or RAW (8 or 16 bits…

Phonexia Speech Engine

…audio manipulation SPE has built-in basic audio file manipulation functionality, such as separating individual channels from stereo recordings, cutting one audio file into several files, saving audio from an incoming stream to a file, and others. Stream audio player To support voicebot scenarios, SPE can play audio files directly to an output RTP stream External Text-to-speech (TTS) integration Easy integration with external TTS…

Speaker Diarization (DIAR)

…silence as well. The outputs of the technology can be log files with labels and/or split audio files or one new multichannel audio file. Typical use cases: Preprocessing for other speech recognition technologies, labeling the parts of the utterance according to the speakers, splitting telephone conversations recorded in mono into several channels, identifying how many speakers are speaking in the recording…

Releases and Changelogs (VIN)

…1.3 2015-06-04 2016-12-04 2016-12-04 Public Changelogs Voice Inspector 5.2 Voice Inspector 5.2.0, BSAPI 3.61.0 (2024-04-04) New: New Case wizard checks for presence of Questioned and Reference recordings New: Number of audio channels is displayed in Case view Recording details view Score table view Report Fixed: Application crash with phoneme search Fixed: Generalized logistic distribution for Suspected speaker vs. Suspected speaker…

Releases and Changelogs (SPE)

…es-XA language in LID model L4 default language pack (es-XA7 -> es-XA) Fixed: Time Analysis segfaults on audio with 3+ channels Fixed: vpextract_s_calib.bs config file not working Fixed: WebSocket reply to PING control frame does not follow the protocol specification + all changes included in Feature Preview releases 3.31 and 3.32 (see below) NOTE: Due to the change in STT…

LID: Terminology and adaptation

…to train a language using just a few long audio files (like 5 files, 1 hour each) Acoustic channels should be as close as possible to the channel of the intended deployment Adaptation using REST API (SPE 3.38 or newer) SPE 3.38 and newer include LID adaptation tasks in REST API, which makes the adaptation significantly easier than in previous versions…

Understand SPE audio converter

…2021-01-30 20:49:27 [Trace] Rest.Object.AudioFile: [RID=2] Response HTTP: 200 JSON reposnse (all OK) ====================== { "result" : { "version" : 3, "name" : "AudioFileInfoResult", "info" : { "name" : "test1.wav", "last_modified" : "2021-01-30T19:49:27Z", "created" : "2021-09-27T18:16:59Z", "size" : 12800718, "is_directory" : false, "is_registered" : true, "frequency" : 8000, "length" : 400.02, "n_channels" : 2, "format" : "lin16" } } } SPE…

Time Analysis Extraction (TAE)

Technology description Time Analysis Extraction (TAE) by Phonexia extracts base information from dialogue in a recording, providing essential knowledge about conversation flow. That makes it easy to identify: long reaction time, crosstalk, responses of speakers in both channels, speed of speech measured in phonemes per second Typical usage domain It is typically used in contact centers for indicating weak moments in…

Input audio quality

…magically restore the information already lost during the original compression. No point trying that. 1 The joint-stereo encoding – which is commonly used by default in MP3 encoders – is tailored for usage with music audio, where both channels usually contain almost the same signal. Using joint-stereo encoding for telephony stereo, where each channel contains a completely different signal (when one…