
Search: file format

33 results

Q: How can I tell in which format the .wav file is?

A: Run ffprobe <file_name>; it prints information about the file, including its format. Note that the “ffprobe” utility is not included in our package(s). It is part of ffmpeg (https://ffmpeg.org/ffprobe.html) and must be installed separately.
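As a rough illustration of the answer above, the same check can be scripted. This is a minimal sketch, assuming ffprobe is installed and available on PATH; the helper names (build_ffprobe_command, summarize_audio, probe_file) are hypothetical and not part of any Phonexia package. It asks ffprobe for JSON output and keeps only the fields that describe the audio encoding.

```python
import json
import subprocess


def build_ffprobe_command(path):
    """Return the ffprobe argument list that prints container and stream info as JSON."""
    return [
        "ffprobe",
        "-v", "error",       # suppress the banner, report errors only
        "-show_format",      # container-level information
        "-show_streams",     # per-stream information (codec, sample rate, channels)
        "-of", "json",       # machine-readable output
        path,
    ]


def summarize_audio(probe):
    """Pick the fields describing the audio encoding out of ffprobe's JSON output."""
    container = probe.get("format", {}).get("format_name")
    summaries = []
    for stream in probe.get("streams", []):
        if stream.get("codec_type") != "audio":
            continue
        summaries.append({
            "container": container,
            "codec": stream.get("codec_name"),
            # ffprobe reports the sample rate as a string, so convert it
            "sample_rate_hz": int(stream["sample_rate"]) if "sample_rate" in stream else None,
            "channels": stream.get("channels"),
        })
    return summaries


def probe_file(path):
    """Run ffprobe on `path` (assumes ffprobe is on PATH) and summarize its audio streams."""
    result = subprocess.run(
        build_ffprobe_command(path),
        capture_output=True, text=True, check=True,
    )
    return summarize_audio(json.loads(result.stdout))
```

Calling probe_file("recording.wav") (a placeholder file name) would return one dictionary per audio stream; the codec field is the part that tells apart, for example, linear PCM from a compressed encoding stored in a .wav container.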

Phonexia Speech Engine

…– results are then returned immediately from the cache instead of complete re-processing of the audio file. Own persistent data storage: SPE keeps uploaded audio files in its own persistent storage space, so the original source files can be archived or deleted after upload. Data privacy: SPE keeps information about audio file or stream only as long as the file

Understand SPE metafiles

…DELETE methods to upload, download or delete any kind of file with metadata of your choice, associated with the corresponding SPE entity. There are no limits on the content of the metafiles, their names, etc. (apart from those imposed by the underlying operating system and/or filesystem). Plain text files, structured formats like JSON or XML, pictures, documents, multimedia files… you…

SPE and Browser installation: standalone SPE

…to start processing your recordings with Phonexia Speech Technologies. 1. Download Evaluation package Download the Phonexia Evaluation package from https://partner.phonexia.com/kb/sp/speech-platform/evaluation-package/ Simply unzip the package to your desired location. Ideally avoid C:/Program Files as you may face issues later on with privileges. 2. Save license.dat file Copy the license.dat file to the /SPE/ directory. Make sure the license.dat file is not…

SPE and Browser installation: embedded SPE

…Windows: avoid C:/Program Files/ as you may face issues later on with privileges 2. Save the license.dat file Copy the license.dat file to the /SPE/ directory. Make sure the license.dat file is not altered in any way or renamed. The license is provided upon request by Phonexia sales representative. If you do not have it, contact our sales to arrange…

Key Features (PSP)

…audio conversion tools. Tested with sox or ffmpeg. For the configuration of this functionality, see [SPE]/settings/phxspe.properties Note: You should be aware that audio format conversion (e.g., if the original audio format is highly compressed) can decrease the accuracy of speech technologies. Integration Possibilities Phonexia Speech Platform can be integrated into a partner’s application using the Speech Engine component (REST API)….

Phoneme Recogniser (PHNREC)

Phonexia Phoneme Recogniser (PHNREC) converts speech signals into pronunciation characters (so called phonemes). After the conversion, the pronunciation (text) can be easily indexed and searched by third party text data mining tools. The technology is optimized for noisy recordings and colloquial speech, can process audio files as well as audio streams and can provide results in several output formats. Phoneme…

Release Notes

…Useful when symmetric RTP communication is needed Added /doc endpoint for serving REST API documentation in HTML format Get API documentation for your particular SPE version remotely, without physical access to SPE installation files Phonexia Browser updates We provide Phonexia Browser (a component of Speech Platform) for the basic evaluation of speech technologies. This is to help with the first…

Quick Start Guide (VIN)

…licensed does not need Internet connection. Copy a license file (license.dat, obtained from Phonexia) to the application’s root directory (e.g., next to VoiceInspector (Linux) / VoiceInspector.exe (Windows); the file should be copied to the VIN folder before running the “VoiceInspector (Linux) / VoiceInspector.exe (Windows)” executable file). Run VoiceInspector (Linux) / VoiceInspector.exe (Windows) To access this manual, press F1 or select…

Speech to Text (STT)

About STT Phonexia Speech to Text (STT) converts speech in audio signals into plain text. The technology works with acoustics as well as a dictionary of words, an acoustic model, and pronunciation. This makes it dependent on language and dictionary – only a limited set of words can be transcribed. As input, an audio file or stream is needed, together with selection of…

Speaker Identification (SID)

…speaker’s voice. It cannot be used to recreate the original audio file which is useful when the content has to stay anonymous. The recommended minimum amount of net speech for enrollment is approx. 30 seconds (latest generation of Phonexia SID lowers this requirement to 20 seconds). Voiceprints can then be stored in a database in the form of binary blobs,…

Speech Quality Estimation (SQE)

Phonexia’s Speech Quality Estimation quantifies the acoustic quality of recordings. This helps the user to quickly determine whether the acoustic quality of a recording is good for processing with other speech technologies or not. As an answer for SQE, the SPE returns a json/xml file. This file includes general information about the technology and statistics of all (one or two)…

Waveform Denoiser (DENOISER)

…software cannot remove unwanted speech or music in the background. Denoiser is used to remove noise from the recording and at the same time to amplify the speech signal for: Better intelligibility when listening by people (recommended use), Achieving better results with automatic speech recognition technologies (necessary to test on customer data first). Input: audio file (format details – see…

Time Analysis Extraction (TAE)

…dialogue. This can be used to improve calls between operators and callers or to indicate potential stress points in phone calls (for example, a change of speech speed during the conversation). Input: TAE can process both audio files and streams (for format details see Speech Engine documentation). By its nature, TAE is usable mainly on two-channel phone call recordings, where…

Keyword Spotting (KWS)

…experts. Typical use cases: Call centers: increase operator and supervisor efficiency by searching calls; identify inappropriate expressions from operators; check marketing campaigns with automatic script-compliance control. Mass media and web search servers: index and search multimedia by keyword; route multimedia files and streams according to their content. Security/defense: maintain fast reaction times by routing calls with specific content to human…