Search: audio format

31 results

Q: How to fix Error 1007: Unsupported audio format?

The Phonexia Browser application may return the error “1007: Unsupported audio format” when uploading an audio file. First check whether your audio files are in one of the formats listed in Q: What are the supported audio formats? If you need to use recordings in other formats as input, you can configure SPE to convert audio automatically. As a prerequisite, install an external audio conversion tool. The recommended tool is…

Q: What are the supported audio formats?

…configured to do this conversion automatically in the background; see the Understand SPE audio converter article. Good tools for converting unsupported formats to supported ones are FFmpeg (http://www.ffmpeg.org) and SoX (http://sox.sourceforge.net/). Both are multiplatform software tools for Microsoft Windows, Linux and Apple OS X. Example of usage (FFmpeg): ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav This command converts any supported format/codec audio file to normalized…
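
The FFmpeg call quoted in this snippet can also be driven from a script. The following is a minimal Python sketch, assuming ffmpeg is installed and on the PATH; the function names build_ffmpeg_command and convert_to_wav are hypothetical helpers for illustration and are not part of SPE or FFmpeg.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(source: str, output_base: str) -> list[str]:
    """Return the argument list for: ffmpeg -i <source> <output_base>.wav"""
    return ["ffmpeg", "-i", source, f"{output_base}.wav"]


def convert_to_wav(source: str, output_base: str) -> Path:
    """Run FFmpeg and return the path of the WAV file it was asked to write."""
    command = build_ffmpeg_command(source, output_base)
    # check=True raises CalledProcessError if ffmpeg exits with a non-zero status.
    subprocess.run(command, check=True)
    return Path(f"{output_base}.wav")
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with file names that contain spaces.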

Understand SPE configuration file

…support is enabled or disabled. By default it’s enabled.
server.audio_formats.flac.enabled
# Enable or disable native support for FLAC audio format (Default: true)
server.audio_formats.flac.enabled = true
Controls whether the FLAC audio format support is enabled or disabled. By default it’s enabled.
audio_converter.enabled
# Enable or disable audio converter
audio_converter.enabled = false
Controls whether support for automatic audio format conversion is enabled…

Understand SPE audio converter

SPE directly supports a limited list of audio formats (codecs and containers); see the Supported audio formats FAQ. Other audio formats must be converted using external tools. This conversion can be done either completely outside of SPE, before passing the files to SPE, or you can set up SPE to convert the files automatically. Then, depending on the capabilities of the conversion…

Input audio quality

…audio codec, heavy compression, too low a bitrate, etc. can damage or even completely destroy essential parts of the audio signal required by speech technologies. Commonly used audio compression schemes exploit the perceptual limitations of human hearing and can remove frequencies that are masked by other frequencies, etc… Therefore, to get satisfactory results from speech technologies, use an appropriate audio format.…

Understand SPE configuration

…FLAC audio format (Default: true)
server.audio_formats.flac.enabled = true
# Enable or disable audio converter
audio_converter.enabled = true
# Set converter command
# %1 is for input file
# %2 is for output file
# ffmpeg example:
# audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2
# sox example:
# audio_converter.command = sox %1 %2
audio_converter.command = ffmpeg -loglevel warning…
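
To make the %1 / %2 placeholder convention concrete, here is a small Python sketch that expands such a command template into an argument list. It only illustrates the convention described in the comments above; it is an assumption for illustration, not a description of how SPE itself parses audio_converter.command, and the function name expand_converter_command is hypothetical.

```python
import shlex


def expand_converter_command(template: str, input_file: str, output_file: str) -> list[str]:
    """Split a converter command template and substitute %1 and %2.

    %1 is replaced by the input file and %2 by the output file,
    mirroring the comments in the configuration snippet.
    """
    substitutions = {"%1": input_file, "%2": output_file}
    # Only tokens that are exactly "%1" or "%2" are replaced; others pass through.
    return [substitutions.get(token, token) for token in shlex.split(template)]
```

For example, the ffmpeg template from the snippet with in.mp3 and out.wav expands to the arguments ffmpeg, -loglevel, warning, -y, -i, in.mp3, out.wav.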

FAQs (PSP)

…Browser, FAQ Speech Platform Q: How to fix Error 1007: Unsupported audio format? The Phonexia Browser application may return the error “1007: Unsupported audio format” when uploading an audio file. First check whether your audio files are in one of the formats listed in Q: What are the supported audio formats? If you need to use recordings in other formats as input, you can configure SPE…

Releases and Changelogs (SPE)

…[G#157] Added endpoint for updating existing Audio Source Profile [G#160] SID4 calibration technology renamed: SID4CALIBSET -> SID4CALIB [G#161] Mean normalization support in Audio Source Profiles [G#169] Added cache for Audio Source Profiles, see server.audio_source_profiles_cache_size property [G#170] Added False Acceptance Calibration cache, see server.bsapi_comparator_fa_cache_size [G#149] Fixed: phxclient prints help if running without parameters [G#150] Fixed: UTF-8 symbols are not escaped in…

FAQs (Browser)

…audio format” when uploading an audio file. First check whether your audio files are in one of the formats listed in Q: What are the supported audio formats? If you need to use recordings in other formats as input, you can configure SPE to convert audio automatically. As a prerequisite, install an external audio conversion tool. The recommended one is the ffmpeg utility, which is powerful and well documented. Please find…

Key Features (PSP)

…audio conversion tools. Tested with sox and ffmpeg. For the configuration of this functionality, see [SPE]/settings/phxspe.properties. Note: You should be aware that audio format conversion (e.g., if the original audio format is highly compressed) can decrease the accuracy of speech technologies. Integration Possibilities: Phonexia Speech Platform can be integrated into a partner’s application using the Speech Engine component (REST API)….

LID: Terminology and adaptation

…20 hours of audio is required, see requirements below
Enhancing an existing language model by adding your own audio files to an existing built-in language: at least 5 hours of audio is required, see requirements below
Creating a custom language pack consisting of your chosen set of languages, either pre-trained or created from your audio files
Audio recordings requirements
Format: WAV, FLAC, RAW…

Release Notes

…and fixes Speech Engine: General Reduced RAM consumption (since 3.58.0) RAM consumption can be up to several gigabytes lower, depending on the technology configuration and the audio processed. This is most visible in Speech To Text when processing many recordings, long recordings, or both. The effect may be less visible in other technologies. Fixed issues with non-ASCII / Unicode file names…

Understand SPE connectors for external TTS

…little-endian mono audio data. In SPE 3.46 and newer, the audio sampling frequency must be set to the naturalSampleRateHertz value provided in the TTS service capabilities information. In SPE 3.45 and older, the audio sampling frequency must be fixed to 8000 Hz. SPE then reads the audio and writes it either to a file, or to an output realtime stream,…

SPE and Browser installation: standalone SPE

…change the following lines to enable the FFmpeg converter:
change the line:
# Enable or disable audio converter
audio_converter.enabled = false
to:
# Enable or disable audio converter
audio_converter.enabled = true
6. Start Speech Engine
To start the Speech Engine, run the SPE executable called phxspe.
On Windows: type cmd in the Address bar to open the…
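
The configuration change described in this step (audio_converter.enabled from false to true) could also be scripted. Below is a minimal Python sketch that operates on the text of a properties file; the function name enable_audio_converter is hypothetical, and reading and writing the actual file is left to the caller.

```python
def enable_audio_converter(properties_text: str) -> str:
    """Return the properties text with audio_converter.enabled set to true.

    Only an uncommented "audio_converter.enabled = false" line is changed;
    comments and all other settings are passed through untouched.
    """
    updated = []
    for line in properties_text.splitlines(keepends=True):
        key, sep, value = line.partition("=")
        if sep and key.strip() == "audio_converter.enabled" and value.strip() == "false":
            # Preserve the original line ending (\n or \r\n).
            ending = line[len(line.rstrip("\r\n")):]
            line = f"{key}= true{ending}"
        updated.append(line)
    return "".join(updated)
```

Working on the text rather than the file makes it easy to review the change before writing it back, and to keep a backup of the original configuration.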

SPE and Browser installation: embedded SPE

…multimedia converter
By default, the Speech Engine accepts only a limited list of audio formats. To process non-native formats, install a multimedia converter. The recommended software for this is FFmpeg.
FFmpeg on Windows
Download the latest version from https://www.gyan.dev/ffmpeg/builds/ffmpeg-release-essentials.zip
After unzipping the package, move the ffmpeg.exe executable to the /SPE/ directory. You can delete the rest…