Skip to content Skip to main navigation Skip to footer

Search: ffmpeg

12 results

Understand SPE audio converter

…install the converter (e.g. on Ubuntu: sudo apt install ffmpeg, on CentOS sudo yum install ffmpeg, or similar, depending on your OS, version and configuration). ⊞ Windows The simplest way is to download the converter distribution package from its site and copy the executable (ffmpeg.exe, sox.exe, etc.) and all required accompanying DLLs to the SPE installation directory, i.e. along the…

SPE and Browser installation: embedded SPE

…multimedia converter By default, the Speech Engine will accept only a limited list of audio formats. In order to process the non-native formats, install a multimedia converter. The recommended SW for this is FFmpeg. FFmpeg on Windows Download the latest version from https://www.gyan.dev/ffmpeg/builds/ffmpeg-release-essentials.zip After unzipping the package, move the ffmpeg.exe executable to the /SPE/ directory. You can delete the rest…

SPE and Browser installation: standalone SPE

…Download the latest version from https://www.gyan.dev/ffmpeg/builds/ffmpeg-release-essentials.zip After unzipping the package, move the ffmpeg.exe executable to the /SPE/ directory. You can delete the rest of the contents of the ffmpeg-release-essentials package. They will not be needed. FMMPEG on Linux run the following commands sudo apt update && sudo apt upgrade sudo apt install ffmpeg Open the /SPE/settings/phxspe.properties file with notepad and…

Q: What are the supported audio formats?

…configured do this conversion automatically in background, see Understand SPE audio converter article. Great tools for converting other than supported formats to supported are FFmpeg (http://www.ffmpeg.org) or SoX (http://sox.sourceforge.net/). Both are multiplatform software tools for Microsoft Windows, Linux and Apple OS X. Example of usage: FFmpeg ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav This command converts any supported format/codec audio file to normalized…

Understand SPE configuration file

…PCM 16-bit, 8 kHz. See audio_converter.command for more details. audio_converter.command # Set converter command # %1 is for input file # %2 is for output file # ffmpeg example: # audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 Sets the command…

FAQs (PSP)

…OS X. Example of usage: FFmpeg ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav This command converts any supported format/codec audio file to normalized WAV audio format in 16-bit PCM little-endian as it is the default system. For more parameters please check FFmpeg manual pages. SoX sox <source_audio_file_name> -b 16 <output_audio_base_name>.wav Number of bits defined by -b parameter must be specified. in FAQ Phonexia…

FAQs (Browser)

…this conversion automatically in background, see Understand SPE audio converter article. Great tools for converting other than supported formats to supported are FFmpeg (http://www.ffmpeg.org) or SoX (http://sox.sourceforge.net/). Both are multiplatform software tools for Microsoft Windows, Linux and Apple OS X. Example of usage: FFmpeg ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav This command converts any supported format/codec audio file to normalized WAV audio…

Q: How to fix Error 1007: Unsupported audio format?

ffmpeg utility, powerful and well documented. Please find your distribution package at http://ffmpeg.org Then continue as described below: Using Phonexia Browser with embed SPE Open the Browser configuration dialog by click on button “Settings” located in tool ribbon. Select tab “Speech Engine” and configure SPE as described in documentation. Don’t forget select checkbox “Enable audio converter”. Using SPE as service/daemon…

Q: How can I tell in which format the .wav file is?

A: From the utilities in the package*, you can find it in ffprobe <file_name>, it will write out the info about the file. *Utility “ffprobe” is not included in our package(s). It is part of ffmpeg (https://ffmpeg.org/ffprobe.html) and is necessary to be installed separately….

Understand SPE configuration

…FLAC audio format (Default: true) server.audio_formats.flac.enabled = true # Enable or disable audio converter audio_converter.enabled = true # Set converter command # %1 is for input file # %2 is for output file # ffmpeg example: # audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 audio_converter.command = ffmpeg -loglevel warning…

Q: What to do with the ApplicationStartup: Unhandled exception: BsapiException error?

When running SPE, the following error occurs: [Error] ApplicationStartup: Unhandled exception: BsapiException: SWaveformSegmenterI(/mnt/phxspe/home/phx/storage/dfs/a1cabcf7-c761-49f1 -a9bc-0a8209a09fd9.opus Requested segment (78056, 102056) is out of waveform range (0,91840). A: It means that this opus file is created improperly and declares internally (in header) much more audio than available in real file. Please check your audio source/originator for proper functionality. Or use ffmpeg / sox…

Key Features (PSP)

…audio conversion tools. Tested with sox or ffmpeg. For the configuration of this functionality, see [SPE]/settings/phxspe.properties Note: You should be aware that audio format conversion (e.g., if the original audio format is highly compressed) can decrease the accuracy of speech technologies. Integration Possibilities Phonexia Speech Platform can be integrated into a partner’s application using the Speech Engine component (REST API)….