Skip to content Skip to main navigation Skip to footer

Search: audio supported

28 results

FAQs (PSP)

…Browser, FAQ Speech Platform Permalink Q: How to fix Error 1007: Unsupported audio format? Phonexia Browser application may return error “1007: Unsupported audio format” during uploading audio file. Please consider if your audio files are in Q: What are the supported audio formats? . But if you need use as input audio recordings in other formats, you can configure SPE…

Understand SPE configuration file

…support is enabled or disabled. By default it’s enabled. server.audio_formats.flac.enabled # Enable or disable native support for FLAC audio format (Default: true) server.audio_formats.flac.enabled = true Controls whether the FLAC audio format support is enabled or disabled. By default it’s enabled. audio_converter.enabled # Enable or disable audio converter audio_converter.enabled = false Controls whether support for automatic audio format conversion is enabled…

FAQs (Browser)

audio format” during uploading audio file. Please consider if your audio files are in Q: What are the supported audio formats? . But if you need use as input audio recordings in other formats, you can configure SPE for audio automated conversion. As prerequisite install external tool for audio conversion. Recommend is ffmpeg utility, powerful and well documented. Please find…

Understand SPE audio converter

…can’t be converted: Converter is disabled 2021-01-30 20:59:52 [Trace] ConverterSubsystem: Removed temporary file: C:\TMP\tmp11452aaaaaa 2021-01-30 20:59:52 [Error] Rest.Object.AudioFile: [RID=2] REST error: (1007) Unsupported audio format 2021-01-30 20:59:52 [Trace] Rest.Object.AudioFile: [RID=2] Response HTTP: 415 RESTError: 1007 JSON response (error) ===================== { “result” : { “version” : 2, “name” : “ErrorResult”, “code” : 1007, “message” : “(1007) Unsupported audio format” } }…

Releases and Changelogs (SPE)

…[G#157] Added endpoint for updating existing Audio Source Profile [G#160] SID4 calibration technology renamed: SID4CALIBSET -> SID4CALIB [G#161] Mean normalization support in Audio Source Profiles [G#169] Added cache for Audio Source Profiles, see server.audio_source_profiles_cache_size property [G#170] Added False Acceptance Calibration cache, see server.bsapi_comparator_fa_cache_size [G#149] Fixed: phxclient prints help if running without parameters [G#150] Fixed: UTF-8 symbols are not escaped in…

Q: How to fix Error 1007: Unsupported audio format?

Phonexia Browser application may return error “1007: Unsupported audio format” during uploading audio file. Please consider if your audio files are in Q: What are the supported audio formats? . But if you need use as input audio recordings in other formats, you can configure SPE for audio automated conversion. As prerequisite install external tool for audio conversion. Recommend is…

Q: What are the supported audio formats?

…configured do this conversion automatically in background, see Understand SPE audio converter article. Great tools for converting other than supported formats to supported are FFmpeg (http://www.ffmpeg.org) or SoX (http://sox.sourceforge.net/). Both are multiplatform software tools for Microsoft Windows, Linux and Apple OS X. Example of usage: FFmpeg ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav This command converts any supported format/codec audio file to normalized…

Release Notes

…model (tech.model name: VI_VN_6) released We add this to languages supported by STT/KWS. Partners/Customers can transcribe Vietnamese audio or enhance some search-in-audio applications. Again, it is the 6th generation we use in automatic speech recognition technologies (i.e., STT, KWS, PHNREC) that will lead to more precise search results in audio content. Improved preferred phrases in STT– only in tech. model…

LID: Terminology and adaptation

…20 hours of audio is required, see requirements below Enhancing existing language model by adding your own audio files to existing built-in language at least 5 hours of audio is required, see requirements below Creating custom language pack consisting of your chosen set of languages, both pre-trained or created from your audio files Audio recordings requirements Format: WAV, FLAC, RAW…

Key Features (PSP)

…in the Languages Available section. Speech To Text (STT) and Keyword Spotting (KWS) languages Language Identification (LID) languages Supported Audio input The Speech Engine server supports various audio formats as listed in API reference > Audio requirements. It also supports the RTP/HTTP stream processing as listed in API reference > RTP/HTTP streams. The Speech Engine allows the usage of some…

Understand SPE configuration

…FLAC audio format (Default: true) server.audio_formats.flac.enabled = true # Enable or disable audio converter audio_converter.enabled = true # Set converter command # %1 is for input file # %2 is for output file # ffmpeg example: # audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 audio_converter.command = ffmpeg -loglevel warning…

Understand SPE connectors for external TTS

…little-endian mono audio data. In SPE 3.46 and newer, the audio sampling frequency must be set to the naturalSampleRateHertz value provided in the TTS service capabilities information. In SPE 3.45 and older, the audio sampling frequency must be fixed to 8000 Hz. SPE then reads the audio and writes it either to a file, or to an output realtime stream,…

Understand SPE database

SPE database serves multiple purposes: stores SPE internal data stores various information about SPE entities created by SPE user audio files metadata speaker models and their voiceprints speaker groups and their voiceprints calibration sets keyword lists language packs audio source profiles stores cached processing results (ON by default, can be set in SPE configuration file) optionally also stores SPE log…

Understand SPE directory structure

…database SQL scripts. data ├── phxspe.properties.default ├── init.d-phxspe.template ├── phxspe.service.template │ ├── benchmark └── database phxspe.properties.default Default phxspe.properties SPE configuration file init.d-phxspe.template Example SPE init.d script phxspe.service.template Example SPE systemd service unit file benchmark Default audio files for built-in benchmark functionality database Database SQL scripts for supported databases: SQLite, MariaDB and MySQL The phxspe.properties.default file is used by phxadmin tool…

Understand SPE executable files

…to URL (e.g. “http://server:port”) priority=number – Set request priority (see Understanding SPE processing priority for more details) phxclient: example 1 phxclient /login=admin /password=phonexia /method=POST /uri=”127.0.0.1:8600/audiofile?path=/myfile.wav” /data=”c:\audio files\example recording.wav” Upload example recording.wav file from c:\audio files folder to SPE running at this machine (i.e. with IP address 127.0.0.1) and put it in the root of SPE internal storage under myfile.wav name….