Skip to content Skip to main navigation Skip to footer

Search: age%20estimation

96 results

Get better support

…product executable file “properties” files (phxspe.properties from SPE is minimum) – usually in ./settings/ directory Issue data – supporting: Actual and active HW configuration (CPU, OS, RAM, storage status (free space)), you might use the following to get the information: The Benchmark function in SPE or Phonexia’s hw-gen for generating basic HW print Windows 64bit http://download.phonexia.com/utils/hw-gen64.exe GNU/Linux 64bit http://download.phonexia.com/utils/hw-gen64 System…

Q: What to do with the ApplicationStartup: Unhandled exception: BsapiException error?

When running SPE, the following error occurs: [Error] ApplicationStartup: Unhandled exception: BsapiException: SWaveformSegmenterI(/mnt/phxspe/home/phx/storage/dfs/a1cabcf7-c761-49f1 -a9bc-0a8209a09fd9.opus Requested segment (78056, 102056) is out of waveform range (0,91840). A: It means that this opus file is created improperly and declares internally (in header) much more audio than available in real file. Please check your audio source/originator for proper functionality. Or use ffmpeg / sox…

Q: What languages do you offer?

It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) for 20+ languages including English, French, German, Russian, Spanish and many more….

Voice Activity Detection (VAD)

Voice Activity Detection is a language-, domain- and channel-independent technology that identifies parts of audio recordings with speech content vs. non-speech content. It creates labels for speech and other signals in the recording; this can then serve as a decision point whether to process the recording by other technologies or not. VAD is usually part of rapid filtration process in…

Speaker Diarization (DIAR)

Speaker Diarization labels segments of the same voice(s) in one mono-channel audio record based by the individual speaker´s voice. It is a language-, domain- and channel-independent technology. It performs not only the segmentation of speakers but of technical signals and silence as well. The outputs of the technology can be both log files with labels and/or split audio files/one new…

Open Source Acknowledgement

This page collects information about Third party Open Source code and licenses. BSAPI3 updated dependencies Library License ADVobfuscator https://github.com/andrivet/ADVobfuscator#copyright-and-license. Copyright: 2010-2017 Sebastien Andrivet blaze BSD-3-Clause boost BSL-1.0 botan BSD-2-Clause bzip2 bzip2-1.0.8 cpp-httplib MIT dlfcn-win32 (Windows only) LGPL duktape MIT (link) flac BSD-style license fmt MIT gsl-lite MIT libbacktrace (Linux only) BSD-3-Clause minizip Zlib mkl freeware under ISSL (Intel Simplified Software…

404 error

404 four hundred four [fôr ˈhəndrid fôr] Page not found back to home…

Q: What are the supported audio formats?

…configured do this conversion automatically in background, see Understand SPE audio converter article. Great tools for converting other than supported formats to supported are FFmpeg (http://www.ffmpeg.org) or SoX (http://sox.sourceforge.net/). Both are multiplatform software tools for Microsoft Windows, Linux and Apple OS X. Example of usage: FFmpeg ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav This command converts any supported format/codec audio file to normalized…

Q: How to fix Error 1007: Unsupported audio format?

…ffmpeg utility, powerful and well documented. Please find your distribution package at http://ffmpeg.org Then continue as described below: Using Phonexia Browser with embed SPE Open the Browser configuration dialog by click on button “Settings” located in tool ribbon. Select tab “Speech Engine” and configure SPE as described in documentation. Don’t forget select checkbox “Enable audio converter”. Using SPE as service/daemon…

Phonexia End User License Agreement

…demo software is provided “As Is” and “As Available” without warranty of any kind. Use of the Web demo software and any related services is done solely at the Client´s own discretion and risk. Phonexia is not responsible for any damage to the Client´s computer system or other device, or for the loss of any data as a result of…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

KWS: Results explained

…the detected pronunciation. Start- and end time is in HTK units. 1 HTK unit is 100 nanoseconds, so dividing the times by 10000 gives the amount of milliseconds. Score is log likelihood ratio from {-inf,+inf} interval. Confidence is a probability from {0,1} interval. To convert it to percentage, multiply the confidence value by 100. Example This example of Keyword Spotting…

What is User configuration file and how to use it

…name User configuration file name stt_cs_cz_5_online.bs stt_cs_cz_5_online.bs.usr kws_nl_nl_5.bs kws_nl_nl_5.bs.usr phnrec_pashto.bs phnrec_pashto.bs.usr vpextract4_xl4.bs vpextract4_xl4.bs.usr During technology initialization (e.g. during Speech Engine startup), the initialization routine checks for existence of such user config file. If found, it’s automatically loaded after loading the main configuration file and the settings from user config is automatically applied over the setings from main configuration file. Usage

STT: What is Words-To-Numbers feature and how to use it

…point zero three ⇒ 1586.03 sixty four million seven hundred thousand ninety ⇒ 64700090 This should help to simplify processing of the transcribed texts by text analytic layers or NLP (Natural Language Processing) engines, e.g. in voicebot applications. Where is the converted output available? The words to numbers conversion is available only in n-best output (i.e. where the entire sentence…

FAQs (VIN)

…is stored next to VoiceInspector.exe application. If the license.dat is already in the same directory that VoiceInspector.exe is and you are still receiving this error, please contact the technical support at our Service Desk. in FAQ Voice Inspector Permalink Q: I am getting the error message “Your license is not for this application.” A: Check your license file (license.dat) by…