
Search: SID API

20 results

Understand SPE user accounts

…not visible by SPE and by the account. Similar trickery can be done with the data directory, allowing accounts to share LID language models and language packs, SID speaker models, etc. User accounts management: SPE user accounts can be managed using the REST API (see the Administration section of the API documentation), or using the command-line administration utilities phxadmin or…

Understand SPE metafiles

Certain SPE entities – SID Speaker models, SID Audio source profiles, LID Language packs – can have additional information associated with them in the form of “metafiles”. This article explains the intended usage of metafiles. In general, SPE is intended as an under-the-hood engine, focusing purely on speech-related audio processing. Any additional functionality should be done on the application layer,…

Understand SPE home directory

…storage – user-specific data. Since SPE comes with a pre-configured user “admin”, the default SPE installation contains an admin subdirectory inside home: home └── admin ├── storage └── data. Storage: The user’s storage directory is where SPE stores files and directories uploaded or created via the REST API, i.e. every action made using the /audiofile or /directory endpoints is reflected here – e.g….

KWS: Results explained

…sheet demonstrating the sigmoid function: Score-to-Confidence. Score-to-confidence conversion tuning: Starting with SPE/BSAPI 3.24 (October 2019), it’s possible to modify the confidence calculation using the confidence_shift and confidence_sharpness values in the [score_calib:SKeywordScoreCalibrationI] section of the user configuration file. The user configuration file must have the same name as the original configuration file, with an added .usr extension, e.g. kws_en_us_5.bs.usr – see the What is a user configuration…
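
To illustrate how a shift and a sharpness value could shape a sigmoid score-to-confidence mapping, here is a minimal Python sketch. The exact formula used by SPE/BSAPI is not given in the snippet above, so the logistic form below, the function name score_to_confidence, and the way the two parameters enter the formula are all assumptions for illustration only; only the parameter names confidence_shift and confidence_sharpness come from the source.

```python
import math


def score_to_confidence(score: float,
                        confidence_shift: float = 0.0,
                        confidence_sharpness: float = 1.0) -> float:
    """Map a raw score to a confidence in [0, 1] with a logistic sigmoid.

    ASSUMPTION: this is a generic logistic curve, not the documented
    SPE/BSAPI formula. Here confidence_shift moves the score at which the
    confidence equals 0.5, and confidence_sharpness controls the steepness.
    """
    z = confidence_sharpness * (score - confidence_shift)
    # Evaluate in a numerically stable way so large |z| does not overflow.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


if __name__ == "__main__":
    # Compare the default curve with a shifted, steeper one.
    for s in (-2.0, 0.0, 2.0):
        default = score_to_confidence(s)
        tuned = score_to_confidence(s, confidence_shift=1.0,
                                    confidence_sharpness=3.0)
        print(f"score={s:+.1f}  default={default:.3f}  tuned={tuned:.3f}")
```

Under these assumptions, raising the shift lowers the confidence reported for any given score, and raising the sharpness makes the transition between low and high confidence more abrupt.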

Waveform Denoiser (DENOISER)

…or SID technologies. Q: How does the Denoiser perform if part of the recording is noisy and part of the speech is good quality? The technology is being developed to automatically detect low-quality audio segments and try to reconstruct them. Conversely, well-recorded segments should be automatically recognized and retain their original speech quality. Q: Is there a…