Skip to content Skip to main navigation Skip to footer

Search: spe-3.11.1-win64.zip

130 results

Understand SPE user accounts

…prioritization section in the REST API documentation maximum pending requests – legacy REST Server 2.x attribute, ignored in SPE 3.x It’s important to realize that each SPE user account has its own home directory, where SPE stores the account’s data, see Understanding SPE home directory article. It means that by default the accounts’ data is isolated from each other. Therefore,…

Download Speech Platform

…Standalone mode – the recommended setup, requiring some manual steps using command line Further information resources Speech Engine REST API documentation online: https://download.phonexia.com/docs/spe/ offline: {SPE_directory}/doc/api_reference.html or http://{SPE_address:port}/doc Speech Engine technical documentation check the Speech Engine section and the “Understand…” articles listed in the left menu tutorials and training videos see technologies introduction video below and SPE Training videos section https://youtu.be/DDu0Y1rgQ6k…

Releases and Changelogs (VIN)

…Target score distribution Fixed: Population Set selected correctly even if renamed in the selection window Improved: Speech length display in the case view: added “Unlimited” option to display the speech length permanently Improved: SID Evidence score aligned with Speech Engine output of SID score Removed: Speech length compensation Voice Inspector 5.1 Voice Inspector 5.1.0, BSAPI 3.60.0 (2023-12-07) New: A generalized…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…in our example is 36 seconds. After stripping silence, it gets 14 seconds – this means that original audio contains 38% of net speech and 62% of silence. Phonexia speech technologies analyze the entire recording, but pick only the speech segments for AI processing, i.e. the absolute processing time will be practically the same… Creating voiceprint by Speaker Identification took:…

Download Voice Inspector 5.2

…models VIN application (graphical user interface, GUI) with the following technologies in-build Speaker Identification (SID4_XL5) Speaker Diarization (DIAR) Voice Activity Detection (VAD) Speech Quality Estimator (SQE) Phoneme Recogniser (PHNREC) example population sets and audio (in ./examples/) and example report templates (in ./templates/) Hardware requirements minimum – CPU: Intel® Core™ i5, RAM: 4 GB, Required HDD space: 0.5 GB for software…

Understand SPE administration and backup

…the system: SPE database – the technology models, SPE user accounts, etc. are stored here SPE configuration file (usually /settings/phxspe.properties) technologies configuration file (usually /settings/ technologies.xml, or see phxspe.properties for details) licensing file (license.dat, usually stored along to phxspe.exe, or see phxspe.properties for details) Optimally, Partner should backup also the following entire SPE directory [optional], with all subdirectories (/bsapi/, etc.)…

STT: What is Preferred Phrases feature and how to use it

…it can help in other applications, too – e.g. when transcribing domain-specific audios, the frequently used domain-specific phrases can be boosted. How preferred phrases work The picture below shows a simplified standard speech transcription process – the digitized speech signal spectrum is analyzed in the neural network acoustic model (which describes the pronunciations of a given language) and goes into…

Understand SPE metafiles

Certain SPE entities – SID Speaker models, SID Audio source profiles, LID Language packs – can have additional information associated with them in the form of “metafiles”. This article explains the intended usage of metafiles. In general, SPE is intended as under-the-hood engine, focusing purely on the speech-related audio processing. Any additional functionality should be done on the application layer,…

Speech Engine

To create the SPE report: Go to the SPE installation directory Open command line/terminal (in Ubuntu Linux Right click + press E, in Windows type cmd in the address bar) Run ./phxadmin –report (Linux) or phxadmin.exe /report (Windows) Zip up the created directory with report and attach the ZIP file to your issue description The Report functionality is not present…

STT: Language Model Customization tutorial

…model in Speech Engine using phxadmin 1) Placing the customized STT model in correct location In order to be recognized by Speech Engine, the customized STT model must be placed in a correct location. The best location in SPE 3.41 or newer is <SPE_directory>/shared/bsapi/stt (see Understand SPE directory structure article). In older versions it’s <SPE_directory>/bsapi/stt. The data and settings directories

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

Phonexia Speaker Identification is a voice biometry tool for recognition of speakers by their voice. In this video, we will show you how to start using this technology! You will learn how to create a “Speaker Model” to identify a speaker in a set of data. Ready to test it? Start with our video: What else is needed? 1. Phonexia…

Adding new language or technology model (Browser)

This article explains how to add a new technology model into the current Speech Engine (SPE) instance when using Phonexia Browser. Prerequisites To proceed, you need to have existing installation of SPE. If you do not have one, check other articles, especially: Download Speech Platform Installation of Phonexia Browser Documentation of Phonexia Browser Installation package with new language models Note:…

Keyword Spotting (KWS)

…a numerical expression of probability that word was said in a specified time frame. Keywords Keywords are not dependent on any dictionary. This allows to define specific, foreign or even nonexistent words like product names. However, only allowed graphemes (symbols) from a supported list can be used to define keywords. This list can be easily obtained by Speech Engine and…

Understand SPE technologies, instances and workers

Configuring Speech Engine to utilize effectively the full power of underlying hardware can get challenging – one can easily get lost in all the strange terms like technologies, instances, slots, or workers… This article should shed some light in it. Speech Engine is like post office Thinking about Speech Engine, there is actually a very nice analogy with post office…