Skip to content Skip to main navigation Skip to footer

Search: model L

64 results

Language Identification – Languages

Recognized languages Languages pre-trained in the default language pack are listed in the table below, each LID generation is a separate column (in the 4th generation we switched to using language tags instead of names): L4 L3, XL3 S2, L2 (deprecated sq-AL Albanian Albanian Albanian am-ET Amharic Amharic Amharic ar-EG Arabic (Egypt) Arabic   ar-KW Arabic (Gulf, Kuwait) Arabic_Gulf  …

Installation of Phonexia Browser

Some packages are distributed with only a limited set of speech technologies and languages or without speech technologies. First installation Our software is distributed as a ZIP file. Installation procedure is as simple as: unzip the archive paste additional KWS, STT… models paste the license.dat file to the root directory where you have BROWSER folder and run_browser(.exe) script run the…

Get better support

…information. It will help both of our parties to provide fastest and most efficient technical support to your customer: Issue data – required: LOG files or Console output from failed speech technology (for the command line) – usually in ./log/ directory) configuration files (technologies.xml from SPE is minimum) – usually in ./settings/ directory licensing file (license.dat) – usually along the…

Download Voice Inspector 5.1

…downloaded package (ZIP) to a location of your choice (e.g. ~/PhonexiaVIN/). Save the license.dat file to the root of your Voice Inspector directory (e.g. ~/PhonexiaVIN/) and plugin Phonexia USB token to USB port, when USB licensing is in use. Run ./VoiceInspector (Linux) / VoiceInspector.exe (Windows). Set up wizard will be launched automatically and help you with the first launch. You…

Understand SPE administration and backup

…the system: SPE database – the technology models, SPE user accounts, etc. are stored here SPE configuration file (usually /settings/phxspe.properties) technologies configuration file (usually /settings/ technologies.xml, or see phxspe.properties for details) licensing file (license.dat, usually stored along to phxspe.exe, or see phxspe.properties for details) Optimally, Partner should backup also the following entire SPE directory [optional], with all subdirectories (/bsapi/, etc.)…

Phonexia Speech Engine

…providers via simple plugin-like connectors interface Flexible integration SPE can provide results in JSON or XML format. Result can be obtained by polling, via websockets, or via webhooks (callbacks). Status information SPE can provide various status information to the application layer, e.g. license status, configuration info, current overall load, pending operations status, … Quick start The following tutorial describes the…

SID: Speaker Identification: Results Enhancement

…(if applicable) parameters for all calibration types (shifts, mean vectors etc.) optional user comments Creating Audio Source Profile An Audio Source Profile can be created either via the aspcreate4 command-line tool or via Speech Engine using the /technologies/speakerid4/audiosourceprofiles/{name} endpoint. Profiles created from the same data are identical, regardless of the interface used to create them. On the creation, content of…

Phonexia End User License Agreement

…running on servers providing services for Phonexia Partner’s customers. 2.4 Production license which can be used for commercial exploitation. Within the delivery, Client will receive Production license(s) corresponding with mutual agreed payment model for particular type of Software. Unless otherwise agreed, the standard Production license validity is set for twenty (20) years from the date the license commences. 2.5 Special…

Q: I can’t manage to run Phonexia Browser software. I always get an error.

I always get the same error messages: unable to connect to the SPE unable to start the localhost: giving up and kill the localhost. A: This error may happen if the initialization of SPE engine takes too long. Phonexia Browser software treats it as initialization failure and kills the server. You can fix this by doing the following: Increase timeout…

SID4 performance on Intel® Xeon® Platinum 8124M

Benchmark goals Find realistic performance using total recording length Find FTRT based exactly on net_speech (engineering sizing data) Find system performance using all physical cores Find system performance using all logical cores Infrastructure setup Intel® Xeon® Platinum 8124M is used in virtual machine with 8 physical cores reserved exclusively for this VM, Hyper Threading is enabled [16 logical cores available],…

Understand SPE home directory

…Data The data directory holds additional data files for entities created by that user – e.g. SID Speaker Models, or LID language packs. If no such entities exist for that user, this directory is empty. Unlike the storage, content of this directory is intended to be manipulated by SPE only and should not be manipulated directly on the filesystem level….

Diarization tool for Orbis

…multichannel WAV audio, where each speaker speaks only in their own channel. Tool will automatically convert audio files to WAV format. For example, this recording: audio.wav channel_1 […111..222..11….22222..] Will be converted to audio.wav channel_1 […111…….11………..] channel_2 [……..222……..22222..] IMPORTANT: Tool doesn’t process any metadata. Resulting files should be uploaded into Orbis without metadata file! Tool uses Phonexia Diarization technology model XL4….

Sizing of the computing units for speech technologies

Best practices for good sizing of Phonexia technologies depend on a few facts: Intense work with large data sets requires good performance and bandwidth between RAM and CPU. It all depends on the size of the files with technological models data, usually loaded into RAM and used intensively for computing operations Always think only about physical cores of CPU (HT,…