Skip to content Skip to main navigation Skip to footer

Search: server

63 results

Q: What are the supported audio formats?

Formats supported directly and natively are: WAVE (*.wav) container including any of: unsigned 8-bit PCM (u8) unsigned 16-bit PCM (u16le) IEEE float 32-bit (f32le) A-law (alaw) µ-law (mulaw) ADPCM FLAC codec inside FLAC (*.flac) container OPUS codec inside OGG (*.opus) container Other audio formats must be converted to one of those natively supported using external tools. SPE server can be…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

Understand SPE administration and backup

…All other should be with “user” role (one user does not see content of other user). See Understand SPE user accounts for details. user.home – where the server stores the users data, see Understand SPE home directory for details LOG files – log file rotation is configured in phxspe.properties, see Understand SPE configuration file for details SPE database administration –…

Quick Start Guide (VIN)

Follow these steps to use Phonexia Voice Inspector: Download its latest version (provided as a ZIP archive here) Unpack the archive Download a license obtained from Phonexia. Please, don´t change name of the license (always in license.dat format). Note that a NET type of license needs an Internet connection to connect to our phonexia.com license server. The USB dongle type…

Speaker change on enrollments

Speaker change on enrollments is a feature based on Phonexia’s own diarization technology. It allows customers to be notified about the speaker changes during enrollment, which helps preventing a situations, where the enrolment is bad or compromised. Please note that this feature is currently available to single-server deployments only. To enable it, call endpoint /api/v2/maintenance/technologies in maintenance mode with the…

Orbis Hardware Requirements

…or newer recommended: Linux + recent stable KVM hypervisor Note: For testing purposes, you may have success running Orbis with lower specs. However, keep in mind, you may experience lower performance – or even crashes. In short: if running on lower than the recommended HW specs, no guarantee is provided by Phonexia. Client machine Orbis is a server application. Client…

Understand SPE user accounts

…prioritization section in the REST API documentation maximum pending requests – legacy REST Server 2.x attribute, ignored in SPE 3.x It’s important to realize that each SPE user account has its own home directory, where SPE stores the account’s data, see Understanding SPE home directory article. It means that by default the accounts’ data is isolated from each other. Therefore,…

Orbis FAQ


This part requires higher (and non-anonymous) access level.
How to solve this situation:

  1. Log in here if you are not logged in.
  2. Register here. It takes just a few clicks and it’s free.

Understand SPE metafiles

…can store whichever type of data would help your application. The files are physically stored in the SPE user’s “home”, in data subdirectory (see Understanding SPE home directory article for details). Maximum size of single metafile can be set using server.max_metadata_size setting in SPE cofiguration file. Example As an example, the picture below shows how Phonexia Browser uses SPE metafiles…

Understand SPE connectors for external TTS

…should log a successful TTS connector initialization: TTSSubsystem: Retrieving external connector info from ……./external/technologies/tts/acapela TTSSubsystem: External connector ‘acapela’ from ……./external/technologies/tts/acapela has been registered. If an error occurs, SPE logs the problem: TTSSubsystem: Retrieving external connector info from ……./external/technologies/tts/acapela TTSSubsystem: Cannot retrieve external connector info! ERROR: Loading configuration from “……./external/technologies/tts/acapela/connector.properties”;Error: acapela server is not running or address and ports are misconfigured;…

Q: How can we test Phonexia technologies?

We can prepare a testing package for you with full functionality of all technologies. The license validity is 90 days to allow you to test the technologies. Note: by default a NET license is provided for testing. This license needs to have active Internet connection to a phonexia licensing server in order to function. Rest assured no data – audio,…

Age Estimation (AGE)

…coding), A-law or Mu-law, PCM, 8kHz+ sampling Voiceprints: AGE L4 model supports SID4 L4 voiceprints; legacy AGE models support voiceprints created by AGE itself Output Log file with processed information (age estimate) Processing speed Approx. 20x faster than real-time processing on 1 CPU core i.e. standard 8 CPU core server processes 3,840 hours of audio in 1 day of computing…

Q: My NET license has stopped working, returning “Not enough free licenses” error.

…more instances than allowed by the license file (using -j parameter in command line). In rare cases your SW copy may have accidentally crashed. If this is the case, please wait for automatic license renewal period (60 minutes after last check). Check if your connection to the license server hasn’t changed. Check if validity of the license is not expired….

Speaker Identification (SID)

…technological model and can range from 5 to 50 times faster than real time on 1 server CPU core. Voiceprint extraction is the most time-consuming part of the process. Voiceprint comparison, on the other hand, is extremely fast – a millions of voiceprint comparisons can be done in 1 second. Voiceprint extraction (Speaker enrollment) Speaker enrollment starts with the extraction…

Input audio quality

…TIP: Tools like MediaInfo can easily give you technical information about your audio files. ? DO’S ? DON’TS Set your PBX, media server or recording device to one of these formats (in the order of preferrence): uncompressed WAV (16-bit, 8 kHz or more) A-law or μ-law (8-bit, 8 kHz) in WAV lossless formats like FLAC OPUS format (lossy, but developed…