…database SQL scripts. data ├── phxspe.properties.default ├── init.d-phxspe.template ├── phxspe.service.template │ ├── benchmark └── database phxspe.properties.default Default phxspe.properties SPE configuration file init.d-phxspe.template Example SPE init.d script phxspe.service.template Example SPE systemd service unit file benchmark Default audio files for built-in benchmark functionality database Database SQL scripts for supported databases: SQLite, MariaDB and MySQL The phxspe.properties.default file is used by phxadmin tool…
Search: support
64 results
…are two differences against the XML example: the STT_STREAM technology is missing – Phonexia Browser does not support stream processing, i.e. does not allow configuration of stream technologies the config_file setting is also missing – Phonexia Browser does not support this special expert-level feature, i.e. does not store the setting { “technology_subsystem_settings”: { “technologies”: [ { “name”: “STT”, “models”: […
Phonexia Voice Inspector (VIN) is developed as a desktop application for forensic speaker comparison. Releases Version Release Date End of Support Maintained Until Release type 5.2 2024-04-15 2027-12-31 2027-12-31 Public 5.1 2023-12-07 2027-12-31 2027-12-31 Public 5.0 2023-06-29 2027-12-31 v.5.1 Public 4.0 2019-12-12 2023-12-31 2023-12-31 Public 3.2 2018-03-16 2020-12-16 2020-12-16 Public 3.1 2016-10-24 2018-04-24 v.3.2 Public 3.0 2016-08-05 2018-02-05 v.3.1 Public…
…TEXT (used for STT language model training) MSA is used in all formal writing such as official correspondence, literature, newspapers, webpages so there is no problem to accumulate loads of texts, but it will be more formal and far from spontaneous speech Support for MSA in Phonexia products Name LID L4 STT Description Arabic (MSA) arb — Modern Standard Arabic,…
…output) Note: The outputs can contain the following special tokens: sil silent part (or no speech detected) The list of phonemes is available in the document phonemes_for_stt_and_kws.pdf (delivered as part of manuals in SPE or STT or KWS). Languages Supported List of supported languages in Phoneme Recogniser is same as in Keyword Spotting. Link to API reference https://download.phonexia.com/docs/spe/#%2Ftechnologies%2Fphnrec…
…expected to provide information about actual TTS service capabilities: list of voice names, supported languages and audio quality (sampling frequencies). This info is used during SPE startup sequence – TTS connectors enabled in SPE configuration file are started with –info parameter and SPE reads the connector output. Connectors failing to provide the info won’t be available for use with SPE….
…1 CPU core (eg. standard 8 CPU core server (8 instances of STT) can process 1010 hours of audio in 1 day of computing time (flat load, depend on technology model)) Supported languages: List of supported languages. Acoustic models Acoustic model is created by training on training data. It includes characteristics of a voices of a set of speakers provided…
This part requires higher (and non-anonymous) access level.
How to solve this situation:
- Log in here if you are not logged in.
- Register here. It takes just a few clicks and it’s free.
A: The following options are supported: HTTP basic authorization – Client asks for session by resource “post /login” with HTTP basic authorization in query header. If server responds with error 405, server doesn’t support authorization by sessions and it is necessary to use basic authorization. Authorization by session – Authorization by session is done by adding parameter “X-SessionID“ into HTTP…
…coding), A-law or Mu-law, PCM, 8kHz+ sampling Voiceprints: AGE L4 model supports SID4 L4 voiceprints; legacy AGE models support voiceprints created by AGE itself Output Log file with processed information (age estimate) Processing speed Approx. 20x faster than real-time processing on 1 CPU core i.e. standard 8 CPU core server processes 3,840 hours of audio in 1 day of computing…
Version Release Date End of Support Maintained Until Release type 5.2 2024-04-15 2027-12-31 2027-12-31 Public 5.1 2023-12-07 2027-12-31 2027-12-31 Public 5.0 2023-06-29 2027-12-31 v.5.1 Public 4.0 2019-12-12 2023-12-31 2023-12-31 Public 3.2 2018-03-16 2020-12-16 2020-12-16 Public 3.1 2016-10-24 2018-04-24 v.3.2 Public 3.0 2016-08-05 2018-02-05 v.3.1 Public 1.3 2015-06-04 2016-12-04 2016-12-04 Public…
…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…
…kept in the database at all. Supported databases SPE supports SQLite and MariaDB 10.x (SPE 3.46+) MySQL 5.x (SPE up to 3.45) database engine. The database engine is configured in phxspe.properties SPE configuration file – see the Database section of SPE configuration file article for more details. SQLite SQLite is the out-of-the-box SPE default database type. By its nature, SQLite…
Phonexia Voice Inspector software offers several features that strongly support the work of voice forensic experts: A standalone application with a complete easy-to-use Graphical User Interface (GUI) Automatic comparison of questioned recording (unknown speaker recording or voiceprint) against a suspected reference speaker (group of recordings or voiceprints) with a known speaker i.e. 1:1 identification and 1:N identification. Implemented speech technologies:…
…audio manipulation SPE has built-in basic audio files manipulation functionality, like separating individual channels from stereo recordings, cut one audio to several files, save audio from incoming stream to file and others. Stream audio player To support voicebot scenarios, SPE has the ability to play audiofiles directly to output RTP stream External Text-to-speech (TTS) integration Easy integration with external TTS…