Search Results for: SPE settings

Results 1 - 10 of 25 Page 1 of 3
Results per-page: 10 | 20 | 50 | 100

How to configure Speech Engine workers

Relevance: 100%      Posted on: 2020-03-28

Worker is a working thread performing the actual files- or realtime streams processing in Speech Engine. This article helps to understand the Speech Engine workers and provides information how to configure workers for optimal performance and server utilization. The default workers configuration in settings/phxspe.properties is as shown below – 8 workers for files processing and 8 workers for realtime streams processing. These numbers mean the maximum number of simultaneously running tasks. # Multithread settings server.n_workers = 8 server.n_realtime_workers = 8 Requests for additional file processing tasks are put in a queue and processed according their order and priorities. Requests for…

Phonexia Speech Engine

Relevance: 100%      Posted on: 2017-05-18

About Phonexia Speech Engine v3 (SPE3) is a main executive part of the Phonexia Speech Platform. It is a server application with REST API interface through which you can access all available speech technologies. Both, Linux 64bit and Windows 64bit operating systems are supported. Phonexia Speech Engine (SPE3) is adjustable server component which houses all speech technologies. SPE3 provides RESTfull application programming interface to access various technologies. Aside from technologies themselves the SPE has implemented other various functionality supporting work with speech technologies, recordings and streams, and others. Features Main purpose of SPE is to work as processing unit for…

VIN – Releases and Changelogs

Relevance: 100%      Posted on: 2018-04-08

Phonexia Voice Inspector (VIN) is developed as a desktop application for forensic speaker comparison. This page lists changes in VIN releases. Releases Changelogs Voice Inspector v4.0.0, BSAPI 3.23.0 - Dec 11 2019 - VIN is available with L4 technology model - Other technology models (S2, L2, L3, XL3) are no longer supported - Added Diarization Technology (available in waveform editor) - Population Sets structure changed - Reworked dialog for population set management - Added possibility to set type of estimation of the Target distribution - Using population set to estimate Target distribution allows 1:1 comparison - Bug fixes Voice Inspector…

Performance of the Speaker Identification 4th generation (SID4): Intel® Xeon® Platinum 8124M

Relevance: 100%      Posted on: 2019-10-30

Benchmark goals Find realistic performance using total recording length Find FTRT based exactly on net_speech (engineering sizing data) Find system performance using all physical cores Find system performance using all logical cores Infrastructure setup Intel® Xeon® Platinum 8124M is used in virtual machine with 8 physical cores reserved exclusively for this VM, Hyper Threading is enabled [16 logical cores available], 32GB RAM, 30GB SSD based storage, 1000 I/O.s-1  reserved per core Benchmark data setup Data set statistic: Number of files: 32 [300 seconds each] RAW recordings length ∑: 9600 [sec] Net speech length ∑: 4224.77 [sec] In the data set…

Speech Intelligence Resolver v1

Relevance: 100%      Posted on: 2017-05-18

About Phonexia Speech Intelligence Resolver v1 (SIR1) combines the power of speech technologies within a single application. The application automatically performs visualization of the record as well as filtering the speech metadata uncovered from your records effectively. Speech technologies implemented: Phonexia Speaker Identification (SID2) Phonexia Language Identification (LID2) Phonexia Gender identification (GID) Phonexia Voice Activity Detection (VAD) Phonexia Speaker Diarization (DIAR) Phonexia Keyword Spotting (KWS) Phonexia Speech Quality Estimator (SQE) Phonexia Speech Transcription (STT) SIR is a client application cooperating with REST servers. It can be used as a standalone application due to the integrated local REST server. It was…

SPE3 – Administration and Backup

Relevance: 100%      Posted on: 2018-04-15

Each Partner has its own administration and back up policy. Here, we highlight the most important SPE3 components to be administrated and backed up. Administration It is strongly recommended to describe your own administration approach with the following components SPE users (accounts) - Partner should maintain list of SPE users (accounts). There should be only few persons with “admin” role. All other should be with “user” role (do not see content of other “user”) and/or “vbs” role (dis/enables using of VoiceBiometry plugin) the SPE database and/or VBSplugin database administration – where the (temporary) results are stored user.home - where the…

How to configure STT realtime stream word detection parameters

Relevance: 100%      Posted on: 2020-03-28

One of the improvements implemented since Speech Engine 3.24 is neural-network based VAD, used for word- and segment detection. This article describes the segmenter configuration parameters and how they are affecting the realtime stream STT results. The default segmenter parametrs are as shown below: [vad.online_segmenter:SOnlineVoiceActivitySegmenterI] backward_extensions_length_ms=150 forward_extensions_length_ms=750 speech_threshold=0.5 Backward- and forward extension are intervals in miliseconds, which extend the part of the signal going to the decoder. Decoder is a component, which determines what a particular part of the signal contains (speech, silence, etc.). Based on that, decoder also decides whether segment has finished or not. Unlike in file processing…

Phonexia Voice Inspector v1

Relevance: 100%      Posted on: 2017-05-18

About Phonexia Voice Inspector v1 (VIN1) provides police forces and forensic experts with highly accurate speaker identification tools to be used during the investigation of criminal matters. It utilizes the power of voice biometry to automatically recognize the speaker by their voice. Main features of the VIN1 application: An automatic speaker identification tool to strengthen the results of the standard phonetic based approaches Scoring of the likelihood ratio (LR), log-likelihood ratio (LLR), and an option of a verbal presentation of the results Graphic presentation of the likelihood ratio (LR), probability density function and Tippett plot Generating detailed reports (expert opinion…

SPE3 – Releases and Changelogs

Relevance: 100%      Posted on: 2020-07-02

Speech Engine (SPE) is developed as RESTfull API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x). This page lists changes in SPE releases. Releases Changelogs Speech Engine 3.31.1 (07/02/2020) - DB v1500, BSAPI 3.31.0 Non-public Feature Preview release Fixed: SQLite database update from version v1401 fails Speech Engine 3.31.0 (07/01/2020) - DB v1500, BSAPI 3.31.0 Non-public Feature Preview release New: SPE now requires CentOS 7 or other Linux based OS with glibc >= 2.17 New: Added instructions for updating SPE (see doc/UPDATE.txt file) New: Added new LID…

What is a user configuration file and how to use it

Relevance: 100%      Posted on: 2020-03-28

Advanced users with appropriate knowledge (gained e.g. by taking the Phonexia Academy Advanced Training) may want to finetune behavior of the technologies to adapt to the nature of their audio data. Modifying original BSAPI configuration files directly can be dangerous – inappropriate changes may cause unpredicatble behavior and without having a backup of the unmodified file it's difficult to restore working state. User configuration files provide a way to override processing parameters without modifying original BSAPI configuration files. WARNING: Inappropriate configuration changes may cause serious issues! Make sure you really know what you are doing. User configuration file is a…