Search Results for: engine channels

Results 1 - 50 of 57 | Page 1 of 2

SPE3 – Releases and Changelogs

Relevance: 100%      Posted on: 2021-04-16

Speech Engine (SPE) is developed as a RESTful API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x). Releases Changelogs Speech Engine 3.40.1, DB v1700, BSAPI 3.40.1 (2021-04-16) Public release Fixed: 6th generation STT/KWS stream result may start with words from end of previous stream Fixed: Some licensing error messages are not shown in log Fixed: Missing file names in log messages in SID and SID4 tasks Fixed: Keyword list may not work if XML is used as input and optional fields threshold or pronunciations are used Fixed: phxdamin2…

Speech Engine and technologies, instances, workers… explained

Relevance: 36%      Posted on: 2020-11-19

Configuring Speech Engine to effectively utilize the full power of the underlying hardware can get challenging – one can easily get lost in all the strange terms like technologies, instances, slots, or workers... This article should shed some light on it. Speech Engine is like a post office. Thinking about Speech Engine, there is actually a very nice analogy with a post office (or bank branch): A post office is a place providing different kinds of services – one can go there to send letters, send or pick up packages, get a PO box, get some financial services, insurance, etc. Speech Engine has various…

Speech Engine configuration file explained

Relevance: 35%      Posted on: 2021-02-19

In this article we explain details of the Speech Engine configuration file phxspe.properties, located in the settings subdirectory of the SPE installation location. Settings in this configuration file affect the Speech Engine behavior and performance. The configuration file is usually created after SPE installation – on first use of phxadmin, a default configuration file phxspe.properties is created in the settings directory. The file is loaded during SPE startup, i.e. you need to restart SPE to apply any changes made in the file. If Speech Engine is used together with Phonexia Browser in so-called "embedded" mode (see details about "embedded SPE" mode in Browser…

Phonexia Speech Engine

Relevance: 24%      Posted on: 2020-11-19

About Phonexia Speech Engine v3 (SPE3) is the main executive part of the Phonexia Speech Platform. It is a server application with a REST API interface through which you can access all available speech technologies. Both Linux 64-bit and Windows 64-bit operating systems are supported. Phonexia Speech Engine (SPE3) is an adjustable server component which houses all speech technologies. SPE3 provides a RESTful application programming interface to access various technologies. Aside from the technologies themselves, SPE implements various other functionality supporting work with speech technologies, recordings and streams, and others. Features Main purpose of SPE is to work as processing unit for…

Speech Engine 3.35.0

Relevance: 23%      Posted on: 2020-10-01

Speech Engine 3.35.0, DB v1600, BSAPI 3.35.0 (2020-10-01) New LID model L4 was promoted to production (LID BETA_L4 renamed to LID L4) Added new language tag documentation (doc/Technology_LID_L4_Language_tags.pdf) Updated STT model CS_CZ_5 to version 5.2.1 (fixes faulty transcription of numbers into Roman format) Added configurable STT Confusion Network threshold (in technology configuration file) Fixed STT didn't work with 4th and older generation models after introduction of the Preferred phrases feature in SPE 3.32 Update from SPE 3.30 causes errors in STT result cache memory leak in logging system Typo in name of es-XA language in LID model L4 default language…

How to configure Speech Engine workers

Relevance: 22%      Posted on: 2020-03-28

A worker is a working thread performing the actual file or realtime stream processing in Speech Engine. This article helps to understand the Speech Engine workers and provides information on how to configure workers for optimal performance and server utilization. The default workers configuration in settings/phxspe.properties is as shown below – 8 workers for file processing and 8 workers for realtime stream processing. These numbers mean the maximum number of simultaneously running tasks. # Multithread settings server.n_workers = 8 server.n_realtime_workers = 8 Requests for additional file processing tasks are put in a queue and processed according to their order and priorities. Requests for…
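The two worker settings quoted in the excerpt above can be read programmatically. The following is a minimal sketch, assuming the file uses plain `key = value` lines with `#` comments as shown in the excerpt; the helper name `parse_properties` and the inline sample text are illustrative and not part of SPE.

```python
def parse_properties(text):
    """Parse simple 'key = value' lines, skipping blank lines and '#' comments.

    This is an assumption about the file layout based on the quoted excerpt,
    not a full implementation of the .properties format.
    """
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        key, separator, value = line.partition("=")
        if separator:  # ignore lines that contain no '=' at all
            settings[key.strip()] = value.strip()
    return settings


# Sample content copied from the excerpt, laid out one setting per line.
SAMPLE = """\
# Multithread settings
server.n_workers = 8
server.n_realtime_workers = 8
"""

settings = parse_properties(SAMPLE)
file_workers = int(settings["server.n_workers"])
realtime_workers = int(settings["server.n_realtime_workers"])
print(f"file workers: {file_workers}, realtime workers: {realtime_workers}")
```

In practice the text would come from settings/phxspe.properties in the SPE installation directory rather than from an inline string.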

Speech Engine 3.35.1

Relevance: 21%      Posted on: 2020-10-13

Speech Engine 3.35.1, DB v1600, BSAPI 3.35.1 (2020-10-13) Fixed Missing input stream task name in log messages Missing arguments in "word not found" error messages (when using preferred phrases)

Speech Engine 3.35.2

Relevance: 21%      Posted on: 2020-10-22

Speech Engine 3.35.2, DB v1600, BSAPI 3.35.2 (2020-10-22) Fixed Detection of certain USB license tokens

Speech Engine 3.35.3

Relevance: 21%      Posted on: 2020-11-24

Speech Engine 3.35.3, DB v1601, BSAPI 3.35.3 (2020-11-24) New Internal support for SAMPA phonetic alphabet Updated STT model RU_RU_A to version 4.5.0 (updated language model) Updated STT/KWS/PHNREC model AR_XL to version 5.2.0 (updated language model, changed phonemes notation to X-SAMPA) Fixed Cannot create new output stream due to hanging unfinished tasks Task is not removed from pool when result is delivered via Webhook Some log messages contain format placeholder instead of numbers Missing <silence/> label in STT confusion network output STT confusion network contains <silence/> tags with confidence greater than 1.0 Diarization crashes during processing Diarization XL4 crashes on…

Speech Engine 3.35.4

Relevance: 21%      Posted on: 2020-12-14

Speech Engine 3.35.4, DB v1601, BSAPI 3.35.4 (2020-12-14) Fixed STT/KWS model AR_XL_5 has incorrect name and does not start Missing KWS model AR_XL_5 Processing of some short recordings causes TwoGmmCalibThreshold is not finite error STT preferred phrases "out of vocabulary" (OOV) warning message is now more verbose

Phonexia Speech Engine EoL

Relevance: 20%      Posted on: 2018-06-19

Information about release dates, support and maintenance periods of Phonexia Speech Engine (software End of Life - EoL).

LID adaptation

Relevance: 15%      Posted on: 2021-03-02

This article describes various ways of Language Identification adaptation. Basic terminology Languageprint (*.lp file) – numeric representation of the audio, extracted from an audio file for language identification purposes (similar to “voiceprint”, but representing the spoken language, not the speaking person) Languageprint archive (*.lpa file) – multiple languageprints combined into a single archive Creation of languageprint archives is not supported by SPE, these are supported as input only. Language model – digital characteristics of a specific language Language model can be trained from languageprints (*.lp), languageprint archives (*.lpa), or from a combination of both. LID language model should not be…

STT Language Model Customization tutorial

Relevance: 9%      Posted on: 2019-04-24

Language Model Customization tool (LMC) provides a way to improve the Speech To Text performance by creating customized language model. Language model is an important part of Phonexia Speech To Text. In a simplified way it can be imagined as a large dictionary with multiple statistics. The Speech To Text technology uses this dictionary and statistical model to convert audio signals into the proper text equivalents. Due to general diversity of spoken speech, the default generic language model may not acknowledge the importance of certain words over other words in certain situations. Language model customization is a way to inform…

Time Analysis (TAE)

Relevance: 7%      Posted on: 2017-05-18

Technology description Technology Time Analysis Extraction by Phonexia extracts base information from dialogue in a recording, providing essential knowledge about conversation flow. That makes it easy to identify long reaction time, crosstalk, or responses of speakers in both channels.  This technology is only meaningful when used on recordings with 2 channels. As an answer to the TAE technology, SPE returns a json/xml file. This file includes general information about the technology and details of the time analysis. The technology can work either with a closed recording or with a stream. Monologue Describes the statistics of a recording related to one…

Time Analysis

Relevance: 7%      Posted on: 2018-04-15

Time Analysis Extraction (TAE) by Phonexia extracts base information from dialogue in a recording, providing essential knowledge about conversation flow. That makes it easy to identify long reaction time, crosstalk, or responses of speakers in both channels. This technology is only meaningful when used on recordings with 2 channels. As an answer to the TAE technology, SPE returns a json/xml file. This file includes general information about the technology and details of the time analysis. The technology can work either with a closed recording or with a stream. Monologue Describes the statistics of a recording related to one channel. channel…

Difference between on-the-fly and off-line type of transcription (STT)

Relevance: 6%      Posted on: 2017-12-11

Similarly to a human, the ASR (STT) engine adapts to the acoustic channel, environment and speaker. The ASR (STT) engine also learns more information about the content over time, which is used to improve recognition. The dictate engine, also known as on-the-fly transcription, does not look into the future and has information about just a few seconds of speech at the beginning of recordings. As the output is requested immediately during processing of the audio, the engine can't predict what will come in the next seconds of the speech. When access to the whole recording is granted during off-line transcription…

Phonexia Speech Platform

Relevance: 6%      Posted on: 2017-05-18

Phonexia Speech Platform (Speech Platform) provides partners with a complete portfolio of speech technologies with an easy-to-use design. The platform allows users to design and deploy a wide range of speech processing systems in a short time and without extensive knowledge of the technologies' background. Products On top of Speech Platform, several products are provided: for commercial market Phonexia Speech Analytics Phonexia Voice Biometrics for government market Phonexia Speech Analytics GOV Phonexia Voice Biometrics GOV Characteristics Completeness – all speech technologies in one place Simple to use – RESTful API for rapid development Modularity – build your own specific process workflow…

Speech Quality Estimator – Essential

Relevance: 4%      Posted on: 2018-04-04

Phonexia’s Speech Quality Estimator quantifies the acoustic quality of recordings. This helps the user to quickly determine whether the acoustic quality of a recording is good for processing with other speech technologies or not. As an answer for SQE, the SPE returns a json/xml file. This file includes general information about the technology and statistics of all (one or two) channels. The statistics of all channels include the numbers for many aspects of recording quality, and the overall global score. Technology The technology is language-, accent-, text-, and channel- independent Compatibility with the widest range of audio sources possible (applies…

SPE configuration

Relevance: 3%      Posted on: 2018-02-02

Basic explanation of configuration directives for SPE with hints & tips. Overview of phxspe.properties for beginners.

Keyword Spotting

Relevance: 3%      Posted on: 2019-06-03

Phonexia Keyword Spotting (KWS) identifies occurrences of keywords and/or keyphrases in audio recordings. It can help you to get valuable information from huge quantities of speech recordings. You only need to specify the keywords or phrases you wish to find. This technology identifies all recordings with keyword occurrences and allows you to automatically route important recordings or calls to your experts. Typical use cases Call centers increase operator and supervisor efficiency by searching calls identify inappropriate expressions from operators check marketing campaigns with automatic script-compliance control Mass media and web search servers index and search multimedia by keyword route multimedia…

Phonexia Workflow

Relevance: 3%      Posted on: 2019-08-06

Phonexia Workflow is a set of tools complementing Phonexia Speech Engine (SPE), which allow users to chain speech technologies into scenarios and process audio recordings automatically using these scenarios. Scenarios are programmed using uniform API which provides an abstraction over Phonexia Speech Engine application. Provided Phonexia Workflow scenarios: SalEssentials - Speech Analytics Essentials filters out low quality audio files, provides demographic information, age estimation and speech to text processing VbsEssentials - Voice Biometrics Essentials filters out low quality audio files, provides gender identification, age estimation and speaker identification  The scenario is a tiny Java application which interacts with Phonexia technologies…

Save Your Time

Relevance: 3%      Posted on: 2017-06-22

If you are just getting started, the following posts might be interesting for you: Phonexia Speech Platform is defined as an umbrella concept for all our products and services related to speech technologies. Main packages are Voice Biometrics and Speech Analytics. Phonexia Browser PhxBrowser - application for quick tests and visualization of speech technologies results. Speech Engine SPE3 - RESTful API - it is an adjustable server component which houses all speech technologies. Other "good to start" pages: Academy is to help partners to understand the market, Phonexia’s products and technologies. Manuals Glossary

Components and Tools

Relevance: 3%      Posted on: 2017-05-18

This section collects information about specific components and tools of our Speech Platform. API RESTful API - Phonexia Speech Engine v3 (SPE3) - recommended Apps and Tools Phonexia Browser v3 (Browser3) Voice Inspector v4 (VIN4) Voice Inspector v3 (VIN3) You might also be interested in the Product Portfolio or End of Life Components & Tools. You might also browse our product support lifecycle policy to see which of our versions are supported and maintained.

Browser3 – Releases and Changelogs

Relevance: 2%      Posted on: 2021-04-06

Phonexia Browser v3 (Browser3) is developed as client on top of Phonexia Speech Engine v3. Phonexia Browser is a successor of Phonexia Speech Intelligence Resolver v1 (SIR1). This page lists changes in Browser releases. Releases Changelogs Phonexia Browser 3.40.0, BSAPI 3.40.0 (2021-03-26) Public release New: Compatibility with SPE 3.40 Changed: Using new licensing system under the hood (internal change) NOTE: When using Browser with FLS (Floating License Server), you need to upgrade FLS to version 2.x in order to be able to use Browser 3.40+ with FLS. Phonexia Browser v3.30.13, BSAPI 3.30.14 (2021-03-25) Public release Fixed: One more issue in…

Product Portfolio

Relevance: 2%      Posted on: 2018-04-02

Phonexia Speech Platform is an umbrella concept for all Phonexia’s products and services related to speech technologies. It gives us the ability to customize various products to a wide range of customer needs. Platform Edition is an encapsulation of specific setup of speech technologies, modules, applications, utilities and services designed for a specific market segment. We distinguish Speech Analytics (SAL) and Voice Biometrics (VBS) as most common domain of usage. It is also a tool for marketing and sales. Voice Biometrics is focused more on identifying speaker, gender, language spoken and more. Speech Analytics focuses on gathering information about content…

Speech Analytics Course (technical training)

Relevance: 2%      Posted on: 2017-05-18

The Speech Analytics course consists of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part of the course is obligatory)
SAL course | Required time [h] | Block name | Block description
YES | 0,5 | Intro & Phonexia Portfolio | Intro & Phonexia Portfolio
YES | 0,5 | Project focus – Explain basic needs | Discussion of partner project focused mainly on finalizing the training topics and agenda.
YES | 0,75 | Application Design & Development – Licensing | Presentation of types of licensing, and how to use the license file.
YES | 0,75 | Technologies – Data gathering and Quality measurement – basic | Description of…

Q: I can’t manage to run Phonexia Browser software. I always get an error.

Relevance: 2%      Posted on: 2017-06-27

I always get the same error messages: unable to connect to the SPE; unable to start the localhost: giving up and kill the localhost. A: It might be because the initialization of SPE takes too long. Phonexia Browser treats this as an initialization failure and kills the server. You can proceed as follows: increase the timeout in Settings > Speech Engine tab > First connection timeout; use fewer instances of technologies; use smaller models of technologies.

Open Source Acknowledgement

Relevance: 2%      Posted on: 2018-04-06

This page collects information about Open Source code and licenses. You might be interested to ask your Phonexia contact what part of the page is relevant to your project. Phonexia Voice Verify dependencies
Name | Version | License
Django | 2.1.11 | BSD
Jinja2 | 2.11.2 | BSD-3-Clause
MarkupSafe | 1.1.1 | BSD-3-Clause
Pygments | 2.6.1 | BSD License
beautifulsoup4 | 4.9.1 | MIT
behave | 1.2.6 | BSD
behave-django | 1.4.0 | MIT
certifi | 2020.6.20 | MPL-2.0
chardet | 3.0.4 | LGPL
coreapi | 2.3.3 | BSD
coreschema | 0.0.4 | BSD
defusedxml | 0.6.0 | PSFL
django-allauth | 0.39.1 | MIT
django-constance | 2.7.0 | BSD
django-cors-headers | 3.4.0 | MIT License
django-environ | 0.4.5 | MIT
django-extra-fields | 2.0.5 | Apache-2.0
django-picklefield | 3.0.1 | MIT
django-rest-auth | 0.9.3 | MIT
djangorestframework | 3.9.1 | BSD
docker | 4.2.2 | …

Speaker Diarization

Relevance: 2%      Posted on: 2018-04-02

Speaker Diarization labels segments of the same voice(s) in one mono-channel audio recording based on the individual speaker's voice. It is a language-, domain- and channel-independent technology. It performs not only the segmentation of speakers, but of technical signals and silence as well. The outputs of the technology can be a log file with labels and/or split audio files/one new multichannel audio file. Correct speaker diarization is still a research task nowadays. Typical use cases: Preprocessing for other speech recognition technologies, labeling the parts of the utterance according to the speakers, splitting telephone conversation recorded in mono into several…

Speech Intelligence Resolver v1

Relevance: 2%      Posted on: 2017-05-18

About Phonexia Speech Intelligence Resolver v1 (SIR1) combines the power of speech technologies within a single application. The application automatically performs visualization of the record as well as filtering the speech metadata uncovered from your records effectively. Speech technologies implemented: Phonexia Speaker Identification (SID2) Phonexia Language Identification (LID2) Phonexia Gender identification (GID) Phonexia Voice Activity Detection (VAD) Phonexia Speaker Diarization (DIAR) Phonexia Keyword Spotting (KWS) Phonexia Speech Quality Estimator (SQE) Phonexia Speech Transcription (STT) SIR is a client application cooperating with REST servers. It can be used as a standalone application due to the integrated local REST server. It was…

Speech Analytics

Relevance: 1%      Posted on: 2018-04-06

Overview Phonexia Speech Analytics allows you to understand the content of audio without having to listen to it. The results help both commercial entities and security/defense forces make immediate, precise decisions and responses. The technologies automatically reveal WHAT content, TOPIC and KEY PHRASES are spoken, and many other metadata. Speech Analytics - Typical Use-Cases Speech transcription is used in various applications. Knowledge of the content of a whole call brings business value to the customer, compared to listening to the audio files by an analyst or supervisor. Reading the text is also faster than listening to the audio. Speech Analytics output…

Support

Relevance: 1%      Posted on: 2017-05-18

Technical support is available Monday - Friday, 9:00 - 17:00 CET. We will reply within 4 working hours. When reporting an issue with Speech Engine, please attach an SPE report, which may help the support staff to solve your issue faster. To create the report (available in SPE 3.10 and newer): 1. Go to the SPE installation directory. 2. Run ./phxadmin --report (Linux) or phxadmin.exe /report (Windows). 3. Zip up the created directory with the report and attach the ZIP file to your issue description. When reporting an issue with SPE older than 3.10, or with a different supported Phonexia product, please see the Get better support…

Speaker Identification: Results Enhancement

Relevance: 1%      Posted on: 2019-05-29

Speaker Identification (SID) Results Enhancement is a process that adjusts the score threshold for detecting/rejecting speakers by removing the effect of speech length and audio quality. This is achieved by use of Audio Source Profiles, that represent as closely as possible the source of the speech recording (device, acoustic channel, distance from microphone, language, gender, etc.). Although the out-of-the-box system is robust in such factors, several result enhancement procedures can provide even better results and stronger evidence. Audio Source Profile An Audio Source Profile is a representation of the speech source, e.g., device, acoustic channel, distance from microphone, language, gender,…

Manuals

Relevance: 1%      Posted on: 2017-05-18

This section collects links or locations of manuals for specific Phonexia Speech Platform components. API Phonexia Speech Engine REST API - SPE - latest version manual online (api_reference.html for your version is located in doc subdirectory in SPE folder or distribution ZIP) Brno Speech Application Interface v3 - BSAPI3 – latest version manual online Applications and Tools Phonexia Browser - PhxBrowser_manual.pdf is located in the root folder of the Browser application or distribution ZIP Phonexia Voice Inspector - VIN-manual.pdf is located in the root folder of the Voice Inspector application or distribution ZIP End of Life Products & Tools Speech…

Phonexia Browser

Relevance: 1%      Posted on: 2017-05-18

About Phonexia Browser v3 (Browser v3) is software that combines the power of speech technologies in a single desktop application. The application automatically performs visualization of records as well as effective filtering of speech metadata uncovered from the user's records. Speech technologies implemented: Speaker Identification (SID) Language Identification (LID) Gender identification (GID) Voice Activity Detection (VAD) Speaker Diarization (DIAR) Keyword Spotting (KWS, 10+ languages available) Speech Quality Estimator (SQE) Speech to Text (STT, 10+ languages available) Age Estimation (AGE) Browser v3 is a client application cooperating with Speech Engine v3 (SPE3). It is possible to use it as a client -…

Workflow – Releases and Changelogs

Relevance: 1%      Posted on: 2019-10-07

Phonexia Workflow is a set of tools complementing Phonexia Speech Engine (SPE), which allow users to chain speech technologies into scenarios and process audio recordings automatically using these scenarios. This page lists changes in Workflow releases. Changelogs == Phonexia Workflow v1 == Phonexia Workflow 1.4.1 (10/07/2019) - SPE 3.16 - 3.17 Support for IPv4 only (since SPE does not support IPv6) Configurable application webhook address in both Workflow Runner and Data Discovery Tool This address is auto-detected when no value is supplied - default In some cases like network specific configuration it might be necessary to configure it manually Rapid…

Site Map

Relevance: 1%      Posted on: 2017-06-23

Phonexia Speech Platform Phonexia Speech Platform for Enterprise Phonexia Speech Analytics (SAL) Phonexia Voice Biometrics (VBS) Phonexia Speech Platform for Government Phonexia Speech Analytics GOV (SAL.gov) Phonexia Voice Biometrics GOV (VBS.gov) Components and Tools Phonexia Speech Engine v3 Speech technologies available Phonexia Browser v3 Phonexia Voice Inspector v3 Speech Intelligence Resolver v1 End of Life Components & Tools Phonexia Voice Inspector v1 Knowledge Base Blog Case Studies Demos Frequently Asked Questions (FAQ) How To… Lifetime Support Policies Manuals Presale Whitepapers and Presentations Product Briefs Developer Corner Code Examples Hints for App Design Hints for App Development List of Resources Phonexia…

Frequently Asked Questions (FAQ)

Relevance: 1%      Posted on: 2017-05-18

You might browse the FAQ by topic or tags: FAQ - complete list of posts tagged: Speech Engine v3 (SPE3) related tagged: Voice Biometrics (VBS) related tagged: Speech Analytics (SAL) related   Please leave us a comment, if you find any incompleteness and need more details.  

TUTORIAL: Speaker Identification – How to Do a Basic Test

Relevance: 1%      Posted on: 2019-10-08

Phonexia Speaker Identification is a voice biometry tool for recognition of speakers by their voice. In this video, we will show you how to start using this technology! You will learn how to create a "Speaker Model" to identify a speaker in a set of data. Ready to test it? Start with our video: What else is needed? 1. Phonexia Evaluation Package The evaluation package (download page) consists of Phonexia Browser and Phonexia Speech Engine including all necessary technologies. 2. Data We prepared a dataset for your testing. The package contains data for speaker model creation and for speaker spotting too. The…

Voice Inspector – Interpretation of results

Relevance: 1%      Posted on: 2019-06-24

Introduction Phonexia Voice Inspector (VIN) is a tool for forensic automatic speaker identification, compliant with the Methodological Guidelines for Best Practice in Forensic Semiautomatic and Automatic Speaker Recognition, published by the European Network of Forensic Science Institutes.  This post explains individual SID score types and ways to visualize the results in a speaker identification case implemented in Voice Inspector. Evidence In VIN, the term evidence has two meanings. In general, it refers to any SID score that the system calculates for any pair of recordings in the case. These scores are the output of the Phonexia SID technology which runs…

PHR

Relevance: 1%      Posted on: 2018-02-01

Phoneme recognizer – currently part of Keyword Spotting (Phonexia Keyword Spotting - acoustics based ASR, several tec...) technology in Phonexia Speech Engine  (REST Application Program Interface)

How to configure STT realtime stream word detection parameters

Relevance: 1%      Posted on: 2020-03-28

One of the improvements implemented since Speech Engine 3.24 is a neural-network based VAD, used for word and segment detection. This article describes the segmenter configuration parameters and how they affect the realtime stream STT results. The default segmenter parameters are as shown below: [vad.online_segmenter:SOnlineVoiceActivitySegmenterI] backward_extensions_length_ms=150 forward_extensions_length_ms=750 speech_threshold=0.5 Backward and forward extensions are intervals in milliseconds, which extend the part of the signal going to the decoder. The decoder is a component which determines what a particular part of the signal contains (speech, silence, etc.). Based on that, the decoder also decides whether a segment has finished or not. Unlike in file processing…
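To make the backward and forward extension parameters from the excerpt above concrete, here is a small sketch. It assumes the quoted settings can be treated as INI-style text (a bracketed section followed by `key=value` lines) and that the extensions simply widen a detected speech interval on both sides. The function names and the example interval are hypothetical, and the real segmenter may behave differently in detail.

```python
import configparser

# Default parameters copied from the excerpt, one setting per line.
SEGMENTER_CONFIG = """\
[vad.online_segmenter:SOnlineVoiceActivitySegmenterI]
backward_extensions_length_ms=150
forward_extensions_length_ms=750
speech_threshold=0.5
"""

SECTION = "vad.online_segmenter:SOnlineVoiceActivitySegmenterI"


def load_segmenter_params(text):
    """Read the three segmenter settings from INI-style text (assumed format)."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser[SECTION]
    return {
        "backward_ms": section.getint("backward_extensions_length_ms"),
        "forward_ms": section.getint("forward_extensions_length_ms"),
        "speech_threshold": section.getfloat("speech_threshold"),
    }


def extend_segment(start_ms, end_ms, backward_ms, forward_ms):
    """Widen a detected speech interval; the start is clamped at 0 ms."""
    return max(0, start_ms - backward_ms), end_ms + forward_ms


params = load_segmenter_params(SEGMENTER_CONFIG)
# Hypothetical speech interval detected between 1.0 s and 2.0 s.
extended = extend_segment(1000, 2000, params["backward_ms"], params["forward_ms"])
print(extended)
```

With the default values, the hypothetical 1000-2000 ms interval is widened by 150 ms at the start and 750 ms at the end before it would reach the decoder.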

SPE

Relevance: 1%      Posted on: 2018-02-01

Phonexia Speech Engine (RESTful API)

What is a user configuration file and how to use it

Relevance: 1%      Posted on: 2020-03-28

Advanced users with appropriate knowledge (gained e.g. by taking the Phonexia Academy Advanced Training) may want to fine-tune the behavior of the technologies to adapt to the nature of their audio data. Modifying original BSAPI configuration files directly can be dangerous – inappropriate changes may cause unpredictable behavior, and without a backup of the unmodified file it's difficult to restore a working state. User configuration files provide a way to override processing parameters without modifying original BSAPI configuration files. WARNING: Inappropriate configuration changes may cause serious issues! Make sure you really know what you are doing. User configuration file is a…

SPE3 – Administration and Backup

Relevance: 1%      Posted on: 2018-04-15

Each Partner has its own administration and backup policy. Here, we highlight the most important SPE3 components to be administered and backed up. Administration It is strongly recommended to describe your own administration approach for the following components: SPE users (accounts) - Partner should maintain a list of SPE users (accounts). Only a few persons should have the “admin” role. All others should have the “user” role (cannot see content of other “user” accounts) and/or the “vbs” role (enables or disables use of the VoiceBiometry plugin); the SPE database and/or VBSplugin database administration – where the (temporary) results are stored; user.home - where the…

How to convert STT confusion network results to one-best

Relevance: 1%      Posted on: 2020-04-06

Confusion Network output is the most detailed Speech Engine STT output, as it provides multiple word alternatives for individual timeslots of the processed speech signal. Therefore many applications want to use it as the main source of speech transcription and perform eventual conversion to less verbose output formats internally. This article provides the recommended way to do the conversion. Time slots and word alternatives: The recommended algorithm for converting Confusion Network (CN) to One-best is as follows: loop through all CN timeslots from start to end; in each timeslot, get the input alternative with the highest score and if it's not <null/> or…
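The visible part of the algorithm above can be sketched in a few lines. This is an illustration only: the data layout (a list of timeslots, each a list of `(word, score)` pairs) and the function name are assumptions, not the SPE result format. The excerpt is cut off after `<null/>`, so any further labels to skip are unknown here and are left to the caller through the `skip_labels` parameter.

```python
def confusion_network_to_one_best(timeslots, skip_labels=frozenset({"<null/>"})):
    """Pick the highest-scoring alternative per timeslot, dropping skipped labels.

    timeslots:   list of timeslots in time order; each timeslot is a list of
                 (word, score) pairs (assumed layout, for illustration only).
    skip_labels: labels that must not appear in the output. Only <null/> is
                 known from the excerpt; extend the set as the full article
                 requires.
    """
    one_best = []
    for alternatives in timeslots:
        if not alternatives:
            continue  # nothing to choose from in an empty timeslot
        word, _score = max(alternatives, key=lambda alternative: alternative[1])
        if word not in skip_labels:
            one_best.append(word)
    return one_best


# Hypothetical confusion network with three timeslots.
example = [
    [("hello", 0.9), ("hollow", 0.1)],
    [("<null/>", 0.7), ("a", 0.3)],
    [("world", 0.6), ("word", 0.4)],
]
result = confusion_network_to_one_best(example)
print(" ".join(result))
```

In the second timeslot the best alternative is `<null/>`, so that slot contributes no word to the one-best output.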

Designing and Developing Application

Relevance: 1%      Posted on: 2018-04-15

Before designing and developing the application, we encourage Partner to find clear answers to the following questions: Customer requirements: Do my customers need file processing (audio) or stream processing in real time? What human resources does the customer have to analyze the results? How many minutes per day or streams in parallel do my customers need to process? What are the real benefits for the customer (finding the needle in a haystack, gaining new information, processing only a small amount of data with the highest possible accuracy)? How does the solution match the current processes and infrastructure of the customer? How many false alarms are acceptable…

Phonexia Partner Program for Government Partners

Relevance: 1%      Posted on: 2020-08-25

This partnership program rewards partners in the government sector for selling and integrating Phonexia’s speech recognition and voice biometrics product portfolio. Program Enrollment If you aspire to become a Phonexia partner, you can enroll in the Phonexia Partner Program and complete a three-month onboarding period. During this period, you will enjoy the same partnership benefits as our Silver partners. Your assigned Phonexia Account Manager will take you through all necessary legal documents, highlight every business aspect of our cooperation, and organize two calls with a pre-sales person to ensure that you understand the…

SPE3 – Quick Start Guide

Relevance: 1%      Posted on: 2018-04-16

Do you want to run SPE3 for the first time? This post can help you. Distribution, installation and configuration SPE is distributed by Phonexia in .zip archives. These are downloaded from the Phonexia package manager using a link provided by a Phonexia employee. Installation is done by simply unzipping the content of the downloaded .zip archive to the SPE installation folder. Configuration of SPE is done in two places. The first is the executable file ./phxadmin or .\phxadmin.exe, which serves to: set the configuration and license files; configure speech technologies; configure user accounts; set up a few other settings. Running the ./phxadmin or .\phxadmin.exe command…