Skip to content Skip to main navigation Skip to footer

Search: file%20formats

94 results

Q: How can we test Phonexia technologies?

…functionality of all technologies. The license validity is 90 days to allow you to test the technologies. Note: by default a NET license is provided for testing. This license needs to have active Internet connection to a phonexia licensing server in order to function. Rest assured no data – audio, metafiles or even analytical files, are ever sent to phonexia.com….

Designing and Developing Application

Before designing and developing the application, we encourage Partner to find clear answer for the following questions: Customer requirements: Do my customers need file processing (audio) or stream processing in real time? What is the human power of the customer that can analyze the results? How many minutes per day or streams in parallel do my customer need to process?…

Q: What are the supported audio formats?

…configured do this conversion automatically in background, see Understand SPE audio converter article. Great tools for converting other than supported formats to supported are FFmpeg (http://www.ffmpeg.org) or SoX (http://sox.sourceforge.net/). Both are multiplatform software tools for Microsoft Windows, Linux and Apple OS X. Example of usage: FFmpeg ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav This command converts any supported format/codec audio file to normalized…

Phonexia technologies introduction

…technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis Extraction (TAE) 14:22 Speech Platform architecture; Speech Engine, Phonexia Browser, Phonexia Voice Inspector brief 18:52 HW and SW requirements, typical deployment topologies 21:34 Supported file– and stream formats, typical implementations and data flows 27:29 Licensing technical options 32:24 Summary, recommended next steps   https://youtu.be/DDu0Y1rgQ6k…

Phonexia End User License Agreement

…particular order or agreement under the following terms: 1.1 The Client may use the Software only for the duration of a valid license file generated by Phonexia; 1.2 The Client shall run the Software on its own or outsourced hardware and agree to do so at its own risk; 1.3 The Client may use the Software for which Phonexia requires…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

STT: Results explained

…n-best output (since version 3.30) The n-best results are updated after each segment/sentence, i.e. they are only available in output when end-of-segment boundary (</segment> token) is encountered in the one-best output. Examples Examples of new generation and legacy file processing Speech To Text outputs: … { “channel_id” : 0, “score” : 0, “confidence” : 0, “start” : 0, “end” :…

Understand SPE database scripts

…updating SPE to newer version is: Check the SPE changelog and get DB version number used by your current SPE used by the SPE you are updating to Run every script starting with DB version from which you are updating, until the database version to which you are updating ⓘ Detailed update instructions are always described in UPDATE.txt file in…

Download Voice Inspector 5.2

…downloaded package (ZIP) to a location of your choice (e.g. ~/PhonexiaVIN/). Save the license.dat file to the root of your Voice Inspector directory (e.g. ~/PhonexiaVIN/) and plugin Phonexia USB token to USB port, when USB licensing is in use. Run ./VoiceInspector (Linux) / VoiceInspector.exe (Windows). Set up wizard will be launched automatically and help you with the first launch. You…

Documentation (SPE)

Partners and customers are encouraged to read Speech Engine (PhxSpe | PhxSpe.exe) software API reference and various manuals available as files in [SPE]/doc in standard software package and installation. You can also find REST API reference (Speech Engine) documentation online. You might be interested in reading the following information in manual: REST API reference Structure of API queries Asynchronous request…

Q: What are the requirements for SID evaluation dataset?

…in each recording (i.e. usually 2+ minutes recording length) only one speaker in each recording wide variety of gender and age is recommended recordings should be as similar to the target use case as possible (device, channel, distance from mic, languages distribution) audio files should be mono, lin16 format, 8 kHz+ sample rate *Note: splitting single recording into multiple shorter…

Speech Engine

To create the SPE report: Go to the SPE installation directory Open command line/terminal (in Ubuntu Linux Right click + press E, in Windows type cmd in the address bar) Run ./phxadmin –report (Linux) or phxadmin.exe /report (Windows) Zip up the created directory with report and attach the ZIP file to your issue description The Report functionality is not present…

Voice Inspector

…log: Go to the Voice Inspector installation directory Open command line/terminal (in Ubuntu Linux Right click + press E, in Windows type cmd in the address bar) Run ./VoiceInspector &>report.txt (Linux) or VoiceInspector.exe >report.txt (Windows) Do the sequence of steps leading to the issue you are experiencing Close Voice Inspector as usual attach the report.txt file to your issue description…