Skip to content Skip to main navigation Skip to footer

Search: audio requirements

15 results

LID: Terminology and adaptation

…20 hours of audio is required, see requirements below Enhancing existing language model by adding your own audio files to existing built-in language at least 5 hours of audio is required, see requirements below Creating custom language pack consisting of your chosen set of languages, both pre-trained or created from your audio files Audio recordings requirements Format: WAV, FLAC, RAW…

FAQs (PSP)

…Browser, FAQ Speech Platform Permalink Q: How to fix Error 1007: Unsupported audio format? Phonexia Browser application may return error “1007: Unsupported audio format” during uploading audio file. Please consider if your audio files are in Q: What are the supported audio formats? . But if you need use as input audio recordings in other formats, you can configure SPE…

FAQs (Browser)

audio format” during uploading audio file. Please consider if your audio files are in Q: What are the supported audio formats? . But if you need use as input audio recordings in other formats, you can configure SPE for audio automated conversion. As prerequisite install external tool for audio conversion. Recommend is ffmpeg utility, powerful and well documented. Please find…

Key Features (PSP)

…in the Languages Available section. Speech To Text (STT) and Keyword Spotting (KWS) languages Language Identification (LID) languages Supported Audio input The Speech Engine server supports various audio formats as listed in API reference > Audio requirements. It also supports the RTP/HTTP stream processing as listed in API reference > RTP/HTTP streams. The Speech Engine allows the usage of some…

SID: Speaker Identification: Results Enhancement

…is robust in such factors, several result enhancement procedures can provide even better results and stronger evidence. Audio Source Profile An Audio Source Profile is a representation of the speech source, e.g., device, acoustic channel, distance from microphone, language, gender, etc. Technically, an Audio Source Profile is an entity that contains all information required for any system calibration or result…

Q: What are the requirements for SID evaluation dataset?

For evaluating the real life scenario of Phonexia Speaker Identification technology, the system needs to be calibrated by SID dataset. SID dataset (minimum requirements): To measure SID performance precisely, it’s important to prepare evaluation recordings set very carefully. The requirements are: 50+ known speakers, 200+ recordings in total (i.e. 3 to 5 recordings per speaker*) 1+ minute of net speech…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…computing performance is better by ~17% compared with Intel® Xeon® E5 2860 v4 FtRTaudio shows that real requirements for HW and its computing power are approx. 62% lower than traditional approach using FtRTnet_speech for audio dataset with similar ratio between speech and non-speech (silence) and it is proven by measuring it. Best practices Use FtRTaudio when calculating hardware sizing and…

Documentation (SPE)

…files in [SPE]/doc in standard software package and installation. You can also find REST API reference (Speech Engine) documentation online. You might be interested in reading the following information in manual: REST API reference Structure of API queries Asynchronous request Task prioritization Authentication Audio requirements RTP/HTTP streams Error responses API Commands Usage examples API Requirements Installation guide And much more…

Terms of Service

…any electronic data provided by you, including but not limited to any kind of audio and video materials, music, sounds, texts and pictures. You shall remain at all times solely responsible for the content of your account. You shall avoid uploading any kind of illegal, harmful, abusive or otherwise inappropriate content to your account: PHONEXIA reserves the right to remove…

Designing and Developing Application

…specific hardware (mainly CPU, virtualized infrastructure vs. HW) or are you going to buy specific HW for customer? What is short/long time storage requirements (ie. audio and results availability, desktop vs. distributed system)? Is there any synchronization required (ie. voiceprint database to clients)? What is the topology of the solution/app (ie. where to store audio, voiceprints, results, …)? How to…

Speaker Identification (SID)

…are monitoring a large number of audio recordings or streams and we are looking for the occurrence of a specific speaker(s). Speaker spotting can be deployed for the purpose of Fraud Alert. Speaker Verification is the case when we are asking “Is this Peter Smith’s voice?”, such as when a person calls the bank and says, “Hello, this is Peter…

Recommended OS and HW (PSP)

…external dependencies like databases, storages, etc.) would require additional resources. Therefore you should always perform a proper load test using your entire system to determine the actual HW requirements. To give you a picture, here are recommendations for typical configurations: Voice Biometrics, basic 100 hours/day package (***) files processing CPU: 8 physical cores, 1x Intel® Xeon E5-2640 v4 or similar…

Download Speech Platform

Step #1 – Download Try and evaluate all Phonexia speech technologies either via REST API using Speech Engine, or using the demo/testing GUI application named Phonexia Browser. Hardware requirements recommended: Intel Core i7 or better, 32 GB free RAM, 10+ GB storage (SSD preferred) minimum: Intel Core i5, 16 GB free RAM, 10 GB storage (SSD preferred) To prevent various…

Phonexia End User License Agreement

…downloading any necessary materials or software, even if Phonexia has been advised of the possibility of such damages. 4.2 Phonexia recognizes and agrees that the Client remains the sole owner of the title to any data provided to Phonexia while using the Web demo license, including audio recordings, transcripts, personal information, or any intellectual property rights contained therein (the “Provided…

Download Voice Inspector 5.2

…models VIN application (graphical user interface, GUI) with the following technologies in-build Speaker Identification (SID4_XL5) Speaker Diarization (DIAR) Voice Activity Detection (VAD) Speech Quality Estimator (SQE) Phoneme Recogniser (PHNREC) example population sets and audio (in ./examples/) and example report templates (in ./templates/) Hardware requirements minimum – CPU: Intel® Core™ i5, RAM: 4 GB, Required HDD space: 0.5 GB for software…