Skip to content Skip to main navigation Skip to footer

Search: age%20estimation

96 results

Q: I can’t manage to run Phonexia Browser software. I always get an error.

I always get the same error messages: unable to connect to the SPE unable to start the localhost: giving up and kill the localhost. A: This error may happen if the initialization of SPE engine takes too long. Phonexia Browser software treats it as initialization failure and kills the server. You can fix this by doing the following: Increase timeout…

Privacy Policy

usage such as how often you use your Phonexia Account, how often you upload audio, video or other files, the size of generated content and other activity related to your use of Phonexia services. 1.2 Computer browser Some information is also provided by your computer browser through cookies. By using our services, you agree to use of the cookies. Certain…

Phonexia Ethical Code

…maintain these standards and promote highly ethical reputation of Phonexia. To that end, all our personnel including agents, consultants and contractors as well as distribution partners involved in Phonexia´s international business activities must read, become familiar and comply with this Phonexia Ethical Code (the “Code”), as amended by Phonexia from time to time. Furthermore, we request our business partners to…

STT: Results explained

…milliseconds. Score is logarithm of probability from {-inf,0} interval – the higher score, the higher probability that the word was spoken in that time interval. Confidence is a probability from {0,1} interval. It’s calculated from the score value using e score formula. Multiplying the value by 100 gives the confidence percentage. NOTE: Some ancient legacy models do not support confidence….

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

…Evaluation Package Evaluation package (download page) is consisting of Phonexia Browser and Phonexia Speech Engine including all necessary technologies. 2. Data We prepared the dataset for your testing. Package contains data for speaker model creation and speaker spotting too. The process of testing is the same for the data set collected by the user himself. Dataset is available to download…

SID4 performance on Intel® Xeon® Platinum 8124M

…32GB RAM, 30GB SSD based storage, 1000 I/O.s-1 reserved per core Benchmark data setup Data set statistic: Number of files: 32 [300 seconds each] RAW recordings total length: 9600 seconds Net speech total length: 4224.77 secons Data set contains 44% of speech signal, 56% of silence or technical signal Statistic counted by Phonexia VAD 3.22.1, “vad_2.bs” settings (AKA strict VAD,…

Understand SPE connectors for external TTS

…from stdin is as follows: { “text”: string, “voice”: { “name”: string, “languageCode”: string } } Where: text is the text to be synthesized name is a voice name to be used for synthesis (ref. to the voice names provided in the connector “info” data) languageCode is a language code defining the language to be used for synthesis (ref. to…

Understand SPE technologies configuration file

…the technology and model. However, this feature should be used only in special cases, e.g. if suggested by Phonexia experts. SPE users should normally not fiddle around with BSAPI configuration files… and if some technology config customization is needed, the user configuration file is the right method. Technology names supported in technologies configuration file: AGE Age Estimation DENOISER Denoiser DIAR…

Speech Quality Estimation (SQE)

…channels. The statistics of all channels include the numbers for many aspects of recording quality, and the overall global score. Technology The technology is language-, accent-, text-, and channel- independent Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input Input format for processing: WAV or RAW (8 or 16 bits…

Releases and Changelogs (VIN)

…Added the ability to define custom language in the speaker metadata Fixed: When discarding a changed photo, the confirmation dialog “Do you want to save…” popped up infinitely Fixed: Missing file names when the SID Evaluator evaluates speakers from the workspace Fixed: Unwanted extra comparisons when the SID Evaluator evaluates speakers from the workspace (instead of comparing only A x…

Designing and Developing Application

Before designing and developing the application, we encourage Partner to find clear answer for the following questions: Customer requirements: Do my customers need file processing (audio) or stream processing in real time? What is the human power of the customer that can analyze the results? How many minutes per day or streams in parallel do my customer need to process?…

Phonexia technologies introduction

…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…

Understand SPE database scripts

This article explains details and usage of SQL database scripts stored in SPE installation directory in /data/database subdirectory. These scripts are intended for setup and maintenance of SPE database for supported database types, currently SQLite and MariaDB (from SPE 3.46) / MySQL (up to SPE 3.45). Script types For each database type, there are two directories with two types of…

Understand SPE audio converter

…file format ‘C:\TMP\tmp9408aaaaaa’: BsapiException: SWaveFileI(1751): Corrupted WAVE file format: ‘C:\TMP\tmp9408aaaaaa’. 2021-01-30 20:49:26 [Trace] ConverterSubsystem: Converting C:\TMP\tmp9408aaaaaa -> C:\TMP\tmp9408baaaaa.wav 2021-01-30 20:49:27 [Debug] ConverterSubsystem: File C:\TMP\tmp9408aaaaaa has been converted. 2021-01-30 20:49:27 [Trace] ConverterSubsystem: Removed temporary file: C:\TMP\tmp9408aaaaaa 2021-01-30 20:49:27 [Trace] Data: Moving: ‘C:\TMP\tmp9408baaaaa.wav’ -> ‘D:\SPE\home\admin\storage\test1.wav’ 2021-01-30 20:49:27 [Trace] Data: Moved: ‘C:\TMP\tmp9408baaaaa.wav’ -> ‘D:\SPE\home\admin\storage\test1.wav’ 2021-01-30 20:49:27 [Trace] Data: File ‘/test1.wav’ registered in database…