Application of the Code It is the policy of Phonexia, s.r.o. (“Phonexia”, “we”) to maintain the highest level of ethical standards in the conduct of our business affairs. Our values guide our actions in all cases. The actions and conduct of our officers, directors and employees (collectively, “Phonexia personnel”), as well as others acting on our behalf, are essential to maintain these standards and promote highly ethical reputation of Phonexia. To that end, all our personnel including agents, consultants and contractors as well as distribution partners involved in Phonexia´s international business activities must read, become familiar and comply with this…
Search Results for: ROM
|Results 41 - 60 of 74||Page 3 of 4|
|Results per-page: 10 | 20 | 50 | 100|
What we believe in At Phonexia, we find joy in pushing the boundaries of innovation in the field of speech technology by automating and simplifying solutions for many of today’s complex communication and security-strategic challenges. By providing our partners and customers with state-of-the art speech-technology software, we leverage the power, and data, in their voices. Who we are Phonexia is the only speech technology software manufacturer that reveals and leverages the most data in speech for enterprising trailblazers across the globe who want to discover and develop powerful new skills in a knowledge-based economy. We have more than 19 years…
This document describes all licensing types for Phonexia product licensing available to our partners and customers. Each partner/customer can choose the licensing variant which best fits the current project or infrastructure. The document does not describe business conditions of Phonexia licensing. What is the License? The License is a formal agreement regarding “The Product Usage Rights” between Phonexia s.r.o. and a user of any Phonexia technology or Phonexia product. Licenses are issued by the Business Department for all speech technologies and products, and may be required in order to use utilities and tools developed by Phonexia or partners. For technical…
Basic explanation of configuration directives for SPE with hints & tips. Overview of phxspe.properties for beginners.
Best practices for good sizing of Phonexia technologies depend on a few facts: Intense work with large data sets requires good performance and bandwidth between RAM and CPU. It all depends on the size of the files with technological models data, usually loaded into RAM and used intensively for computing operations Always think only about physical cores of CPU (HT, VT features can't help in performance) Also seek for CPUs with a large L3 cache. And the better CPUs are those with higher l3_cache_size/#_of_physical_CPU_cores ratio. We currently assume that CPUs from the current Intel Xeon Family in the 4th generation…
Voice Print – output from spoken speech extraction process of SID. Unique mathematical representation of the specific speaker or recording is created in form of the iVector (for SID generation 3) or xVector (Deep Embeddings for SID generation 4).
Median – Value separating higher half of data sample from lower half.
Likelihood Ratio – Result from statistical test for two models comparation. It gives back number which expresses how many times more likely the data are under one model than the other. LR meets numbers in interval <-∞;+∞>
Language Print Archive - pack of language prints from the recordings spoken in the same language/dialect. Used for the language identification in LID comparison.
Language Print - output data from LID technology
Features – FEA is optional output from KWS technology. Looking for keywords in FEA is faster than in original recording.
Distribution of audio and video content to a dispersed audience via any audio or visual mass communications medium, but usually one using electromagnetic radiation (radio waves)
A: Via HTTP header “Accept” parameter (application/json; application/xml) Via request query “format=json/xml” If the format is not defined (or the HTTP header "Accept" parameter has one of these values: application/*,*/*,*), server will return json.
A: From the utilities in the package, you can find it in "ffprobe <file_name>", it will write out the info about the file. *Utility "ffprobe" is not included in our package(s). It is part of ffmpeg (https://ffmpeg.org/ffprobe.html) and it is neccessary to install it separately.
A: The following is recommended: For adding new language to language pack 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% of speech) Only 1 language per record For adapting the existing language model (discriminative training) 10+ hours of audio for each language May be done on customer site. May be done in Phonexia using anonymized data (= language-prints extracted from a .wav audio)
A: The language-prints do not depend on the current language pack used. You may use them for both training a new language pack and testing/comparing against an existing language pack. The language-prints needs to be compatible only with the model of LID used for language-print extraction.