Skip to content Skip to main navigation Skip to footer

Search: wer

47 results

Phonexia technology models EoL

Speech to Text (STT) and Keyword Spotting (KWS) models Languages supported by Speech To Text and Keyword Spotting Standard = Maintained until newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) Model name Released End of support Maintenance Arabic (Gulf, Kuwait) AR_KW_6 2022-04 8th gen. Standard Arabic…

FAQs (Browser)

…localhost: giving up and kill the localhost. A: This error may happen if the initialization of SPE engine takes too long. Phonexia Browser software treats it as initialization failure and kills the server. You can fix this by doing the following: Increase timeout in Settings > Speech Engine tab > First connection timeout Use fewer instances of technologies, thus letting…

Support Lifecycle Policy (PSP)

General Lifecycle of Phonexia products is driven by Phonexia Product Support and Lifecycle Policy (valid from Q3/2019). Content of our support and software versioning approach is defined as well in this document. Specific versions of our products and languages are supported and maintained according to following tables. Phonexia Speech Engine Version Release Date End of Support Maintained Until Release type…

STT: Configuring word detection parameters for stream transcription

…middle value is −0.5. (yes, minus 0.5… the value in config file is set higher than the default value) Lower values mean “even if there is not a real silence in the signal, consider it being a silence“, i.e. end of segment is detected more frequently. Higher values mean “even if there is a silence in the signal, consider it…

Terms of Service

…such party. Such Force Majeure Events include, but are not limited to, adverse weather conditions, flood, fire, explosion, earthquake, volcanic action, power failure, embargo, war, revolution, civil commotion, act of public enemies, labor unrest (including, but not limited to, strikes, work stoppages, slowdowns, picketing or boycotts), inability to obtain equipment, parts, software or repairs thereof, acts or omissions of the…

Pricing

…NOT the number of voiceprints in the database against which that voiceprint is verified. The invoiceable actions are tiered into different intervals based on the amount of invoiceable actions used per month. The higher the interval, the lower the price per action within that interval. This ensures that you can use Voice Verify without any limitations and it will auto-adjust…

Documentation (VIN)

…Quick Start Guide Using the Application Interpretation of Results Description of Other Inbuilt Tools Troubleshooting In case of any problems or questions, please first search the manual for relevant keywords. If you don’t find an answer in the manual, contact our support. You can also browse the FAQ section or use the search function of the Partner Portal and look…

KWS: Results explained

…rather high scores and confidences If e.g. the word sale would be defined with a threshold value e.g. 0.20 in the keyword list, the second occurrence would not appear in results at all, since its confidence is lower than the threshold. … { “channel_id”: 0, “score”: 4.5108547, “confidence”: 0.9891304, “start”: 171400000, “end”: 175900000, “word”: “sale_0” }, { “channel_id”: 0, “score”:…

Understand SPE multithreaded technologies initialization

…detected CPU cores. The value can be set manually to other number as needed. If the number of initialization threads is set (either automatically, or manually) lower than the number of technology–model combinations to be initialized, the initialization is queued and the initializations are performed by whichever thread becomes available. In fact, single-threaded initialization follows the same principle… If the…

Speech Quality Estimation (SQE)

Phonexia’s Speech Quality Estimation quantifies the acoustic quality of recordings. This helps the user to quickly determine whether the acoustic quality of a recording is good for processing with other speech technologies or not. As an answer for SQE, the SPE returns a json/xml file. This file includes general information about the technology and statistics of all (one or two)…

STT: Adding words to language model on the fly

…word being ignored during transcription (see the warning_message parameter below). Transcription result If preferred phrases and/or words were specified when starting the transcription, the result contains the same phrases and dictionary structures which were used as input for the transcription task. The dictionary structure is enriched with pronunciations part, generated automatically for words which did not specify pronunciations in the…

Understand SPE processing priority

…they get processed as soon as possible, without waiting in the queue. Similarly, if one has a bunch of “not so important” files for processing, these can be sent for processing using lower priority… and they will get processed only later, after the higher-priority tasks in the queue get processed. To use task prioritization, you need to have task priorities…

Q: How do I get results for a pending operation?

…process the file. In response HTTP header (in parameter “Location”) there is path for pending resource. In the body there is a ID of pending operation. Polling: Client asks on the pending resource (e.g. “get /pending/{ID}). Server will answer with status 200 and in the body there is a status of operation: “running”. Client will repeat this request periodically with…