Skip to content Skip to main navigation Skip to footer

Search: Configuración del servidor

64 results

Understand SPE configuration file

…the configuration file. server.logging.file.purge_count # The value specifies the maximum number of archived log files. If the number is exceeded, # archived log files are deleted, starting with the oldest. server.logging.file.purge_count = 5 Sets the log files housekeeping strategy – how many most recent log files to keep. Default value is 5 days. server.logging.enable_async # Use separate thread for logging….

Understand SPE directory structure

…data directory holds additional data files for entities created by that user – e.g. SID Speaker Models, or LID language packs. If there no such entities exist for that user, this directory is empty. Here is an example of admin‘s data directory containing custom LID language pack for model L4 and SID speaker models named “David” and “Paul” (the tree…

Understand SPE database scripts

…MySQL command line client) use create_schema.sql script then use init_data.sql script when you need to clean your SPE DB (and don’t want to delete/re-create the entire DB for some reason) use drop.sql to completely erase the DB content, followed by re-creating the content using create_schema.sql and init_data.sql or use clean.sql to clean “rest_directory_type”, “rest_role”, “rest_user”, “rest_technology_model” and “rest_model_lid” tables Scripts…

Phoneme Recogniser (PHNREC)

…user can add to language model of speech-to-text technology (better accuracy of KWS technology). Input audio file (format details – see Speech Engine documentation); stream not supported, technology model name (i.e. language code) to be used for phoneme transcription. Output In the process of transcribing speech-to-phonemes, the Phoneme Recogniser usually identifies individual speech segments and convert it to pronunciation. Example…

Understand SPE metafiles

DELETE methods to upload, download or delete any kind of file with metadata of your choice, associated with the corresponding SPE entity. There are no limits on the content of the metafiles, their names, etc. (apart from those imposed by the underlying operating system and/or filesystem). Plain text files, structured formats like JSON or XML, pictures, documents, multimedia files… you…

Phonexia Speech Engine

…– results are then returned immediately from the cache instead of complete re-processing of the audio file. Own persistent data storage SPE keeps uploaded audio files in its own persistent storage space, so the original source files can be archived or deleted after upload. Data privacy SPE keeps information about audio file or stream only as long as the file…

Privacy Policy

…or transcribed content within your account. Deleted content may remain archived for back-up purposes within our system for a period of time. We have no responsibility in case that content becomes lost due to your deletion. 4. COOKIES We use cookies to remember selections and preferences that you’ve already made or information that you’ve already given. You can control or…

STT: Configuring word detection parameters for stream transcription

…i.e. the backward extension value actually says for how long the processing must be delayed (processing has to wait until that much input signal arrives) ⇒ increasing this value means that speech activity is detected with longer delay (e.g. means delayed barge-in detection in voicebot implementation). The forward extension value basically means “add this much of a following signal to…

Phonexia Partner Program for Government Partners

…is delivered over two days at Phonexia headquarters in Brno, Czech Republic. If this isn’t feasible for you, we can deliver it online or at your premises. I am currently an inactive partner. How do I re-engage with Phonexia? Reach out to your Phonexia sales representative, or contact us at [email protected]. We will be happy to restart our cooperation. If…

Speech To Text / Keyword Spotting supported languages

Languages supported by Speech To Text and Keyword Spotting Standard = Maintained until newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) Model name Released End of support Maintenance Arabic (Gulf, Kuwait) AR_KW_6 2022-04 8th gen. Standard Arabic (Levantine) AR_XL_6 2021-05 8th gen. Standard AR_XL_5 2020-08 7th…

Language Identification (LID)

…Routing particular calls (languages) to human operators (language experts) Scoring and results The LID language pack defines a set of recognizable languages (represented by a language models). When identifying the language in audio recording (or languageprint), LID does the following: creates languageprint of the recording (if the input is audio recording) compares that languageprint with each language model in a…

Q: What are the recommendations for LID adaptation set?

A: The following is recommended: For adding new language to language pack 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% of speech) Only 1 language per record For adapting the existing language model (discriminative training) 10+ hours of audio for each language May be done on customer site. May be done in…

Gender Identification (GID)

Gender Identification is a language-, domain- and channel-independent technology that uses the acoustic characteristics of the recording to determine the gender of the speaker in question. This technology is able to distinguish between two genders: Male (M) and Female (F). Minimum of speech signal for identification: 7+ sec recommended with XL5, XL4 and L4 model (9+ sec for previous generation…

Age Estimation (AGE)

…coding), A-law or Mu-law, PCM, 8kHz+ sampling Voiceprints: AGE L4 model supports SID4 L4 voiceprints; legacy AGE models support voiceprints created by AGE itself Output Log file with processed information (age estimate) Processing speed Approx. 20x faster than real-time processing on 1 CPU core i.e. standard 8 CPU core server processes 3,840 hours of audio in 1 day of computing…