Search: lid command

16 results

LID: Terminology and adaptation

…use absolute paths paths lead to the correct technological model directory of your choice (l4, l3, xl3, …) Example below assumes that the listfile will be saved to {SPE} directory (hence the relative paths to bsapi/…) and also assumes the “L4” model. You should reflect your setup accordingly. cs-CZ bsapi/lid/lprints/l4/cs-CZ.lpa pl-PL bsapi/lid/lprints/l4/pl-PL.lpa en-GB bsapi/lid/lprints/l4/en-GB.lpa ru-RU bsapi/lid/lprints/l4/ru-RU.lpa MyLanguage bsapi/lid/lprints/l4/MyLanguage.lpa MyOtherLanguage bsapi/lid/lprints/l4/MyOtherLanguage.lpa…

Releases and Changelogs (SPE)

…Added stream KWS (/technologies/keywordspotting/stream) Added technology benchmarks for AGE, DIAR, GID, LID, SID, STT (/technologies/{TECHNOLOGY}/benchmark) Added request to get voice-print info (/technologies/speakerid/vpinfo) Added usage examples to API documentation Add configuration options for TCP connection settings Added VAD segmentation to Time Analysis technology Support to acquire and compare language-prints LID technology was separated to LIDC (comparator) and LIDE (extractor) Support websockets…

Release Notes

…for the following parameters: CLI/CMD PARAMETERS NOTE LID Command Line “lid | lid.exe” -active-langs str1,str2… Deprecated STT Command Line “stt | stt.exe” -auto-scan-dir -move-input -no-locks -local-compliance str1,… -net-compliance str1,… -modif-delay num [3.5s] -stable-att-int num [3.5s] -cn-max-words-per-slot num [0] -cn-min-word-prob num [-70] Removed VAD Command Line “vad | vad.exe” -nonspeech-lab -save-log -log-suffix str [log] Deprecated Note: There is no impact on…

Understand SPE executable files

…technologies configuration, diagnostic info collection, etc. Usage: phxadmin [OPTION | COMMAND] [subcommand] [suboption…] Options –help – Show help information on command line parameters and exit –version – Show SPE version and exit Commands user – Manage SPE users. Without sub-command, lists all users. technology – Manage technologies. Without sub-command, lists enabled technologies. language-pack – Manage LID language packs. Without sub-command,…

FAQs (PSP)

…configuration for ffmpg: # Set converter command # %1 is for input file # %2 is for output file ffmpeg example: audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 Important note: By design and saving computing resources ‘audio converter’ is not used if INPUT file ends with the extension .wav….

Understand SPE configuration file

…PCM 16-bit, 8 kHz. See audio_converter.command for more details. audio_converter.command # Set converter command # %1 is for input file # %2 is for output file # ffmpeg example: # audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 Sets the command…

FAQs (Browser)

…languages are supported by LID? A: Please see List of supported LID Languages. For more details, see LID technology documentation. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: How to fix the Error 1013: Unsupported: Server does not support authentication with token? A: Please check SPE subdirectory ./settings for configuration files. If only phxspe.browser.properties exists, then your Browser uses…

SPE and Browser installation: standalone SPE

…package. In other words, merge the contents of the /bsapi/ directory with the /SPE/bsapi/ 4. Configure Speech Engine In order to configure the Speech Engine, we have to navigate to /SPE/ directory and start the configuration utility called phxadmin SPE on Windows In the /SPE/ directory type cmd in the Address bar, to open the Command line. In the command…

Releases and Changelogs (Browser)

…Browser 3.60 (Public release) Phonexia Browser 3.60.0, BSAPI 3.60.0 (2023-12-05) New: Transcriptions of multiple files can be saved using the context menu command in Results pane New: Linux only: Added create_shortcut.sh script that integrates Browser with Linux desktop environments Improved: SID Evaluation is not considered Beta anymore Improved: Better looking icons in technologies configuration in Settings / Speech Engine tab…

Understand SPE configuration

…FLAC audio format (Default: true) server.audio_formats.flac.enabled = true # Enable or disable audio converter audio_converter.enabled = true # Set converter command # %1 is for input file # %2 is for output file # ffmpeg example: # audio_converter.command = ffmpeg -loglevel warning -y -i %1 %2 # sox example: # audio_converter.command = sox %1 %2 audio_converter.command = ffmpeg -loglevel warning…

Understand SPE database

…user), technology model used to create the profile, file with the profile content, hash rest_profile_sid4_metafiles list of files used as SID4 Audio Source Profiles metafiles rest_model_lid list of LID language packs – name, owner (SPE user), technology model to which the language pack belongs (i.e. technology model used to create source languageprints/language models) rest_model_lid_metafiles list of LID language packs metafiles…

Licensing (technical details)

…its startup and read its content. Alternatively, you can: specify the license file location in a configuration file (only for SPE and RLS) start the product executable with a license (SPE and RLS) or l parameter (command line), specifying license file location set the license file location in BS_LICENSE environment variable (only for command line) License types NET license NET…

Understand SPE database scripts

…MySQL command line client) use create_schema.sql script then use init_data.sql script when you need to clean your SPE DB (and don’t want to delete/re-create the entire DB for some reason) use drop.sql to completely erase the DB content, followed by re-creating the content using create_schema.sql and init_data.sql or use clean.sql to clean “rest_directory_type”, “rest_role”, “rest_user”, “rest_technology_model” and “rest_model_lid” tables Scripts…

Phonexia Speech Engine

…✓ Voice Activity Detection (VAD) ✓ ✓ Time Analysis Extraction (TAE) ✓ ✓ Speech Quality Estimation (SQE) ✓ ✓ Language Identification (LID) ✓ Gender Identification (GID) ✓ Age Estimation (AGE) ✓ Speaker Diarization (DIAR) ✓ Results caching Processing results can be optionally stored in results cache database to speed up eventual re-processing of the same recordings by the same technology…

Download Speech Platform

…(DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) ) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get started, please…