Q: What languages are supported by LID?
A: Please see List of supported LID Languages. For more details, see LID technology documentation.
A: Please see List of supported KWS Languages. For more details, see KWS technology documentation.
A: There are multiple methods to train a new language; please see the article under Components > Speech Technologies > LID.
A: Yes. Documentation is here: https://download.phonexia.com/docs/spe.
A: Yes, the system comes as an API (for the production license).
A: Use the ffprobe utility*: running ffprobe <file_name> prints information about the file.
*The “ffprobe” utility is not included in our package(s). It is part of ffmpeg (https://ffmpeg.org/ffprobe.html) and must be installed separately.
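For scripted checks, the ffprobe call above can be wrapped as in the following minimal sketch. It assumes ffprobe is installed and on the PATH; the helper names (build_ffprobe_command, summarize_audio, probe) are hypothetical and not part of any Phonexia package.

```python
import json
import subprocess


def build_ffprobe_command(file_name):
    """Return an ffprobe command line that prints file info as JSON."""
    return [
        "ffprobe",
        "-v", "error",      # hide the banner, keep real errors
        "-show_format",     # container-level info (duration, bit rate, ...)
        "-show_streams",    # per-stream info (codec, sample rate, channels, ...)
        "-of", "json",      # machine-readable output
        file_name,
    ]


def summarize_audio(ffprobe_json):
    """Pick codec, sample rate and channel count of the first audio stream."""
    info = json.loads(ffprobe_json)
    for stream in info.get("streams", []):
        if stream.get("codec_type") == "audio":
            return {
                "codec": stream.get("codec_name"),
                # ffprobe reports the sample rate as a string
                "sample_rate": int(stream["sample_rate"]),
                "channels": stream.get("channels"),
            }
    return None  # no audio stream found


def probe(file_name):
    """Run ffprobe (installed separately) and summarize its output."""
    result = subprocess.run(
        build_ffprobe_command(file_name),
        capture_output=True, text=True, check=True,
    )
    return summarize_audio(result.stdout)
```

Calling probe("recording.wav") (a placeholder file name) returns a small dictionary with the codec, sample rate, and channel count, or None if the file has no audio stream.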
A: The language-prints do not depend on the current language pack used. You may use them for both training a new language pack and testing/comparing against an existing language pack.
The language-prints need to be compatible only with the model of LID used for language-print extraction.
A: The following is recommended:
For adding a new language to a language pack
For adapting the existing language model (discriminative training)
A: You can get the list of running/configured technologies by sending the query GET /technologies
or by using the phxadmin utility with the parameter configure-tech
A:
If the format is not defined (or the HTTP “Accept” header has one of these values: application/*, */*, *), the server returns JSON.
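A client can predict this default with a small check, sketched below. The function name is hypothetical, and the handling of an “Accept” header that lists several values is an assumption: the sketch only predicts JSON when every listed value is one of the wildcards named above.

```python
# Accept values that, per the answer above, make the server fall back to JSON.
WILDCARD_ACCEPT_VALUES = {"application/*", "*/*", "*"}


def falls_back_to_json(accept_header):
    """Return True when the server is expected to answer with JSON by default.

    accept_header is the raw value of the HTTP "Accept" header, or None
    when the client did not send one.
    """
    if accept_header is None or not accept_header.strip():
        return True  # format not defined
    # Drop optional parameters such as ";q=0.8" and compare the bare value.
    values = [part.split(";")[0].strip().lower() for part in accept_header.split(",")]
    # Assumption: a concrete type anywhere in the list overrides the default.
    return all(value in WILDCARD_ACCEPT_VALUES for value in values)
```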
A:
Windows:
PhxBrowser.exe /spe-debug /spe-output
Linux:
./PhxBrowser --spe-debug --spe-output
A: The score threshold is not set up correctly. Adjust the speaker score sharpness value to calibrate the recalculation.
Please see Calibration in the technology documentation.
When running SPE, the following error occurs:
[Error] ApplicationStartup: Unhandled exception: BsapiException: SWaveformSegmenterI(/mnt/phxspe/home/phx/storage/dfs/a1cabcf7-c761-49f1 -a9bc-0a8209a09fd9.opus Requested segment (78056, 102056) is out of waveform range (0,91840).
A: This means that the opus file was created improperly: its header declares much more audio than the file actually contains.
Please check that your audio source/originator works properly.
Alternatively, use the ffmpeg or sox utility as an audio preprocessor and normalize the recordings by converting them from opus to opus before they are processed by SPE.
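The opus-to-opus conversion could be scripted roughly as follows. This is a sketch under assumptions: ffmpeg is installed separately and built with the libopus encoder, the encoder settings are left at ffmpeg defaults, and the helper names are hypothetical. The converted file is written to a separate directory because ffmpeg cannot overwrite its own input.

```python
import subprocess
from pathlib import Path


def build_reencode_command(source, target):
    """Return an ffmpeg command that decodes source and re-encodes it as opus."""
    return [
        "ffmpeg",
        "-y",                # overwrite the target if it already exists
        "-i", str(source),   # input recording
        "-c:a", "libopus",   # re-encode the audio with the libopus encoder
        str(target),
    ]


def normalize_opus(source, target_dir):
    """Re-encode one opus file into target_dir and return the new path."""
    target = Path(target_dir) / Path(source).name
    if target.resolve() == Path(source).resolve():
        raise ValueError("target_dir must differ from the source directory")
    subprocess.run(build_reencode_command(source, target), check=True)
    return target
```

Recordings would then be submitted to SPE from the target directory instead of the original location.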
A:
If the server responds to a pending request with status 200 – OK, the response body contains the result (the server already has the result in its cache, so there is no need to process the file again).
If the server responds to a pending request with status 202 – Accepted, the server creates a task and begins processing the file. The “Location” header of the response contains the path of the pending resource, and the body contains the ID of the pending operation.
Example of HTTP header:
GET /pending/ec563083-3d9b-457d-a0ac-24b197bc222f HTTP/1.1
Host: server.example.com
Upgrade: websocket
Connection: Upgrade
Sec-WebSocket-Key: x3JJHMbDL1EzLkh9GBhXDw==
Sec-WebSocket-Version: 13
X-SessionID: 258f505c-a6fa-4c3f-8a87-b048874ac6aa
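On the client side, the two cases described above (200 and 202) could be told apart as in this minimal sketch. The RequestOutcome type and interpret_response function are hypothetical names; the sketch only inspects a status code, headers, and body that the caller has already received, and makes no assumption about the format of the body.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass
class RequestOutcome:
    finished: bool               # True when the result is already in the body
    body: str                    # result (200) or ID of the pending operation (202)
    pending_path: Optional[str]  # value of the "Location" header (202 only)


def interpret_response(status: int, headers: Mapping[str, str], body: str) -> RequestOutcome:
    """Classify a server response as a finished result or a pending operation."""
    if status == 200:
        # The result was cached; nothing is left to wait for.
        return RequestOutcome(True, body, None)
    if status == 202:
        # HTTP header names are case-insensitive, so compare in lower case.
        location = next(
            (value for name, value in headers.items() if name.lower() == "location"),
            None,
        )
        return RequestOutcome(False, body, location)
    raise ValueError(f"unexpected status {status}")
```

For a 202 response, pending_path holds the path (such as /pending/ec563083-3d9b-457d-a0ac-24b197bc222f in the example above) that the client then requests to obtain the result.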