
Q: What are the recommendations for an LID (language identification) adaptation set?

A: The following is recommended:

For adding a new language to a language pack

  • 20+ hours of audio for each new language model (or 25+ hours of audio containing 80% speech)
  • Only one language per recording

For adapting an existing language model (discriminative training)

  • 10+ hours of audio for each language
  • Can be done at the customer's site, or at Phonexia using anonymized data (= language-prints extracted from .wav audio)
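
Before preparing an adaptation set, it can help to check that the collected audio reaches the recommended amounts. The following is a minimal sketch, not part of any Phonexia tooling. It assumes a hypothetical directory layout with one subfolder per language (root/<language>/*.wav), and the function names and the speech_ratio parameter are made up for illustration. The share of speech in the audio is not measured here; it would have to come from a separate voice activity detection step.

```python
import wave
from pathlib import Path

# Thresholds in hours, taken from the recommendations above.
MIN_HOURS = {"new_language": 20.0, "adaptation": 10.0}


def wav_hours(path):
    """Return the duration of one WAV file in hours."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate() / 3600.0


def hours_per_language(root):
    """Sum audio duration per language.

    Assumes the hypothetical layout root/<language>/*.wav,
    with only one language per recording.
    """
    totals = {}
    for lang_dir in sorted(Path(root).iterdir()):
        if lang_dir.is_dir():
            totals[lang_dir.name] = sum(
                wav_hours(p) for p in lang_dir.glob("*.wav")
            )
    return totals


def meets_recommendation(total_hours, mode, speech_ratio=1.0):
    """Compare net speech hours (total * speech_ratio) with the threshold.

    mode is "new_language" (20+ hours) or "adaptation" (10+ hours).
    speech_ratio is an assumed, externally estimated share of speech.
    """
    return total_hours * speech_ratio >= MIN_HOURS[mode]
```

For example, calling hours_per_language on the dataset folder and passing each total to meets_recommendation with mode "new_language" and the estimated speech ratio shows which languages still need more recordings.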