Age Estimation (AGE)

Phonexia Age Estimation (AGE) estimates the age of a speaker from audio recording or voiceprint.

Technology

Trained with emphasis on spontaneous telephony conversation
The technology is language-, accent-, text-, and channel- independent
Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc.

Input

Audio: WAV or RAW (8 or 16 bits linear coding),
A-law or Mu-law, PCM, 8kHz+ sampling
Voiceprints: AGE L4 model supports SID4 L4 voiceprints; legacy AGE models support voiceprints created by AGE itself

Output

Log file with processed information (age estimate)

Processing speed

Approx. 20x faster than real-time processing on 1 CPU core
i.e. standard 8 CPU core server processes 3,840 hours of audio in 1 day of computing time

Representation of the results:

For the CMD version

Name_of_the_file.wav Age[integer – limited to 99] example/david_1.wav 41
example/david_2.wav 40

For the SPE version

name – representing the age
score – representing the score for the age [1/0]

In order to get a result, each age receives a score; when the score equals to “1”, it represents the value of the age estimated by the system.

{
      "result": {
          "version": 2,
          "name": "AgeEstimationResult",
          "file": "/kelly_2.wav",
          "model": "L",
          "channel_scores": [ 
             {                  "channel": 0,
                   "scores": [
                       {
                           "name": "0",
                           "score": 0
                       },
                       {
                           "name": "1",
                           "score": 0
                       },
   . . .                     {
                           "name": "41",
                           "score": 1
                       },
                       {
                           "name": "42",
                           "score": 0
                       },
 . . .

In order to achieve the most representative results possible, a span of +/- 10 years should be added to the results.

Gender Identification (GID)

Voice Activity Detection (VAD)

Age Estimation (AGE)

Previous Article

Next Article

ABOUT PHONEXIA

LEGAL

ACCOUNT

Previous Article

Next Article

Related Articles

ABOUT PHONEXIA

LEGAL

ACCOUNT

TAGS