Correctness consistency

56.8% passed tests (21 passed / 16 failed).
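
The headline percentage is the number of passed tests divided by the total number of tests. As a quick check of that arithmetic, here is a minimal sketch; the function name `pass_rate` is illustrative and not part of the test suite.

```python
def pass_rate(passed: int, failed: int) -> float:
    """Return the percentage of passed tests, rounded to one decimal place."""
    total = passed + failed
    if total == 0:
        raise ValueError("no tests were run")
    return round(100.0 * passed / total, 1)


# Counts taken from the summary line above.
print(pass_rate(21, 16))  # 56.8
```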

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75
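
The metric can be read as a two-step check: compute the share of samples whose prediction lies in the expected range, then compare that share with the threshold. The sketch below illustrates this under stated assumptions: `proportion_in_range` and the sample predictions are hypothetical, and treating a share equal to the threshold as a pass is an assumption, not something the report states.

```python
import math


def proportion_in_range(predictions, low=-math.inf, high=math.inf):
    """Share of predictions p with low <= p <= high (bounds are inclusive)."""
    predictions = list(predictions)
    if not predictions:
        raise ValueError("need at least one prediction")
    hits = sum(1 for p in predictions if low <= p <= high)
    return hits / len(predictions)


THRESHOLD = 0.75  # threshold stated in the report

# Hypothetical model predictions for samples labelled "anger".
anger_predictions = [0.81, 0.62, 0.49, 0.90, 0.70]

# Expected high range: prediction >= 0.55.
share = proportion_in_range(anger_predictions, low=0.55)

# Assumption: a share equal to the threshold counts as passed.
print(share, share >= THRESHOLD)
```

The same helper covers a one-sided upper bound with `high=0.45` and a closed interval with `low=0.3, high=0.6`.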

| Data | anger | fear | surprise |
|---|---|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | 0.75 | 0.40 | |
| danish-emotional-speech-1.1.1-emotion.test | 0.48 | | 0.54 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 1.00 | 0.88 | |
| emovo-1.2.1-emotion.test | 0.90 | 0.54 | 0.67 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | 0.83 | 0.65 | |
| meld-1.3.1-emotion.categories.test.gold_standard | 0.96 | 0.90 | 0.85 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 0.92 | 0.45 | |
| ravdess-1.1.2-emotion.speech.test | 0.88 | 0.50 | 1.00 |
| mean | 0.84 | 0.62 | 0.77 |

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

| Data | boredom | sadness |
|---|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | | 0.89 |
| danish-emotional-speech-1.1.1-emotion.test | | 1.00 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 0.97 | 1.00 |
| emovo-1.2.1-emotion.test | | 0.90 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | | 0.86 |
| meld-1.3.1-emotion.categories.test.gold_standard | | 0.25 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 0.98 | 1.00 |
| ravdess-1.1.2-emotion.speech.test | | 0.88 |
| mean | 0.97 | 0.85 |

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

| Data | neutral |
|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | 0.39 |
| danish-emotional-speech-1.1.1-emotion.test | 0.27 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 0.52 |
| emovo-1.2.1-emotion.test | 0.81 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | 0.69 |
| meld-1.3.1-emotion.categories.test.gold_standard | 0.48 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 0.80 |
| ravdess-1.1.2-emotion.speech.test | 0.00 |
| mean | 0.49 |

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green background.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard1.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test1.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard1.png
../../../_images/visualization_emovo-1.2.1-emotion.test1.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard1.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard1.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard1.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test1.png