Correctness consistency

59.6% of tests passed (28 passed, 19 failed).

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75

| Data | happiness |
|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | 0.05 |
| danish-emotional-speech-1.1.1-emotion.test | 0.12 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 0.00 |
| emovo-1.2.1-emotion.test | 0.02 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | 0.38 |
| meld-1.3.1-emotion.categories.test.gold_standard | 0.62 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 0.18 |
| ravdess-1.1.2-emotion.speech.test | 0.00 |
| mean | 0.17 |
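As a rough illustration of the metric above, the following sketch computes the proportion of predictions that fall into an expected range and compares it with the threshold. The function name `proportion_in_range`, its parameters, and the example predictions are hypothetical and not taken from the test suite. Treating the threshold as a lower bound that the proportion must reach is also an assumption, although it is consistent with the pass counts reported in this section.

```python
import math


def proportion_in_range(predictions, low=-math.inf, high=math.inf):
    """Return the fraction of predictions p with low <= p <= high."""
    if not predictions:
        raise ValueError("predictions must not be empty")
    hits = sum(1 for p in predictions if low <= p <= high)
    return hits / len(predictions)


# Hypothetical model outputs for samples labelled "happiness".
predictions = [0.62, 0.48, 0.71, 0.55, 0.30]

# Expected high range: prediction >= 0.55.
proportion = proportion_in_range(predictions, low=0.55)

# Assumed pass criterion: the proportion must reach the threshold of 0.75.
passed = proportion >= 0.75

print(f"proportion={proportion:.2f} passed={passed}")
```

The same helper covers the other two ranges in this report by passing `high=0.45` or `low=0.3, high=0.6`.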

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

| Data | anger | disgust | fear | frustration | sadness |
|---|---|---|---|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | 0.98 | 0.99 | 0.98 | | 1.00 |
| danish-emotional-speech-1.1.1-emotion.test | 0.92 | | | | 0.92 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 1.00 | 1.00 | 0.94 | | 1.00 |
| emovo-1.2.1-emotion.test | 0.99 | 0.87 | 0.87 | | 0.86 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | 0.76 | | 0.41 | 0.61 | 0.84 |
| meld-1.3.1-emotion.categories.test.gold_standard | 0.45 | 0.28 | 0.16 | | 0.34 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 0.88 | | 0.60 | | 0.52 |
| ravdess-1.1.2-emotion.speech.test | 1.00 | 1.00 | 1.00 | | 0.97 |
| mean | 0.87 | 0.83 | 0.71 | 0.61 | 0.81 |
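Not every dataset contains every emotion, so each column mean is taken over the datasets that have a value. The sketch below recomputes the means under that reading. Which cells are empty is an assumption: it is inferred from the number of values listed per dataset and from the requirement that the recomputed means agree with the reported ones. Because the listed proportions are already rounded to two decimals, the recomputed means are only expected to match within rounding error.

```python
# Proportions per emotion, in the order the datasets are listed:
# crema-d, danish-emotional-speech, emodb, emovo, iemocap, meld,
# polish-emotional-speech, ravdess.
# None marks an emotion assumed to be absent from a dataset.
columns = {
    "anger":       [0.98, 0.92, 1.00, 0.99, 0.76, 0.45, 0.88, 1.00],
    "disgust":     [0.99, None, 1.00, 0.87, None, 0.28, None, 1.00],
    "fear":        [0.98, None, 0.94, 0.87, 0.41, 0.16, 0.60, 1.00],
    "frustration": [None, None, None, None, 0.61, None, None, None],
    "sadness":     [1.00, 0.92, 1.00, 0.86, 0.84, 0.34, 0.52, 0.97],
}

# Means as reported in the "mean" row.
reported_mean = {
    "anger": 0.87,
    "disgust": 0.83,
    "fear": 0.71,
    "frustration": 0.61,
    "sadness": 0.81,
}


def column_mean(values):
    """Average the values that are present, skipping missing cells."""
    present = [v for v in values if v is not None]
    return sum(present) / len(present)


means = {emotion: column_mean(values) for emotion, values in columns.items()}

for emotion, mean in means.items():
    print(f"{emotion}: {mean:.3f} (reported {reported_mean[emotion]:.2f})")
```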

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

| Data | boredom | neutral |
|---|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | | 0.72 |
| danish-emotional-speech-1.1.1-emotion.test | | 0.90 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 1.00 | 1.00 |
| emovo-1.2.1-emotion.test | | 0.76 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | | 0.90 |
| meld-1.3.1-emotion.categories.test.gold_standard | | 0.72 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 1.00 | 1.00 |
| ravdess-1.1.2-emotion.speech.test | | 0.62 |
| mean | 1.00 | 0.83 |
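The pass summary at the top of this section can be cross-checked against the per-dataset values. The sketch below assumes that every dataset and emotion pair counts as one test, that a test passes when its proportion is at least the threshold of 0.75, and that the `mean` rows are not counted as tests. These are assumptions about how the report is tallied, not documented behaviour, but under them the counts match the reported summary.

```python
THRESHOLD = 0.75  # same threshold for all three range tests

# Expected high range (happiness), one value per dataset.
high_range = [0.05, 0.12, 0.00, 0.02, 0.38, 0.62, 0.18, 0.00]

# Expected low range (anger, disgust, fear, frustration, sadness),
# listed per dataset in the order the values appear in the report.
low_range = [
    0.98, 0.99, 0.98, 1.00,  # crema-d
    0.92, 0.92,              # danish-emotional-speech
    1.00, 1.00, 0.94, 1.00,  # emodb
    0.99, 0.87, 0.87, 0.86,  # emovo
    0.76, 0.41, 0.61, 0.84,  # iemocap
    0.45, 0.28, 0.16, 0.34,  # meld
    0.88, 0.60, 0.52,        # polish-emotional-speech
    1.00, 1.00, 1.00, 0.97,  # ravdess
]

# Expected neutral range (boredom, neutral).
neutral_range = [0.72, 0.90, 1.00, 1.00, 0.76, 0.90, 0.72, 1.00, 1.00, 0.62]

results = high_range + low_range + neutral_range

# Assumed pass criterion: proportion >= threshold.
passed = sum(1 for value in results if value >= THRESHOLD)
failed = len(results) - passed
pass_rate = 100 * passed / len(results)

print(f"{passed} passed, {failed} failed, {pass_rate:.1f}% pass rate")
```

None of the listed proportions equals 0.75 exactly, so the tally would be the same with a strict comparison.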

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green background.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard72.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test50.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard50.png
../../../_images/visualization_emovo-1.2.1-emotion.test72.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard72.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard94.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard50.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test50.png