Correctness consistency

29.8% passed tests (14 passed / 33 failed).

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75

Data

happiness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.40

danish-emotional-speech-1.1.1-emotion.test

0.52

emodb-1.2.0-emotion.categories.test.gold_standard

0.22

emovo-1.2.1-emotion.test

0.30

iemocap-2.3.0-emotion.categories.test.gold_standard

0.58

meld-1.3.1-emotion.categories.test.gold_standard

0.75

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.57

ravdess-1.1.2-emotion.speech.test

0.00

mean

0.42

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

Data

anger

disgust

fear

frustration

sadness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.84

0.55

0.45

0.51

danish-emotional-speech-1.1.1-emotion.test

0.17

0.08

emodb-1.2.0-emotion.categories.test.gold_standard

0.67

0.38

0.30

0.33

emovo-1.2.1-emotion.test

0.67

0.30

0.25

0.26

iemocap-2.3.0-emotion.categories.test.gold_standard

0.71

0.65

0.57

0.52

meld-1.3.1-emotion.categories.test.gold_standard

0.44

0.37

0.26

0.37

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.38

0.12

0.00

ravdess-1.1.2-emotion.speech.test

1.00

1.00

0.97

0.97

mean

0.61

0.52

0.43

0.57

0.38

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

Data

boredom

neutral

crema-d-1.2.0-emotion.categories.test.gold_standard

0.98

danish-emotional-speech-1.1.1-emotion.test

1.00

emodb-1.2.0-emotion.categories.test.gold_standard

1.00

1.00

emovo-1.2.1-emotion.test

1.00

iemocap-2.3.0-emotion.categories.test.gold_standard

0.94

meld-1.3.1-emotion.categories.test.gold_standard

0.74

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

1.00

1.00

ravdess-1.1.2-emotion.speech.test

1.00

mean

1.00

0.96

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green brackground.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard76.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test54.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard54.png
../../../_images/visualization_emovo-1.2.1-emotion.test76.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard76.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard98.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard54.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test54.png