Correctness consistency

53.2% passed tests (25 passed / 22 failed).

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75

Data

happiness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.23

danish-emotional-speech-1.1.1-emotion.test

0.23

emodb-1.2.0-emotion.categories.test.gold_standard

0.30

emovo-1.2.1-emotion.test

0.14

iemocap-2.3.0-emotion.categories.test.gold_standard

0.49

meld-1.3.1-emotion.categories.test.gold_standard

0.81

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.72

ravdess-1.1.2-emotion.speech.test

0.00

mean

0.36

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

Data

anger

disgust

fear

frustration

sadness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.90

0.80

0.84

0.82

danish-emotional-speech-1.1.1-emotion.test

0.79

0.77

emodb-1.2.0-emotion.categories.test.gold_standard

0.62

0.65

0.76

0.67

emovo-1.2.1-emotion.test

0.79

0.77

0.83

0.83

iemocap-2.3.0-emotion.categories.test.gold_standard

0.67

0.59

0.64

0.86

meld-1.3.1-emotion.categories.test.gold_standard

0.31

0.30

0.14

0.39

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.40

0.35

0.40

ravdess-1.1.2-emotion.speech.test

1.00

1.00

1.00

1.00

mean

0.69

0.70

0.64

0.64

0.72

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

Data

boredom

neutral

crema-d-1.2.0-emotion.categories.test.gold_standard

0.81

danish-emotional-speech-1.1.1-emotion.test

0.92

emodb-1.2.0-emotion.categories.test.gold_standard

0.97

0.93

emovo-1.2.1-emotion.test

0.85

iemocap-2.3.0-emotion.categories.test.gold_standard

0.80

meld-1.3.1-emotion.categories.test.gold_standard

0.52

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.98

0.85

ravdess-1.1.2-emotion.speech.test

0.06

mean

0.97

0.72

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green brackground.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard68.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test46.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard46.png
../../../_images/visualization_emovo-1.2.1-emotion.test68.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard68.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard90.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard46.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test46.png