Correctness consistency

42.6% of tests passed (20 passed / 27 failed).

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75
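The metric and its threshold can be read as a two-step check: compute the share of samples whose prediction lies in the expected range, then compare that share with the threshold. The following is a minimal sketch of that reading, not the test suite's actual implementation. The function names `proportion_in_range` and `passes`, the example predictions, and the assumption that a test passes when the proportion is greater than or equal to the threshold are all illustrative.

```python
from typing import Optional, Sequence


def proportion_in_range(
    predictions: Sequence[float],
    lower: Optional[float] = None,
    upper: Optional[float] = None,
) -> float:
    """Share of predictions inside [lower, upper]; a bound left as None is open."""
    if not predictions:
        raise ValueError("need at least one prediction")
    hits = sum(
        1
        for p in predictions
        if (lower is None or p >= lower) and (upper is None or p <= upper)
    )
    return hits / len(predictions)


def passes(proportion: float, threshold: float = 0.75) -> bool:
    """Assumed pass rule: the proportion must reach the threshold."""
    return proportion >= threshold


# Hypothetical predictions for samples labelled "happiness" (illustrative only).
example_predictions = [0.62, 0.58, 0.41, 0.70, 0.55, 0.49, 0.66, 0.73]
share = proportion_in_range(example_predictions, lower=0.55)
print(f"{share:.2f}", passes(share))
```

Passing only `lower` models an open-ended high range such as >= 0.55. Passing both bounds models a closed interval.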

| Data | happiness |
|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | 0.48 |
| danish-emotional-speech-1.1.1-emotion.test | 0.29 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 0.48 |
| emovo-1.2.1-emotion.test | 0.49 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | 0.51 |
| meld-1.3.1-emotion.categories.test.gold_standard | 0.70 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 0.80 |
| ravdess-1.1.2-emotion.speech.test | 0.06 |
| mean | 0.48 |

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

| Data | anger | disgust | fear | frustration | sadness |
|---|---|---|---|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | 0.77 | 0.71 | 0.67 | | 0.72 |
| danish-emotional-speech-1.1.1-emotion.test | 0.58 | | | | 0.79 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 0.49 | 0.38 | 0.27 | | 0.85 |
| emovo-1.2.1-emotion.test | 0.54 | 0.29 | 0.39 | | 0.49 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | 0.74 | | 0.76 | 0.68 | 0.86 |
| meld-1.3.1-emotion.categories.test.gold_standard | 0.48 | 0.40 | 0.24 | | 0.56 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 0.35 | | 0.28 | | 0.32 |
| ravdess-1.1.2-emotion.speech.test | 1.00 | 0.97 | 0.94 | | 0.91 |
| mean | 0.62 | 0.55 | 0.51 | 0.68 | 0.69 |
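Not every dataset contains every emotion, so the mean row appears to average only over the datasets that report a value for that emotion. The sketch below reproduces the disgust mean under that assumption. The helper name `mean_ignoring_missing` is hypothetical, and the assignment of values to the disgust column is inferred from the reported means rather than stated in the report.

```python
from typing import Optional, Sequence


def mean_ignoring_missing(values: Sequence[Optional[float]]) -> float:
    """Average the available entries; None marks a dataset without the emotion."""
    present = [v for v in values if v is not None]
    if not present:
        raise ValueError("no values to average")
    return sum(present) / len(present)


# Disgust proportions in the dataset order used above (assumed column assignment):
# crema-d, danish-emotional-speech, emodb, emovo, iemocap, meld,
# polish-emotional-speech, ravdess. None means no disgust samples are reported.
disgust = [0.71, None, 0.38, 0.29, None, 0.40, None, 0.97]
disgust_mean = mean_ignoring_missing(disgust)
print(round(disgust_mean, 2))
```

Treating missing cells as zero instead would give a much lower mean, which does not match the reported 0.55.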

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

| Data | boredom | neutral |
|---|---|---|
| crema-d-1.2.0-emotion.categories.test.gold_standard | | 0.97 |
| danish-emotional-speech-1.1.1-emotion.test | | 0.96 |
| emodb-1.2.0-emotion.categories.test.gold_standard | 1.00 | 1.00 |
| emovo-1.2.1-emotion.test | | 0.99 |
| iemocap-2.3.0-emotion.categories.test.gold_standard | | 0.89 |
| meld-1.3.1-emotion.categories.test.gold_standard | | 0.79 |
| polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard | 1.00 | 1.00 |
| ravdess-1.1.2-emotion.speech.test | | 1.00 |
| mean | 1.00 | 0.95 |

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green background.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard71.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test49.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard49.png
../../../_images/visualization_emovo-1.2.1-emotion.test71.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard71.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard93.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard49.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test49.png