Correctness consistency

55.3% passed tests (26 passed / 21 failed).

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75

Data

happiness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.41

danish-emotional-speech-1.1.1-emotion.test

0.37

emodb-1.2.0-emotion.categories.test.gold_standard

0.74

emovo-1.2.1-emotion.test

0.26

iemocap-2.3.0-emotion.categories.test.gold_standard

0.43

meld-1.3.1-emotion.categories.test.gold_standard

0.81

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.75

ravdess-1.1.2-emotion.speech.test

0.19

mean

0.49

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

Data

anger

disgust

fear

frustration

sadness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.85

0.89

0.87

0.97

danish-emotional-speech-1.1.1-emotion.test

0.56

0.94

emodb-1.2.0-emotion.categories.test.gold_standard

0.64

0.50

0.42

1.00

emovo-1.2.1-emotion.test

0.79

0.61

0.51

0.88

iemocap-2.3.0-emotion.categories.test.gold_standard

0.77

0.71

0.75

0.92

meld-1.3.1-emotion.categories.test.gold_standard

0.45

0.37

0.26

0.51

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.78

0.65

1.00

ravdess-1.1.2-emotion.speech.test

0.94

1.00

0.75

0.88

mean

0.72

0.67

0.60

0.75

0.89

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

Data

boredom

neutral

crema-d-1.2.0-emotion.categories.test.gold_standard

0.88

danish-emotional-speech-1.1.1-emotion.test

0.98

emodb-1.2.0-emotion.categories.test.gold_standard

0.97

1.00

emovo-1.2.1-emotion.test

1.00

iemocap-2.3.0-emotion.categories.test.gold_standard

0.88

meld-1.3.1-emotion.categories.test.gold_standard

0.79

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.92

1.00

ravdess-1.1.2-emotion.speech.test

1.00

mean

0.95

0.94

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green brackground.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard75.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test53.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard53.png
../../../_images/visualization_emovo-1.2.1-emotion.test75.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard75.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard97.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard53.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test53.png