Correctness consistency

21.3% passed tests (10 passed / 37 failed).

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75

Data

happiness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.19

danish-emotional-speech-1.1.1-emotion.test

0.17

emodb-1.2.0-emotion.categories.test.gold_standard

0.37

emovo-1.2.1-emotion.test

0.24

iemocap-2.3.0-emotion.categories.test.gold_standard

0.33

meld-1.3.1-emotion.categories.test.gold_standard

0.46

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.30

ravdess-1.1.2-emotion.speech.test

0.22

mean

0.29

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

Data

anger

disgust

fear

frustration

sadness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.55

0.42

0.59

0.72

danish-emotional-speech-1.1.1-emotion.test

0.69

0.71

emodb-1.2.0-emotion.categories.test.gold_standard

0.44

0.46

0.42

0.74

emovo-1.2.1-emotion.test

0.56

0.52

0.58

0.62

iemocap-2.3.0-emotion.categories.test.gold_standard

0.42

0.47

0.39

0.38

meld-1.3.1-emotion.categories.test.gold_standard

0.43

0.31

0.30

0.43

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.48

0.48

0.75

ravdess-1.1.2-emotion.speech.test

0.72

0.44

0.53

0.47

mean

0.54

0.43

0.48

0.39

0.60

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

Data

boredom

neutral

crema-d-1.2.0-emotion.categories.test.gold_standard

0.94

danish-emotional-speech-1.1.1-emotion.test

0.90

emodb-1.2.0-emotion.categories.test.gold_standard

0.86

0.96

emovo-1.2.1-emotion.test

0.89

iemocap-2.3.0-emotion.categories.test.gold_standard

0.93

meld-1.3.1-emotion.categories.test.gold_standard

0.85

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.95

0.88

ravdess-1.1.2-emotion.speech.test

1.00

mean

0.91

0.92

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green brackground.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard69.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test47.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard47.png
../../../_images/visualization_emovo-1.2.1-emotion.test69.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard69.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard91.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard47.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test47.png