Correctness consistency

19.1% passed tests (9 passed / 38 failed).

Samples In Expected High Range

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75

Data

happiness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.42

danish-emotional-speech-1.1.1-emotion.test

0.42

emodb-1.2.0-emotion.categories.test.gold_standard

0.26

emovo-1.2.1-emotion.test

0.42

iemocap-2.3.0-emotion.categories.test.gold_standard

0.42

meld-1.3.1-emotion.categories.test.gold_standard

0.42

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.28

ravdess-1.1.2-emotion.speech.test

0.31

mean

0.37

Samples In Expected Low Range

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75

Data

anger

disgust

fear

frustration

sadness

crema-d-1.2.0-emotion.categories.test.gold_standard

0.35

0.27

0.35

0.41

danish-emotional-speech-1.1.1-emotion.test

0.35

0.44

emodb-1.2.0-emotion.categories.test.gold_standard

0.40

0.50

0.33

0.37

emovo-1.2.1-emotion.test

0.40

0.38

0.48

0.45

iemocap-2.3.0-emotion.categories.test.gold_standard

0.35

0.47

0.38

0.44

meld-1.3.1-emotion.categories.test.gold_standard

0.34

0.28

0.36

0.37

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.48

0.38

0.32

ravdess-1.1.2-emotion.speech.test

0.44

0.44

0.41

0.31

mean

0.39

0.37

0.40

0.38

0.39

Samples In Expected Neutral Range

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75

Data

boredom

neutral

crema-d-1.2.0-emotion.categories.test.gold_standard

0.80

danish-emotional-speech-1.1.1-emotion.test

0.71

emodb-1.2.0-emotion.categories.test.gold_standard

0.78

0.81

emovo-1.2.1-emotion.test

0.80

iemocap-2.3.0-emotion.categories.test.gold_standard

0.76

meld-1.3.1-emotion.categories.test.gold_standard

0.79

polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard

0.78

0.78

ravdess-1.1.2-emotion.speech.test

0.88

mean

0.78

0.79

Visualization

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green brackground.

../../../_images/visualization_crema-d-1.2.0-emotion.categories.test.gold_standard83.png
../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test59.png
../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard59.png
../../../_images/visualization_emovo-1.2.1-emotion.test83.png
../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard83.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard107.png
../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard59.png
../../../_images/visualization_ravdess-1.1.2-emotion.speech.test59.png