Correctness consistency¶

53.2% passed tests (25 passed / 22 failed).

Samples In Expected High Range¶

Proportion of samples whose predictions fall into the expected value range of >= 0.55

Threshold: 0.75¶
Data	happiness
crema-d-1.2.0-emotion.categories.test.gold_standard	0.23
danish-emotional-speech-1.1.1-emotion.test	0.23
emodb-1.2.0-emotion.categories.test.gold_standard	0.30
emovo-1.2.1-emotion.test	0.14
iemocap-2.3.0-emotion.categories.test.gold_standard	0.49
meld-1.3.1-emotion.categories.test.gold_standard	0.81
polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard	0.72
ravdess-1.1.2-emotion.speech.test	0.00
mean	0.36

Samples In Expected Low Range¶

Proportion of samples whose predictions fall into the expected value range of <= 0.45

Threshold: 0.75¶
Data	anger	disgust	fear	frustration	sadness
crema-d-1.2.0-emotion.categories.test.gold_standard	0.90	0.80	0.84		0.82
danish-emotional-speech-1.1.1-emotion.test	0.79				0.77
emodb-1.2.0-emotion.categories.test.gold_standard	0.62	0.65	0.76		0.67
emovo-1.2.1-emotion.test	0.79	0.77	0.83		0.83
iemocap-2.3.0-emotion.categories.test.gold_standard	0.67		0.59	0.64	0.86
meld-1.3.1-emotion.categories.test.gold_standard	0.31	0.30	0.14		0.39
polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard	0.40		0.35		0.40
ravdess-1.1.2-emotion.speech.test	1.00	1.00	1.00		1.00
mean	0.69	0.70	0.64	0.64	0.72

Samples In Expected Neutral Range¶

Proportion of samples whose predictions fall into the expected value range of [0.3, 0.6]

Threshold: 0.75¶
Data	boredom	neutral
crema-d-1.2.0-emotion.categories.test.gold_standard		0.81
danish-emotional-speech-1.1.1-emotion.test		0.92
emodb-1.2.0-emotion.categories.test.gold_standard	0.97	0.93
emovo-1.2.1-emotion.test		0.85
iemocap-2.3.0-emotion.categories.test.gold_standard		0.80
meld-1.3.1-emotion.categories.test.gold_standard		0.52
polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard	0.98	0.85
ravdess-1.1.2-emotion.speech.test		0.06
mean	0.97	0.72

Visualization¶

Distribution of dimensional model predictions for samples with different categorical emotions. The expected range of model predictions is highlighted by the green brackground.

../../../_images/visualization_danish-emotional-speech-1.1.1-emotion.test46.png

../../../_images/visualization_emodb-1.2.0-emotion.categories.test.gold_standard46.png

../../../_images/visualization_emovo-1.2.1-emotion.test68.png

../../../_images/visualization_iemocap-2.3.0-emotion.categories.test.gold_standard68.png

../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard90.png

../../../_images/visualization_polish-emotional-speech-1.1.1-emotion.categories.test.gold_standard46.png

../../../_images/visualization_ravdess-1.1.2-emotion.speech.test46.png