Correctness speaker average

Overall scores

CNN14

w2v2-b

hubert-b

axlstm

Overall Score

100.0% passed tests (3 passed / 0 failed).

100.0% passed tests (3 passed / 0 failed).

100.0% passed tests (3 passed / 0 failed).

100.0% passed tests (3 passed / 0 failed).

Mean Absolute Error

Threshold: 0.1

Data

Mean Absolute Error

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-full

0.03

0.04

0.03

0.02

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.05

0.04

0.03

0.03

msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard

0.04

0.07

0.04

0.03

mean

0.04

0.05

0.03

0.03

Visualization

The plot shows the predicted average value with the true average value. We select a slightly higher threshold for the absolute error in the plots compared to the Mean Absolute Error test as we are interested in highlighting only big deviations here.

CNN14

w2v2-b

hubert-b

axlstm

../../../_images/visualization_iemocap-2.3.0-full.png
../../../_images/visualization_iemocap-2.3.0-full1.png
../../../_images/visualization_iemocap-2.3.0-full2.png
../../../_images/visualization_iemocap-2.3.0-full3.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard8.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard9.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard10.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard11.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard8.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard9.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard10.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard11.png