Correctness speaker average

Overall scores

| Model | Overall Score |
| --- | --- |
| CNN14 | 100.0% passed tests (3 passed / 0 failed). |
| w2v2-b | 100.0% passed tests (3 passed / 0 failed). |
| hubert-b | 100.0% passed tests (3 passed / 0 failed). |
| axlstm | 100.0% passed tests (3 passed / 0 failed). |
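The overall score can be read as one test per dataset, each comparing the Mean Absolute Error reported on this page against the threshold of 0.1. The following is a minimal sketch of that bookkeeping, not the test suite's actual code. It assumes a test passes when its error is strictly below the threshold; the function name `summarize` and the dictionary layout are hypothetical.

```python
# Mean Absolute Error values copied from the table on this page.
# Order of values: iemocap-2.3.0-full, msppodcast test-1, msppodcast test-2.
THRESHOLD = 0.1

mae = {
    "CNN14": [0.04, 0.09, 0.08],
    "w2v2-b": [0.02, 0.05, 0.06],
    "hubert-b": [0.01, 0.04, 0.07],
    "axlstm": [0.05, 0.06, 0.08],
}


def summarize(values, threshold=THRESHOLD):
    """Format a pass summary for one model.

    Assumption: a test passes when its error is strictly below the threshold.
    """
    passed = sum(1 for value in values if value < threshold)
    failed = len(values) - passed
    percent = 100.0 * passed / len(values)
    return f"{percent:.1f}% passed tests ({passed} passed / {failed} failed)."


for model, values in mae.items():
    print(f"{model}: {summarize(values)}")
```

With the reported values, every dataset stays below 0.1 for every model, which matches the three passed tests per model listed above.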

Mean Absolute Error

Threshold: 0.1

Mean Absolute Error per dataset and model:

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
| --- | --- | --- | --- | --- |
| iemocap-2.3.0-full | 0.04 | 0.02 | 0.01 | 0.05 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.09 | 0.05 | 0.04 | 0.06 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.08 | 0.06 | 0.07 | 0.08 |
| mean | 0.07 | 0.04 | 0.04 | 0.06 |
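To make the metric concrete, here is a rough sketch of how a Mean Absolute Error over speaker averages could be computed. It assumes the metric is the mean, over speakers, of the absolute difference between a speaker's average prediction and average ground truth; the page does not spell out the exact definition. The function `speaker_average_mae` and the toy data are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def speaker_average_mae(speakers, truth, prediction):
    """Mean absolute error between per-speaker average prediction and truth.

    Assumption: each sample belongs to exactly one speaker, and the error
    is averaged over speakers, not over samples.
    """
    # Collect (truth values, predicted values) for every speaker.
    by_speaker = defaultdict(lambda: ([], []))
    for speaker, t, p in zip(speakers, truth, prediction):
        by_speaker[speaker][0].append(t)
        by_speaker[speaker][1].append(p)
    # One absolute error per speaker, computed on the averaged values.
    errors = [abs(mean(p) - mean(t)) for t, p in by_speaker.values()]
    return mean(errors)


# Toy data, made up for illustration only.
speakers = ["a", "a", "b", "b"]
truth = [0.2, 0.4, 0.6, 0.8]
prediction = [0.3, 0.5, 0.5, 0.7]

mae = speaker_average_mae(speakers, truth, prediction)
print(f"MAE over speaker averages: {mae:.2f}")
```

Averaging per speaker first means that sample-level errors which cancel out within a speaker do not count; only a systematic shift of a speaker's average does.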

Visualization

The plots compare the predicted average value with the true average value. The absolute-error threshold used in the plots is slightly higher than the one used in the Mean Absolute Error test, because the plots are meant to highlight only large deviations.

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
| --- | --- | --- | --- | --- |
| iemocap-2.3.0-full | ../../../_images/visualization_iemocap-2.3.0-full33.png | ../../../_images/visualization_iemocap-2.3.0-full34.png | ../../../_images/visualization_iemocap-2.3.0-full35.png | ../../../_images/visualization_iemocap-2.3.0-full36.png |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard140.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard141.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard142.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard143.png |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard96.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard97.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard98.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard99.png |
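The idea of highlighting only large deviations in the visualization can be sketched as a simple filter on the absolute error between true and predicted averages. The plot threshold is not stated on this page, so `PLOT_THRESHOLD = 0.15` below is purely an illustrative assumption, as are the function `large_deviations` and the made-up speaker values.

```python
TEST_THRESHOLD = 0.1   # threshold of the Mean Absolute Error test
PLOT_THRESHOLD = 0.15  # hypothetical value; the page only says "slightly higher"


def large_deviations(true_avg, pred_avg, threshold):
    """Return the speakers whose absolute error exceeds the threshold."""
    flagged = {}
    for speaker, true_value in true_avg.items():
        error = abs(pred_avg[speaker] - true_value)
        if error > threshold:
            flagged[speaker] = error
    return flagged


# Made-up averages for three hypothetical speakers.
true_avg = {"spk1": 0.50, "spk2": 0.30, "spk3": 0.70}
pred_avg = {"spk1": 0.55, "spk2": 0.50, "spk3": 0.58}

flagged = large_deviations(true_avg, pred_avg, PLOT_THRESHOLD)
print("Highlighted in the plot:", sorted(flagged))
```

In this toy example a speaker with an error between the two thresholds would count against the test threshold but would not be highlighted, which is the effect of choosing the higher value for the plots.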