Correctness speaker average

Overall scores

w2v2-b-cat

w2v2-L-cat

w2v2-L-robust-cat

w2v2-L-vox-cat

w2v2-L-xls-r-cat

Overall Score

41.7% passed tests (5 passed / 7 failed).

33.3% passed tests (4 passed / 8 failed).

50.0% passed tests (6 passed / 6 failed).

33.3% passed tests (4 passed / 8 failed).

25.0% passed tests (3 passed / 9 failed).

Class Proportion Mean Absolute Error

Threshold: 0.1

Data

anger

happiness

neutral

sadness

w2v2-b-cat

w2v2-L-cat

w2v2-L-robust-cat

w2v2-L-vox-cat

w2v2-L-xls-r-cat

w2v2-b-cat

w2v2-L-cat

w2v2-L-robust-cat

w2v2-L-vox-cat

w2v2-L-xls-r-cat

w2v2-b-cat

w2v2-L-cat

w2v2-L-robust-cat

w2v2-L-vox-cat

w2v2-L-xls-r-cat

w2v2-b-cat

w2v2-L-cat

w2v2-L-robust-cat

w2v2-L-vox-cat

w2v2-L-xls-r-cat

iemocap-2.3.0-full

0.09

0.06

0.06

0.12

0.06

0.10

0.11

0.12

0.08

0.05

0.21

0.28

0.16

0.31

0.26

0.19

0.23

0.10

0.36

0.32

meld-1.3.1-emotion.categories.test.gold_standard

0.02

0.03

0.03

0.02

0.21

0.42

0.43

0.36

0.34

0.19

0.47

0.49

0.42

0.50

0.46

0.07

0.03

0.03

0.15

0.06

msppodcast-2.6.0-emotion.categories.test-1.gold_standard

0.05

0.10

0.07

0.07

0.15

0.11

0.12

0.07

0.10

0.10

0.23

0.26

0.16

0.17

0.16

0.09

0.05

0.05

0.09

0.13

mean

0.05

0.06

0.05

0.07

0.14

0.21

0.22

0.18

0.17

0.11

0.30

0.34

0.25

0.33

0.29

0.12

0.10

0.06

0.20

0.17

Visualization

The plot shows the proportion of the predicted samples for each class, as well as the true proportion of the class. We select a slightly higher threshold for the absolute error in the plots compared to the Class Proportion Difference test as we are interested in highlighting only big deviations here.

w2v2-b-cat

w2v2-L-cat

w2v2-L-robust-cat

w2v2-L-vox-cat

w2v2-L-xls-r-cat

../../../_images/visualization_iemocap-2.3.0-full22.png
../../../_images/visualization_iemocap-2.3.0-full23.png
../../../_images/visualization_iemocap-2.3.0-full24.png
../../../_images/visualization_iemocap-2.3.0-full25.png
../../../_images/visualization_iemocap-2.3.0-full26.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard32.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard33.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard34.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard35.png
../../../_images/visualization_meld-1.3.1-emotion.categories.test.gold_standard36.png
../../../_images/visualization_msppodcast-2.6.0-emotion.categories.test-1.gold_standard10.png
../../../_images/visualization_msppodcast-2.6.0-emotion.categories.test-1.gold_standard11.png
../../../_images/visualization_msppodcast-2.6.0-emotion.categories.test-1.gold_standard12.png
../../../_images/visualization_msppodcast-2.6.0-emotion.categories.test-1.gold_standard13.png
../../../_images/visualization_msppodcast-2.6.0-emotion.categories.test-1.gold_standard14.png