Correctness regression

Overall scores

Model       Overall Score
CNN14       22.2% passed tests (2 passed / 7 failed)
w2v2-b      66.7% passed tests (6 passed / 3 failed)
hubert-b    55.6% passed tests (5 passed / 4 failed)
axlstm      44.4% passed tests (4 passed / 5 failed)
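
The overall score is the share of passed tests among all tests run for a model. A minimal sketch of that arithmetic follows; the function name `pass_rate` is hypothetical and not taken from the benchmark code, and the one-decimal formatting is assumed from the figures reported here.

```python
def pass_rate(passed: int, failed: int) -> str:
    """Format a pass rate the way the overall scores are reported.

    Hypothetical helper for illustration only.
    """
    total = passed + failed
    percent = 100.0 * passed / total
    return f"{percent:.1f}% passed tests ({passed} passed / {failed} failed)"


# Counts taken from the overall scores of this section.
for model, passed, failed in [
    ("CNN14", 2, 7),
    ("w2v2-b", 6, 3),
    ("hubert-b", 5, 4),
    ("axlstm", 4, 5),
]:
    print(model, pass_rate(passed, failed))
```

Each model is evaluated on nine tests here, which matches three metrics on three test sets.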

Concordance Correlation Coeff

Threshold: 0.5

Data                                                      CNN14     w2v2-b    hubert-b  axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard       0.42      0.60      0.64      0.50
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  0.66      0.72      0.71      0.63
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard  0.43      0.47      0.47      0.38
mean                                                      0.50      0.60      0.60      0.50
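
For readers unfamiliar with the metric, the concordance correlation coefficient can be sketched as below. This is the standard textbook definition using population variances, not necessarily the exact implementation behind these results. The function name and the toy values are made up for illustration.

```python
from statistics import fmean


def concordance_cc(truth, prediction):
    """Concordance correlation coefficient (textbook definition).

    CCC = 2 * cov / (var_truth + var_prediction + (mean_truth - mean_prediction)^2)
    """
    n = len(truth)
    mean_t = fmean(truth)
    mean_p = fmean(prediction)
    var_t = sum((t - mean_t) ** 2 for t in truth) / n
    var_p = sum((p - mean_p) ** 2 for p in prediction) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(truth, prediction)) / n
    denominator = var_t + var_p + (mean_t - mean_p) ** 2
    if denominator == 0:
        # Both sequences are constant and equal; the ratio is undefined.
        return float("nan")
    return 2 * cov / denominator


# Toy values, not benchmark data: the prediction follows the truth
# perfectly but is shifted by a constant offset.
truth = [0.0, 1.0]
prediction = [1.0, 2.0]
print(concordance_cc(truth, prediction))
```

Unlike the Pearson correlation, the coefficient penalises a constant offset between prediction and truth, so the shifted toy prediction scores well below 1.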

Mean Absolute Error

Threshold: 0.1

Data                                                      CNN14     w2v2-b    hubert-b  axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard       0.14      0.14      0.12      0.12
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  0.10      0.10      0.10      0.10
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard  0.11      0.11      0.11      0.10
mean                                                      0.12      0.12      0.11      0.11
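
The mean absolute error averages the absolute differences between prediction and truth. The sketch below uses the standard definition. The function names and toy values are illustrative, and the pass rule (error strictly below the threshold) is an assumption, since the comparison used by the tests is not stated here.

```python
def mean_absolute_error(truth, prediction):
    """Average absolute difference between truth and prediction."""
    return sum(abs(t - p) for t, p in zip(truth, prediction)) / len(truth)


def passes_error_test(value: float, threshold: float = 0.1) -> bool:
    """Assumed pass rule: lower is better, strict comparison (an assumption)."""
    return value < threshold


# Toy values, not benchmark data.
truth = [0.2, 0.5, 0.8]
prediction = [0.25, 0.5, 0.7]
mae = mean_absolute_error(truth, prediction)
print(round(mae, 2), passes_error_test(mae))
```

The reported values appear to be rounded to two decimals, so a displayed 0.10 does not by itself show on which side of the threshold the unrounded value falls.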

Pearson Correlation Coeff

Threshold: 0.5

Data                                                      CNN14     w2v2-b    hubert-b  axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard       0.42      0.62      0.64      0.51
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  0.66      0.73      0.71      0.63
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard  0.45      0.51      0.49      0.38
mean                                                      0.51      0.62      0.61      0.51
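
The Pearson correlation coefficient measures linear association only and ignores offset and scale. A minimal sketch of the standard definition follows. The function name and toy values are illustrative and not taken from the benchmark code.

```python
import math
from statistics import fmean


def pearson_cc(truth, prediction):
    """Pearson correlation coefficient: covariance over the product of standard deviations."""
    mean_t = fmean(truth)
    mean_p = fmean(prediction)
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(truth, prediction))
    var_t = sum((t - mean_t) ** 2 for t in truth)
    var_p = sum((p - mean_p) ** 2 for p in prediction)
    if var_t == 0 or var_p == 0:
        # A constant sequence has no defined correlation.
        return float("nan")
    return cov / math.sqrt(var_t * var_p)


# Toy values, not benchmark data: a constant offset leaves the
# Pearson correlation at its maximum.
print(pearson_cc([0.0, 1.0], [1.0, 2.0]))
```

This is consistent with the Pearson values in this section never falling below the corresponding concordance values, since only the latter penalises offset and scale mismatches.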

Visualization

iemocap-2.3.0-emotion.dimensions.test.gold_standard

CNN14:    ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard4.png
w2v2-b:   ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard5.png
hubert-b: ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard6.png
axlstm:   ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard7.png

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

CNN14:    ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard4.png
w2v2-b:   ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard5.png
hubert-b: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard6.png
axlstm:   ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard7.png

msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard

CNN14:    ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard4.png
w2v2-b:   ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard5.png
hubert-b: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard6.png
axlstm:   ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard7.png