Correctness regression

Overall scores

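========  =========================================
Model     Overall Score
========  =========================================
CNN14     44.4% passed tests (4 passed / 5 failed).
w2v2-b    66.7% passed tests (6 passed / 3 failed).
hubert-b  66.7% passed tests (6 passed / 3 failed).
axlstm    44.4% passed tests (4 passed / 5 failed).
========  =========================================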

Concordance Correlation Coeff

Threshold: 0.5

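========================================================  ========  ========  ========  ========
Data                                                      CNN14     w2v2-b    hubert-b  axlstm
========================================================  ========  ========  ========  ========
iemocap-2.3.0-emotion.dimensions.test.gold_standard       0.24      0.51      0.52      0.39
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  0.56      0.63      0.62      0.52
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard  0.36      0.44      0.42      0.35
mean                                                      0.39      0.53      0.52      0.42
========================================================  ========  ========  ========  ========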
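For readers unfamiliar with the metric, the concordance correlation coefficient is commonly defined as 2·cov(x, y) / (var(x) + var(y) + (mean(x) − mean(y))²), so it penalises a systematic offset between predictions and gold standard that the Pearson correlation ignores. The following is a minimal sketch of that textbook definition, not the benchmark's own implementation; the function names are hypothetical and population (biased) variance is assumed.

```python
from math import sqrt


def _mean(values):
    return sum(values) / len(values)


def _covariance(x, y):
    # Population covariance (divides by N, not N - 1); an assumption of this sketch.
    mean_x, mean_y = _mean(x), _mean(y)
    return sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / len(x)


def pearson_cc(truth, prediction):
    # Pearson correlation: covariance normalised by both standard deviations.
    return _covariance(truth, prediction) / sqrt(
        _covariance(truth, truth) * _covariance(prediction, prediction)
    )


def concordance_cc(truth, prediction):
    # Concordance correlation: additionally penalises the squared difference of the means.
    bias = _mean(truth) - _mean(prediction)
    return 2 * _covariance(truth, prediction) / (
        _covariance(truth, truth) + _covariance(prediction, prediction) + bias ** 2
    )


# Made-up values: predictions follow the truth perfectly but are shifted by 0.2.
truth = [0.2, 0.4, 0.6, 0.8]
prediction = [0.4, 0.6, 0.8, 1.0]
print(round(pearson_cc(truth, prediction), 3))      # perfect linear relation
print(round(concordance_cc(truth, prediction), 3))  # lower, because of the offset
```

In this toy example the Pearson correlation is 1 while the concordance correlation drops to about 0.71, which is why the two metrics can differ slightly in the tables on this page.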

Mean Absolute Error

Threshold: 0.1

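========================================================  ========  ========  ========  ========
Data                                                      CNN14     w2v2-b    hubert-b  axlstm
========================================================  ========  ========  ========  ========
iemocap-2.3.0-emotion.dimensions.test.gold_standard       0.18      0.14      0.15      0.14
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  0.09      0.09      0.09      0.09
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard  0.10      0.09      0.10      0.09
mean                                                      0.12      0.11      0.11      0.11
========================================================  ========  ========  ========  ========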

Pearson Correlation Coeff

Threshold: 0.5

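========================================================  ========  ========  ========  ========
Data                                                      CNN14     w2v2-b    hubert-b  axlstm
========================================================  ========  ========  ========  ========
iemocap-2.3.0-emotion.dimensions.test.gold_standard       0.25      0.52      0.54      0.45
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  0.56      0.64      0.63      0.52
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard  0.39      0.45      0.44      0.36
mean                                                      0.40      0.54      0.53      0.44
========================================================  ========  ========  ========  ========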
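The overall scores at the top of this page appear to follow from the three metric tables: each model is tested on three data sets with three metrics, giving nine tests. The sketch below reproduces the pass/fail counts from the rounded values shown here. It rests on assumptions that this page does not state: a correlation passes when it is at or above its threshold, a Mean Absolute Error passes when it is at or below its threshold, and the `mean` rows are not counted as tests. All names in the code are hypothetical.

```python
MODELS = ["CNN14", "w2v2-b", "hubert-b", "axlstm"]

# Thresholds as listed on this page.
THRESHOLDS = {"ccc": 0.5, "mae": 0.1, "pcc": 0.5}

# Rounded values copied from the tables above; one row per data set
# (iemocap test, msppodcast test-1, msppodcast test-2), one column per model.
RESULTS = {
    "ccc": [[0.24, 0.51, 0.52, 0.39], [0.56, 0.63, 0.62, 0.52], [0.36, 0.44, 0.42, 0.35]],
    "mae": [[0.18, 0.14, 0.15, 0.14], [0.09, 0.09, 0.09, 0.09], [0.10, 0.09, 0.10, 0.09]],
    "pcc": [[0.25, 0.52, 0.54, 0.45], [0.56, 0.64, 0.63, 0.52], [0.39, 0.45, 0.44, 0.36]],
}


def passes(metric, value):
    # Assumption: errors must stay at or below the threshold,
    # correlations must reach or exceed it.
    threshold = THRESHOLDS[metric]
    return value <= threshold if metric == "mae" else value >= threshold


def tally():
    # Count passed and failed tests per model over all metrics and data sets.
    counts = {}
    for column, model in enumerate(MODELS):
        outcomes = [
            passes(metric, row[column])
            for metric, rows in RESULTS.items()
            for row in rows
        ]
        passed = sum(outcomes)
        counts[model] = (passed, len(outcomes) - passed)
    return counts


for model, (passed, failed) in tally().items():
    share = 100 * passed / (passed + failed)
    print(f"{model}: {share:.1f}% passed tests ({passed} passed / {failed} failed).")
```

Under these assumptions the counts match the Overall Score table (4/5, 6/3, 6/3, 4/5). Note that the match depends on treating a displayed Mean Absolute Error of 0.10 as a pass; the benchmark itself presumably compares unrounded values.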

Visualization

iemocap-2.3.0-emotion.dimensions.test.gold_standard

CNN14: ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard48.png
w2v2-b: ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard49.png
hubert-b: ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard50.png
axlstm: ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard51.png

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

CNN14: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard70.png
w2v2-b: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard71.png
hubert-b: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard72.png
axlstm: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard73.png

msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard

CNN14: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard48.png
w2v2-b: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard49.png
hubert-b: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard50.png
axlstm: ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard51.png