Correctness regression¶
CNN14 |
w2v2-b |
hubert-b |
axlstm |
|
---|---|---|---|---|
Overall Score |
22.2% passed tests (2 passed / 7 failed). |
66.7% passed tests (6 passed / 3 failed). |
55.6% passed tests (5 passed / 4 failed). |
44.4% passed tests (4 passed / 5 failed). |
Concordance Correlation Coeff¶
Data |
Concordance Correlation Coeff |
|||
---|---|---|---|---|
CNN14 |
w2v2-b |
hubert-b |
axlstm |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.42 |
0.60 |
0.64 |
0.50 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.66 |
0.72 |
0.71 |
0.63 |
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard |
0.43 |
0.47 |
0.47 |
0.38 |
mean |
0.50 |
0.60 |
0.60 |
0.50 |
Mean Absolute Error¶
Data |
Mean Absolute Error |
|||
---|---|---|---|---|
CNN14 |
w2v2-b |
hubert-b |
axlstm |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.14 |
0.14 |
0.12 |
0.12 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.10 |
0.10 |
0.10 |
0.10 |
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard |
0.11 |
0.11 |
0.11 |
0.10 |
mean |
0.12 |
0.12 |
0.11 |
0.11 |
Pearson Correlation Coeff¶
Data |
Pearson Correlation Coeff |
|||
---|---|---|---|---|
CNN14 |
w2v2-b |
hubert-b |
axlstm |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.42 |
0.62 |
0.64 |
0.51 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.66 |
0.73 |
0.71 |
0.63 |
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard |
0.45 |
0.51 |
0.49 |
0.38 |
mean |
0.51 |
0.62 |
0.61 |
0.51 |
Visualization¶
CNN14 |
w2v2-b |
hubert-b |
axlstm |
---|---|---|---|