Correctness regression

| | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| Overall Score | 44.4% passed tests (4 passed / 5 failed). | 66.7% passed tests (6 passed / 3 failed). | 66.7% passed tests (6 passed / 3 failed). | 44.4% passed tests (4 passed / 5 failed). |

Concordance Correlation Coeff

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.24 | 0.51 | 0.52 | 0.39 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.56 | 0.63 | 0.62 | 0.52 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.36 | 0.44 | 0.42 | 0.35 |
| mean | 0.39 | 0.53 | 0.52 | 0.42 |
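
For reference, the concordance correlation coefficient is commonly defined as 2·cov(x, y) / (var(x) + var(y) + (mean(x) − mean(y))²). The following is a minimal sketch of that textbook definition, assuming population (biased) variance and covariance. The function name `concordance_correlation` is hypothetical, and this is not necessarily the code that produced the values reported here.

```python
def concordance_correlation(truth, prediction):
    """Concordance correlation coefficient between two equal-length sequences.

    Assumption: population (divide-by-n) variance and covariance are used.
    Returns NaN when both inputs are constant and equal (zero denominator).
    """
    n = len(truth)
    if n == 0 or n != len(prediction):
        raise ValueError("inputs must be non-empty and of equal length")

    mean_t = sum(truth) / n
    mean_p = sum(prediction) / n

    # Population variance of each sequence and their covariance.
    var_t = sum((t - mean_t) ** 2 for t in truth) / n
    var_p = sum((p - mean_p) ** 2 for p in prediction) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(truth, prediction)) / n

    # The squared mean difference penalizes a systematic offset,
    # which a plain Pearson correlation would ignore.
    denominator = var_t + var_p + (mean_t - mean_p) ** 2
    if denominator == 0:
        return float("nan")
    return 2 * cov / denominator


# A constant offset lowers the score even though the two sequences
# are perfectly linearly related.
identical = concordance_correlation([0.0, 1.0, 2.0], [0.0, 1.0, 2.0])
shifted = concordance_correlation([0.0, 1.0, 2.0], [1.0, 2.0, 3.0])
```

Because of the offset penalty in the denominator, this quantity can never exceed the magnitude of the Pearson correlation computed on the same data.
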

Mean Absolute Error

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.18 | 0.14 | 0.15 | 0.14 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.09 | 0.09 | 0.09 | 0.09 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.10 | 0.09 | 0.10 | 0.09 |
| mean | 0.12 | 0.11 | 0.11 | 0.11 |

Pearson Correlation Coeff

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.25 | 0.52 | 0.54 | 0.45 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.56 | 0.64 | 0.63 | 0.52 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.39 | 0.45 | 0.44 | 0.36 |
| mean | 0.40 | 0.54 | 0.53 | 0.44 |

Visualization

| CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|