Correctness regression

| | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| Overall Score | 0.0% passed tests (0 passed / 9 failed). | 0.0% passed tests (0 passed / 9 failed). | 22.2% passed tests (2 passed / 7 failed). | 0.0% passed tests (0 passed / 9 failed). |
Concordance Correlation Coeff

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.24 | 0.42 | 0.42 | 0.10 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.25 | 0.48 | 0.53 | 0.24 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.07 | 0.22 | 0.24 | 0.06 |
| mean | 0.18 | 0.37 | 0.40 | 0.14 |
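
For reference, the concordance correlation coefficient (CCC) rewards predictions that both correlate with the gold standard and match it in mean and scale. The following is a minimal sketch, assuming Lin's standard definition with population statistics (`ddof=0`). The function name `concordance_cc` is illustrative, and the implementation used to produce the numbers above is not shown in this report and may differ in detail.

```python
import numpy as np


def concordance_cc(truth, prediction):
    """Lin's concordance correlation coefficient (population statistics).

    CCC = 2 * cov(t, p) / (var(t) + var(p) + (mean(t) - mean(p))**2)
    """
    truth = np.asarray(truth, dtype=float)
    prediction = np.asarray(prediction, dtype=float)

    mean_t, mean_p = truth.mean(), prediction.mean()
    var_t, var_p = truth.var(), prediction.var()  # ddof=0 by default
    cov = np.mean((truth - mean_t) * (prediction - mean_p))

    denominator = var_t + var_p + (mean_t - mean_p) ** 2
    if denominator == 0:
        # Both inputs are constant and identical: CCC is undefined.
        return float("nan")
    return float(2 * cov / denominator)
```

Unlike a plain correlation, a constant offset between predictions and gold standard lowers the CCC: `concordance_cc([0, 1, 2], [1, 2, 3])` is 4/7 even though the two series are perfectly correlated.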
Mean Absolute Error

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.19 | 0.17 | 0.17 | 0.20 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.16 | 0.14 | 0.14 | 0.14 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.15 | 0.13 | 0.14 | 0.13 |
| mean | 0.17 | 0.14 | 0.15 | 0.16 |
Pearson Correlation Coeff

| Data | CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.26 | 0.45 | 0.43 | 0.14 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.26 | 0.49 | 0.55 | 0.27 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.08 | 0.25 | 0.30 | 0.07 |
| mean | 0.20 | 0.39 | 0.43 | 0.16 |
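
The Pearson correlation and the mean absolute error capture different aspects of prediction quality, which is why both are reported. The sketch below, using illustrative helper names `pearson_cc` and `mean_absolute_error` that are assumptions rather than the benchmark's own code, shows that a constant offset in the predictions leaves the Pearson correlation unchanged while increasing the mean absolute error.

```python
import numpy as np


def pearson_cc(truth, prediction):
    """Pearson correlation coefficient between two 1-D sequences."""
    return float(np.corrcoef(truth, prediction)[0, 1])


def mean_absolute_error(truth, prediction):
    """Average absolute difference between truth and prediction."""
    truth = np.asarray(truth, dtype=float)
    prediction = np.asarray(prediction, dtype=float)
    return float(np.mean(np.abs(truth - prediction)))


# Hypothetical values, not taken from the datasets above.
truth = np.array([0.2, 0.4, 0.6, 0.8])
shifted = truth + 0.1  # perfectly correlated, but biased by 0.1

pcc_shifted = pearson_cc(truth, shifted)
mae_shifted = mean_absolute_error(truth, shifted)
```

Here `pcc_shifted` stays at 1 (up to floating-point error) while `mae_shifted` equals the offset of 0.1, so a model can score well on one metric and poorly on the other.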
Visualization

| CNN14 | w2v2-b | hubert-b | axlstm |
|---|---|---|---|