Correctness speaker ranking
| | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| Overall Score | 0.0% passed tests (0 passed / 2 failed). | 0.0% passed tests (0 passed / 2 failed). | 50.0% passed tests (1 passed / 1 failed). | 0.0% passed tests (0 passed / 2 failed). | 0.0% passed tests (0 passed / 2 failed). |
Spearmans Rho
Spearmans Rho per data set and model:

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.70 | 0.61 | 0.87 | 0.42 | 0.58 |
| msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard | 0.22 | 0.37 | 0.28 | 0.34 | 0.15 |
| mean | 0.46 | 0.49 | 0.57 | 0.38 | 0.36 |
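The following is a minimal sketch of how a speaker ranking score of this kind could be computed. It assumes that each speaker is represented by the mean of their per-sample values and that Spearman's rho is taken between the speaker means of the gold standard and those of the model predictions. The report itself does not spell out this procedure, and all function names and the toy data are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def speaker_means(speakers, values):
    """Average the per-sample values of each speaker (assumed aggregation)."""
    grouped = defaultdict(list)
    for speaker, value in zip(speakers, values):
        grouped[speaker].append(value)
    return {speaker: mean(vals) for speaker, vals in grouped.items()}


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho as the Pearson correlation of the rank vectors.

    Raises ZeroDivisionError if either input is constant.
    """
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def speaker_ranking_rho(speakers, truth, prediction):
    """Correlate the speaker rankings of gold standard and prediction."""
    true_means = speaker_means(speakers, truth)
    pred_means = speaker_means(speakers, prediction)
    names = sorted(true_means)
    return spearman_rho(
        [true_means[n] for n in names],
        [pred_means[n] for n in names],
    )


# Toy data: four hypothetical speakers with two samples each.
speakers = ["s1", "s1", "s2", "s2", "s3", "s3", "s4", "s4"]
truth = [0.1, 0.3, 0.4, 0.6, 0.5, 0.7, 0.8, 1.0]
prediction = [0.2, 0.2, 0.6, 0.6, 0.4, 0.4, 0.9, 0.9]
print(round(speaker_ranking_rho(speakers, truth, prediction), 2))
```

In the toy data, the prediction swaps the ranks of the two middle speakers, so the correlation is high but below 1. With SciPy available, `scipy.stats.spearmanr` could replace the hand-written `spearman_rho`.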
Visualization
The plots visualize the precision of predicting whether a speaker belongs to the Top 25% or Bottom 25% of all speakers. Green dots mark correctly classified speakers. Red markers mark false positives, and red squares specifically mark confusions between Top 25% and Bottom 25% speakers. The remaining grey data points lie outside the range of interest. They include false negatives, which should have been predicted in the Top 25% or Bottom 25% of speakers but were not, and true negatives, which belong to neither group and were predicted as such.
Plots (one panel per model): w2v2-b, w2v2-L, w2v2-L-robust, w2v2-L-xls-r, w2v2-L-vox.
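The Top 25% / Bottom 25% classification described above can be sketched as follows. This is an illustration only: it assumes that both the true and the predicted groups are obtained by ranking speakers on a single score per speaker and cutting off the lowest and highest quarter. The function names, the returned keys, and the toy scores are hypothetical.

```python
def extreme_groups(scores, fraction=0.25):
    """Split a {speaker: score} mapping into its bottom and top groups."""
    ranked = sorted(scores, key=scores.get)
    n = max(1, int(len(ranked) * fraction))
    return set(ranked[:n]), set(ranked[-n:])


def top_bottom_report(truth, prediction, fraction=0.25):
    """Categorize speakers the way the plot colors are described."""
    true_bottom, true_top = extreme_groups(truth, fraction)
    pred_bottom, pred_top = extreme_groups(prediction, fraction)
    predicted = pred_top | pred_bottom
    # Green dots: predicted group matches the true group.
    correct = (pred_top & true_top) | (pred_bottom & true_bottom)
    # Red squares: predicted in one extreme group, truly in the other.
    confused = (pred_top & true_bottom) | (pred_bottom & true_top)
    return {
        "precision": len(correct) / len(predicted),
        "correct": correct,
        "false_positive": predicted - correct,  # all red markers
        "confused": confused,
        # Grey points that truly belong to an extreme group.
        "false_negative": (true_top | true_bottom) - predicted,
    }


# Toy data: eight hypothetical speakers with one score each.
truth = {"a": 1, "b": 2, "c": 3, "d": 4, "e": 5, "f": 6, "g": 7, "h": 8}
prediction = {"a": 1, "b": 8, "c": 2, "d": 4, "e": 5, "f": 6, "g": 7, "h": 3}
report = top_bottom_report(truth, prediction)
print(report["precision"], sorted(report["confused"]))
```

In this toy example speaker `b` is truly in the bottom group but predicted in the top group, so it counts both as a false positive and as a confusion, while `h` is a false negative among the grey points.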