Correctness speaker ranking

Overall scores
	w2v2-b	w2v2-L	w2v2-L-robust	w2v2-L-xls-r	w2v2-L-vox
Overall Score	0.0% passed tests (0 passed / 2 failed).	0.0% passed tests (0 passed / 2 failed).	50.0% passed tests (1 passed / 1 failed).	0.0% passed tests (0 passed / 2 failed).	0.0% passed tests (0 passed / 2 failed).

Spearman's Rho

Threshold: 0.7
Data	w2v2-b	w2v2-L	w2v2-L-robust	w2v2-L-xls-r	w2v2-L-vox
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.70	0.61	0.87	0.42	0.58
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard	0.22	0.37	0.28	0.34	0.15
mean	0.46	0.49	0.57	0.38	0.36
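
The threshold test above can be sketched as follows. This is a minimal illustration and not the code used to produce the table. It assumes that each speaker's rank comes from the mean of that speaker's per-sample values, and that a test passes when rho is strictly greater than the threshold; neither detail is stated on this page. The function names (`speaker_means`, `average_ranks`, `spearman_rho`, `passes_ranking_test`) are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def speaker_means(speakers, values):
    """Average per-sample values into one value per speaker (assumed aggregation)."""
    grouped = defaultdict(list)
    for speaker, value in zip(speakers, values):
        grouped[speaker].append(value)
    return {speaker: mean(vals) for speaker, vals in grouped.items()}


def average_ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def passes_ranking_test(speakers, truth, prediction, threshold=0.7):
    """Return (rho, passed) for the speaker ranking of one data set."""
    true_means = speaker_means(speakers, truth)
    pred_means = speaker_means(speakers, prediction)
    ids = sorted(true_means)
    rho = spearman_rho(
        [true_means[s] for s in ids],
        [pred_means[s] for s in ids],
    )
    # Assumption: strict comparison against the threshold.
    return rho, rho > threshold
```

Calling `passes_ranking_test(speakers, truth, prediction)` with three equally long sequences (speaker ID, gold standard value, and predicted value per sample) returns the correlation together with the pass/fail result for that data set.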

Visualization

The plots show how precisely each model predicts which speakers fall in the Top 25% or Bottom 25% of all speakers. Green dots mark correctly classified speakers. Red marks false positives, and red squares mark those false positives that confuse Top 25% with Bottom 25% speakers. The remaining grey data points lie outside the range of interest: they were not predicted to be in either group. They comprise false negatives, which should have been predicted in the Top 25% or Bottom 25% of speakers but were not, and true negatives, which are not part of the Top 25% or Bottom 25% and were predicted as such.
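
The categories described above can be sketched in code. This is a rough illustration under stated assumptions, not the plotting code behind the figures: it assumes one true and one predicted score per speaker, and it assumes each group contains a quarter of the speakers, rounded down. The function names (`quartile_groups`, `classify_speakers`, `precision`) and the outcome labels are hypothetical.

```python
def quartile_groups(scores):
    """Label each speaker 'top', 'bottom', or 'middle' by score rank."""
    ordered = sorted(scores, key=scores.get)
    k = len(ordered) // 4  # assumed group size: 25% of speakers, rounded down
    labels = {speaker: "middle" for speaker in ordered}
    if k > 0:
        for speaker in ordered[:k]:
            labels[speaker] = "bottom"
        for speaker in ordered[-k:]:
            labels[speaker] = "top"
    return labels


def classify_speakers(true_scores, pred_scores):
    """Assign each speaker one of the outcome categories shown in the plots."""
    true_labels = quartile_groups(true_scores)
    pred_labels = quartile_groups(pred_scores)
    outcome = {}
    for speaker, pred in pred_labels.items():
        truth = true_labels[speaker]
        if pred == "middle":
            # Grey points: not predicted to be in either group.
            outcome[speaker] = (
                "true negative" if truth == "middle" else "false negative"
            )
        elif pred == truth:
            outcome[speaker] = "correct"  # green dots
        elif truth == "middle":
            outcome[speaker] = "false positive"  # red
        else:
            outcome[speaker] = "confusion"  # red squares: top/bottom swapped
    return outcome


def precision(outcome):
    """Share of speakers predicted as top or bottom that were correct."""
    predicted = [
        o for o in outcome.values()
        if o in ("correct", "false positive", "confusion")
    ]
    if not predicted:
        return float("nan")
    return predicted.count("correct") / len(predicted)
```

The sketch keeps confusions separate from the other false positives so that they can be drawn with a different marker; both count against precision.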

Plots (one per model): w2v2-b, w2v2-L, w2v2-L-robust, w2v2-L-xls-r, w2v2-L-vox