Robustness low quality phone¶

Overall scores¶
	w2v2-L	hubert-L	wavlm	data2vec
Overall Score	75.0% passed tests (3 passed / 1 failed).	50.0% passed tests (2 passed / 2 failed).	75.0% passed tests (3 passed / 1 failed).	50.0% passed tests (2 passed / 2 failed).

Change Ccc Low Quality Phone¶

Threshold: -0.05¶
Data	Change CCC Low Quality Phone
Data	w2v2-L	hubert-L	wavlm	data2vec
iemocap-2.3.0-emotion.dimensions.test.gold_standard	-0.02	-0.05	-0.05	-0.05
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	-0.05	-0.05	0.01	-0.05
mean	-0.04	-0.05	-0.02	-0.05

Percentage Unchanged Predictions Low Quality Phone¶

Threshold: 0.5¶
Data	Percentage Unchanged Predictions Low Quality Phone
Data	w2v2-L	hubert-L	wavlm	data2vec
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.64	0.62	0.58	0.64
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.53	0.68	0.68	0.65
mean	0.58	0.65	0.63	0.65

Visualization¶

Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

w2v2-L	hubert-L	wavlm	data2vec