Robustness low quality phone¶

Overall scores¶
	w2v2-L	hubert-L	wavlm	data2vec
Overall Score	25.0% passed tests (1 passed / 3 failed).	75.0% passed tests (3 passed / 1 failed).	100.0% passed tests (4 passed / 0 failed).	75.0% passed tests (3 passed / 1 failed).

Change Ccc Low Quality Phone¶

Threshold: -0.05¶
Data	Change CCC Low Quality Phone
Data	w2v2-L	hubert-L	wavlm	data2vec
iemocap-2.3.0-emotion.dimensions.test.gold_standard	-0.06	-0.05	-0.01	-0.02
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	-0.03	-0.01	-0.01	-0.01
mean	-0.04	-0.03	-0.01	-0.01

Percentage Unchanged Predictions Low Quality Phone¶

Threshold: 0.5¶
Data	Percentage Unchanged Predictions Low Quality Phone
Data	w2v2-L	hubert-L	wavlm	data2vec
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.39	0.75	0.82	0.52
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.39	0.74	0.76	0.42
mean	0.39	0.74	0.79	0.47

Visualization¶

Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

w2v2-L	hubert-L	wavlm	data2vec