Robustness low quality phone¶

Overall scores¶
	CNN14	w2v2-b	hubert-b	axlstm
Overall Score	25.0% passed tests (1 passed / 3 failed).	100.0% passed tests (4 passed / 0 failed).	25.0% passed tests (1 passed / 3 failed).	0.0% passed tests (0 passed / 4 failed).

Change Ccc Low Quality Phone¶

Threshold: -0.05¶
Data	Change CCC Low Quality Phone
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	-0.00	-0.03	-0.07	-0.05
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	-0.08	-0.03	-0.05	-0.07
mean	-0.04	-0.03	-0.06	-0.06

Percentage Unchanged Predictions Low Quality Phone¶

Threshold: 0.5¶
Data	Percentage Unchanged Predictions Low Quality Phone
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.29	0.58	0.50	0.37
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.33	0.59	0.54	0.39
mean	0.31	0.58	0.52	0.38

Visualization¶

Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

CNN14	w2v2-b	hubert-b	axlstm