Robustness simulated recording condition¶

Overall scores¶
	CNN14	w2v2-b	hubert-b	axlstm
Overall Score	0.0% passed tests (0 passed / 6 failed).	0.0% passed tests (0 passed / 6 failed).	0.0% passed tests (0 passed / 6 failed).	0.0% passed tests (0 passed / 6 failed).

Percentage Unchanged Predictions Simulated Position¶

Threshold: 0.8¶
Data	Percentage Unchanged Predictions Simulated Position
Data	CNN14	w2v2-b	hubert-b	axlstm
emovo-1.2.1-emotion.test	0.60	0.59	0.64	0.57
imda-nsc-read-speech-balanced-2.6.0-headset	0.75	0.67	0.79	0.68
timit-1.4.1-files	0.58	0.75	0.80	0.60
mean	0.64	0.67	0.74	0.62

Percentage Unchanged Predictions Simulated Room¶

Threshold: 0.8¶
Data	Percentage Unchanged Predictions Simulated Room
Data	CNN14	w2v2-b	hubert-b	axlstm
emovo-1.2.1-emotion.test	0.49	0.49	0.60	0.43
imda-nsc-read-speech-balanced-2.6.0-headset	0.53	0.73	0.74	0.41
timit-1.4.1-files	0.41	0.66	0.72	0.42
mean	0.48	0.63	0.69	0.42

Visualization Simulated Position¶

Difference of predictions for audio with a baseline simulated position and audio with a different simulated position. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

CNN14	w2v2-b	hubert-b	axlstm