Robustness simulated recording condition

Overall scores
	w2v2-L	hubert-L	wavlm	data2vec
Overall Score	16.7% passed tests (1 passed / 5 failed).	16.7% passed tests (1 passed / 5 failed).	50.0% passed tests (3 passed / 3 failed).	16.7% passed tests (1 passed / 5 failed).

Percentage Unchanged Predictions Simulated Position

Threshold: 0.8
Data	w2v2-L	hubert-L	wavlm	data2vec
emovo-1.2.1-emotion.test	0.60	0.75	0.83	0.59
imda-nsc-read-speech-balanced-2.6.0-headset	0.78	0.79	0.90	0.73
timit-1.4.1-files	0.87	0.86	0.94	0.83
mean	0.75	0.80	0.89	0.72

Percentage Unchanged Predictions Simulated Room

Threshold: 0.8
Data	w2v2-L	hubert-L	wavlm	data2vec
emovo-1.2.1-emotion.test	0.42	0.67	0.72	0.45
imda-nsc-read-speech-balanced-2.6.0-headset	0.56	0.80	0.61	0.53
timit-1.4.1-files	0.68	0.77	0.54	0.65
mean	0.55	0.75	0.62	0.54
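
The overall scores can be cross-checked against the two tables above. The following is a minimal sketch, not the report's own implementation: it assumes each dataset row counts as one test per table (six tests per model) and that a test passes when its tabulated value is strictly greater than the threshold of 0.8. The strict comparison is an assumption; it is consistent with hubert-L's rounded 0.80 on the room test being counted as failed. The names `position`, `room`, and `overall_score` are hypothetical.

```python
# Values copied from the two tables above.
# Row order: emovo, imda-nsc-read-speech-balanced, timit.
THRESHOLD = 0.8

position = {
    "w2v2-L": [0.60, 0.78, 0.87],
    "hubert-L": [0.75, 0.79, 0.86],
    "wavlm": [0.83, 0.90, 0.94],
    "data2vec": [0.59, 0.73, 0.83],
}
room = {
    "w2v2-L": [0.42, 0.56, 0.68],
    "hubert-L": [0.67, 0.80, 0.77],
    "wavlm": [0.72, 0.61, 0.54],
    "data2vec": [0.45, 0.53, 0.65],
}


def overall_score(model):
    """Return (passed, failed, percentage passed) for one model."""
    values = position[model] + room[model]
    # Assumption: a test passes only if the value exceeds the threshold.
    passed = sum(value > THRESHOLD for value in values)
    failed = len(values) - passed
    return passed, failed, 100 * passed / len(values)


for model in position:
    passed, failed, pct = overall_score(model)
    print(f"{model}: {pct:.1f}% passed tests ({passed} passed / {failed} failed)")
```

Because the tabulated values are rounded to two decimals, a check like this can only confirm consistency with the reported scores, not reproduce the original pass/fail decisions exactly.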

Visualization Simulated Position

Difference between the predictions for audio with a baseline simulated position and the predictions for audio with a different simulated position. In the upper plot, the range of allowed prediction differences, \(\delta < 0.05\), is highlighted in green. The lower plot shows the distributions of the two sets of predictions.
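
As a rough illustration of how the allowed difference relates to the percentage of unchanged predictions reported in the tables, the sketch below counts a prediction as unchanged when its absolute difference from the baseline prediction is below \(\delta\). It assumes one scalar prediction per sample and an absolute difference; neither is stated explicitly here, and the function name `percentage_unchanged` is hypothetical.

```python
def percentage_unchanged(baseline, altered, delta=0.05):
    """Fraction of samples whose prediction moved by less than delta.

    baseline: predictions for audio with the baseline simulated position
    altered:  predictions for audio with a different simulated position
    """
    if len(baseline) != len(altered):
        raise ValueError("both prediction lists must have the same length")
    if not baseline:
        raise ValueError("at least one prediction is required")
    # Assumption: 'unchanged' means the absolute difference is below delta.
    unchanged = sum(abs(a - b) < delta for a, b in zip(altered, baseline))
    return unchanged / len(baseline)


# Made-up predictions for four samples, for illustration only.
baseline = [0.50, 0.20, 0.90, 0.40]
altered = [0.52, 0.30, 0.88, 0.46]

score = percentage_unchanged(baseline, altered)
print(score, "passes" if score > 0.8 else "fails")
```

Here two of the four predictions stay within the allowed difference, so the score of 0.5 would fall short of the 0.8 threshold.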

Figure panels: w2v2-L, hubert-L, wavlm, data2vec