Robustness simulated recording condition

Overall scores

| Model | Overall Score |
|---|---|
| w2v2-L | 16.7% passed tests (1 passed / 5 failed) |
| hubert-L | 16.7% passed tests (1 passed / 5 failed) |
| wavlm | 50.0% passed tests (3 passed / 3 failed) |
| data2vec | 16.7% passed tests (1 passed / 5 failed) |

Percentage Unchanged Predictions Simulated Position

Threshold: 0.8

| Data | w2v2-L | hubert-L | wavlm | data2vec |
|---|---|---|---|---|
| emovo-1.2.1-emotion.test | 0.60 | 0.75 | 0.83 | 0.59 |
| imda-nsc-read-speech-balanced-2.6.0-headset | 0.78 | 0.79 | 0.90 | 0.73 |
| timit-1.4.1-files | 0.87 | 0.86 | 0.94 | 0.83 |
| mean | 0.75 | 0.80 | 0.89 | 0.72 |

Percentage Unchanged Predictions Simulated Room

Threshold: 0.8

| Data | w2v2-L | hubert-L | wavlm | data2vec |
|---|---|---|---|---|
| emovo-1.2.1-emotion.test | 0.42 | 0.67 | 0.72 | 0.45 |
| imda-nsc-read-speech-balanced-2.6.0-headset | 0.56 | 0.80 | 0.61 | 0.53 |
| timit-1.4.1-files | 0.68 | 0.77 | 0.54 | 0.65 |
| mean | 0.55 | 0.75 | 0.62 | 0.54 |
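The overall scores can be cross-checked against the two tables above. The following is a minimal sketch, not the report's own implementation: it assumes that each (test, dataset) cell counts as one test and that a test passes only when the rounded table value is strictly greater than the threshold of 0.8. The strict comparison is an assumption chosen because it reproduces the reported counts (the hubert-L room value of 0.80 is counted as failed); the names `results` and `overall_score` are hypothetical.

```python
THRESHOLD = 0.8

# Values copied from the two tables above.
# Row order: emovo, imda-nsc-read-speech-balanced, timit (mean row excluded).
results = {
    "w2v2-L": {"position": [0.60, 0.78, 0.87], "room": [0.42, 0.56, 0.68]},
    "hubert-L": {"position": [0.75, 0.79, 0.86], "room": [0.67, 0.80, 0.77]},
    "wavlm": {"position": [0.83, 0.90, 0.94], "room": [0.72, 0.61, 0.54]},
    "data2vec": {"position": [0.59, 0.73, 0.83], "room": [0.45, 0.53, 0.65]},
}


def overall_score(per_test, threshold=THRESHOLD):
    """Return (passed, failed, percentage passed) over all tests of one model."""
    values = [v for scores in per_test.values() for v in scores]
    # Assumption: a test passes only if its value is strictly above the threshold.
    passed = sum(v > threshold for v in values)
    failed = len(values) - passed
    return passed, failed, 100 * passed / len(values)


for model, per_test in results.items():
    passed, failed, pct = overall_score(per_test)
    print(f"{model}: {pct:.1f}% passed tests ({passed} passed / {failed} failed)")
```

Under these assumptions the loop prints the same four lines as the Overall scores table. Because the inputs are rounded to two decimals, a value shown as 0.80 may lie on either side of the threshold in the unrounded data.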

Visualization Simulated Position

Each figure shows the difference between the predictions for audio with a baseline simulated position and the predictions for audio with a different simulated position. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two sets of predictions.

| Data | w2v2-L | hubert-L | wavlm | data2vec |
|---|---|---|---|---|
| emovo-1.2.1-emotion.test | ../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test15.png | ../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test19.png | ../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test20.png | ../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test21.png |
| imda-nsc-read-speech-balanced-2.6.0-headset | ../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset15.png | ../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset19.png | ../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset20.png | ../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset21.png |
| timit-1.4.1-files | ../../../_images/visualization-simulated-position_timit-1.4.1-files15.png | ../../../_images/visualization-simulated-position_timit-1.4.1-files19.png | ../../../_images/visualization-simulated-position_timit-1.4.1-files20.png | ../../../_images/visualization-simulated-position_timit-1.4.1-files21.png |

Visualization Simulated Room

| Data | w2v2-L | hubert-L | wavlm | data2vec |
|---|---|---|---|---|
| emovo-1.2.1-emotion.test | ../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test15.png | ../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test19.png | ../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test20.png | ../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test21.png |
| imda-nsc-read-speech-balanced-2.6.0-headset | ../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset15.png | ../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset19.png | ../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset20.png | ../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset21.png |
| timit-1.4.1-files | ../../../_images/visualization-simulated-room_timit-1.4.1-files15.png | ../../../_images/visualization-simulated-room_timit-1.4.1-files19.png | ../../../_images/visualization-simulated-room_timit-1.4.1-files20.png | ../../../_images/visualization-simulated-room_timit-1.4.1-files21.png |
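The "Percentage Unchanged Predictions" metric, together with the allowed prediction difference \(\delta < 0.05\) described for the visualizations, can be illustrated as follows. This is a hedged sketch rather than the report's actual code: it assumes one scalar prediction per audio file, with baseline and altered predictions paired by index, and it reports the result as a fraction between 0 and 1, as in the tables. The function name `percentage_unchanged` and the sample values are hypothetical.

```python
def percentage_unchanged(baseline, altered, delta=0.05):
    """Fraction of paired predictions whose absolute difference is below delta."""
    if len(baseline) != len(altered):
        raise ValueError("baseline and altered must have the same length")
    if not baseline:
        raise ValueError("at least one prediction pair is required")
    # A prediction counts as unchanged if it moved by less than delta.
    unchanged = sum(abs(a - b) < delta for b, a in zip(baseline, altered))
    return unchanged / len(baseline)


# Hypothetical predictions for four files (not taken from the report).
baseline = [0.50, 0.62, 0.40, 0.71]  # baseline simulated condition
altered = [0.52, 0.70, 0.41, 0.60]   # different simulated condition

score = percentage_unchanged(baseline, altered)
print(score)  # 0.5: two of the four predictions moved by less than 0.05
```

A score computed this way would then be compared against the threshold of 0.8 stated for each test; the example value of 0.5 would fall below it.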