Robustness simulated recording condition

Overall scores

CNN14

w2v2-b

hubert-b

axlstm

Overall Score

0.0% passed tests (0 passed / 6 failed).

0.0% passed tests (0 passed / 6 failed).

0.0% passed tests (0 passed / 6 failed).

0.0% passed tests (0 passed / 6 failed).

Percentage Unchanged Predictions Simulated Position

Threshold: 0.8

Data

Percentage Unchanged Predictions Simulated Position

CNN14

w2v2-b

hubert-b

axlstm

emovo-1.2.1-emotion.test

0.60

0.59

0.64

0.57

imda-nsc-read-speech-balanced-2.6.0-headset

0.75

0.67

0.79

0.68

timit-1.4.1-files

0.58

0.75

0.80

0.60

mean

0.64

0.67

0.74

0.62

Percentage Unchanged Predictions Simulated Room

Threshold: 0.8

Data

Percentage Unchanged Predictions Simulated Room

CNN14

w2v2-b

hubert-b

axlstm

emovo-1.2.1-emotion.test

0.49

0.49

0.60

0.43

imda-nsc-read-speech-balanced-2.6.0-headset

0.53

0.73

0.74

0.41

timit-1.4.1-files

0.41

0.66

0.72

0.42

mean

0.48

0.63

0.69

0.42

Visualization Simulated Position

Difference of predictions for audio with a baseline simulated position and audio with a different simulated position. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

CNN14

w2v2-b

hubert-b

axlstm

../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test11.png
../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test12.png
../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test13.png
../../../_images/visualization-simulated-position_emovo-1.2.1-emotion.test14.png
../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset11.png
../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset12.png
../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset13.png
../../../_images/visualization-simulated-position_imda-nsc-read-speech-balanced-2.6.0-headset14.png
../../../_images/visualization-simulated-position_timit-1.4.1-files11.png
../../../_images/visualization-simulated-position_timit-1.4.1-files12.png
../../../_images/visualization-simulated-position_timit-1.4.1-files13.png
../../../_images/visualization-simulated-position_timit-1.4.1-files14.png
../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test11.png
../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test12.png
../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test13.png
../../../_images/visualization-simulated-room_emovo-1.2.1-emotion.test14.png
../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset11.png
../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset12.png
../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset13.png
../../../_images/visualization-simulated-room_imda-nsc-read-speech-balanced-2.6.0-headset14.png
../../../_images/visualization-simulated-room_timit-1.4.1-files11.png
../../../_images/visualization-simulated-room_timit-1.4.1-files12.png
../../../_images/visualization-simulated-room_timit-1.4.1-files13.png
../../../_images/visualization-simulated-room_timit-1.4.1-files14.png