Robustness background noise

Overall scores

w2v2-L

hubert-L

wavlm

data2vec

Overall Score

16.7% passed tests (4 passed / 20 failed).

37.5% passed tests (9 passed / 15 failed).

33.3% passed tests (8 passed / 16 failed).

37.5% passed tests (9 passed / 15 failed).

Change Ccc Babble Noise

Threshold: -0.05

Data

Change CCC Babble Noise

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.06

-0.03

-0.02

-0.04

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.07

-0.02

0.01

-0.02

mean

-0.07

-0.03

-0.01

-0.03

Change Ccc Coughing

Threshold: -0.05

Data

Change CCC Coughing

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.14

-0.11

-0.08

-0.05

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.12

-0.10

-0.12

-0.06

mean

-0.13

-0.11

-0.10

-0.06

Change Ccc Environmental Noise

Threshold: -0.05

Data

Change CCC Environmental Noise

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.05

-0.03

-0.02

-0.03

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.03

-0.01

0.01

-0.01

mean

-0.04

-0.02

-0.01

-0.02

Change Ccc Music

Threshold: -0.05

Data

Change CCC Music

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.04

-0.02

-0.03

-0.02

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.03

-0.01

0.01

-0.01

mean

-0.04

-0.01

-0.01

-0.01

Change Ccc Sneezing

Threshold: -0.05

Data

Change CCC Sneezing

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.10

-0.07

-0.05

-0.04

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.10

-0.07

-0.10

-0.06

mean

-0.10

-0.07

-0.08

-0.05

Change Ccc White Noise

Threshold: -0.05

Data

Change CCC White Noise

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.06

-0.04

-0.04

-0.03

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.03

-0.02

0.02

-0.02

mean

-0.04

-0.03

-0.01

-0.03

Percentage Unchanged Predictions Babble Noise

Threshold: 0.9

Data

Percentage Unchanged Predictions Babble Noise

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.65

0.87

0.81

0.76

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.55

0.78

0.71

0.76

mean

0.60

0.82

0.76

0.76

Percentage Unchanged Predictions Coughing

Threshold: 0.9

Data

Percentage Unchanged Predictions Coughing

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.43

0.48

0.55

0.62

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.41

0.43

0.37

0.59

mean

0.42

0.45

0.46

0.60

Percentage Unchanged Predictions Environmental Noise

Threshold: 0.9

Data

Percentage Unchanged Predictions Environmental Noise

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.75

0.86

0.79

0.80

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.70

0.86

0.80

0.85

mean

0.72

0.86

0.80

0.82

Percentage Unchanged Predictions Music

Threshold: 0.9

Data

Percentage Unchanged Predictions Music

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.79

0.91

0.80

0.82

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.71

0.88

0.77

0.85

mean

0.75

0.90

0.79

0.83

Percentage Unchanged Predictions Sneezing

Threshold: 0.9

Data

Percentage Unchanged Predictions Sneezing

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.51

0.55

0.58

0.55

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.41

0.49

0.32

0.56

mean

0.46

0.52

0.45

0.56

Percentage Unchanged Predictions White Noise

Threshold: 0.9

Data

Percentage Unchanged Predictions White Noise

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.66

0.78

0.48

0.67

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.63

0.72

0.73

0.74

mean

0.65

0.75

0.60

0.71

Visualization Babble Noise

Difference of predictions for clean audio and audio with added babble noise. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

w2v2-L

hubert-L

wavlm

data2vec

../../../_images/visualization-babble-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard26.png
../../../_images/visualization-babble-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard30.png
../../../_images/visualization-babble-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard31.png
../../../_images/visualization-babble-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard32.png
../../../_images/visualization-babble-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard26.png
../../../_images/visualization-babble-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard30.png
../../../_images/visualization-babble-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard31.png
../../../_images/visualization-babble-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard32.png
../../../_images/visualization-coughing_iemocap-2.3.0-emotion.dimensions.test.gold_standard26.png
../../../_images/visualization-coughing_iemocap-2.3.0-emotion.dimensions.test.gold_standard30.png
../../../_images/visualization-coughing_iemocap-2.3.0-emotion.dimensions.test.gold_standard31.png
../../../_images/visualization-coughing_iemocap-2.3.0-emotion.dimensions.test.gold_standard32.png
../../../_images/visualization-coughing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard26.png
../../../_images/visualization-coughing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard30.png
../../../_images/visualization-coughing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard31.png
../../../_images/visualization-coughing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard32.png
../../../_images/visualization-environmental-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard26.png
../../../_images/visualization-environmental-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard30.png
../../../_images/visualization-environmental-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard31.png
../../../_images/visualization-environmental-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard32.png
../../../_images/visualization-environmental-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard26.png
../../../_images/visualization-environmental-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard30.png
../../../_images/visualization-environmental-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard31.png
../../../_images/visualization-environmental-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard32.png
../../../_images/visualization-music_iemocap-2.3.0-emotion.dimensions.test.gold_standard26.png
../../../_images/visualization-music_iemocap-2.3.0-emotion.dimensions.test.gold_standard30.png
../../../_images/visualization-music_iemocap-2.3.0-emotion.dimensions.test.gold_standard31.png
../../../_images/visualization-music_iemocap-2.3.0-emotion.dimensions.test.gold_standard32.png
../../../_images/visualization-music_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard26.png
../../../_images/visualization-music_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard30.png
../../../_images/visualization-music_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard31.png
../../../_images/visualization-music_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard32.png
../../../_images/visualization-sneezing_iemocap-2.3.0-emotion.dimensions.test.gold_standard26.png
../../../_images/visualization-sneezing_iemocap-2.3.0-emotion.dimensions.test.gold_standard30.png
../../../_images/visualization-sneezing_iemocap-2.3.0-emotion.dimensions.test.gold_standard31.png
../../../_images/visualization-sneezing_iemocap-2.3.0-emotion.dimensions.test.gold_standard32.png
../../../_images/visualization-sneezing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard26.png
../../../_images/visualization-sneezing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard30.png
../../../_images/visualization-sneezing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard31.png
../../../_images/visualization-sneezing_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard32.png
../../../_images/visualization-white-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard26.png
../../../_images/visualization-white-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard30.png
../../../_images/visualization-white-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard31.png
../../../_images/visualization-white-noise_iemocap-2.3.0-emotion.dimensions.test.gold_standard32.png
../../../_images/visualization-white-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard26.png
../../../_images/visualization-white-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard30.png
../../../_images/visualization-white-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard31.png
../../../_images/visualization-white-noise_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard32.png