Robustness low quality phone

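Overall scores

========  ========================================
Model     Overall Score
========  ========================================
w2v2-L    25.0% passed tests (1 passed / 3 failed)
hubert-L  75.0% passed tests (3 passed / 1 failed)
wavlm     100.0% passed tests (4 passed / 0 failed)
data2vec  75.0% passed tests (3 passed / 1 failed)
========  ========================================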
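The overall score appears to be the share of individual tests (one per metric and dataset) that pass their threshold. The following is a minimal sketch of that bookkeeping, not the benchmark's actual implementation. It assumes that a test passes when its value is strictly greater than the threshold and that the "mean" rows are not counted as tests; the names `THRESHOLDS`, `RESULTS`, and `overall_score` are hypothetical. The values are the rounded per-dataset numbers reported on this page.

```python
# Thresholds as stated on this page.
# Assumption: a test passes when value > threshold (strict comparison).
THRESHOLDS = {"change_ccc": -0.05, "unchanged": 0.5}

# Rounded per-dataset values copied from the tables on this page,
# in the order (iemocap, msppodcast). The "mean" rows are left out.
RESULTS = {
    "w2v2-L": {"change_ccc": [-0.06, -0.03], "unchanged": [0.39, 0.39]},
    "hubert-L": {"change_ccc": [-0.05, -0.01], "unchanged": [0.75, 0.74]},
    "wavlm": {"change_ccc": [-0.01, -0.01], "unchanged": [0.82, 0.76]},
    "data2vec": {"change_ccc": [-0.02, -0.01], "unchanged": [0.52, 0.42]},
}


def overall_score(metrics):
    """Summarise pass/fail outcomes for one model as a score string."""
    outcomes = [
        value > THRESHOLDS[name]
        for name, values in metrics.items()
        for value in values
    ]
    passed = sum(outcomes)
    failed = len(outcomes) - passed
    percent = 100.0 * passed / len(outcomes)
    return f"{percent:.1f}% passed tests ({passed} passed / {failed} failed)."


for model, metrics in RESULTS.items():
    print(model, overall_score(metrics))
```

Under these assumptions the sketch gives the same pass counts as the overall scores listed for the four models. The real tests may compare unrounded values, so borderline cases such as -0.05 against a threshold of -0.05 could be decided differently there.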

Change CCC Low Quality Phone

Threshold: -0.05

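Change CCC Low Quality Phone

========================================================  ======  ========  =====  ========
Data                                                      w2v2-L  hubert-L  wavlm  data2vec
========================================================  ======  ========  =====  ========
iemocap-2.3.0-emotion.dimensions.test.gold_standard       -0.06   -0.05     -0.01  -0.02
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  -0.03   -0.01     -0.01  -0.01
mean                                                      -0.04   -0.03     -0.01  -0.01
========================================================  ======  ========  =====  ========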
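The change in CCC can be read as the concordance correlation coefficient on the low quality phone audio minus the CCC on the original audio, so that a negative value means a loss in performance. The sketch below illustrates this under that assumed sign convention. The function names `ccc` and `change_ccc` are hypothetical, and the benchmark's own implementation may differ in detail.

```python
import numpy as np


def ccc(truth, prediction):
    """Concordance correlation coefficient between two 1-D sequences."""
    truth = np.asarray(truth, dtype=float)
    prediction = np.asarray(prediction, dtype=float)
    mean_t, mean_p = truth.mean(), prediction.mean()
    # Population (biased) variance and covariance
    var_t, var_p = truth.var(), prediction.var()
    covariance = np.mean((truth - mean_t) * (prediction - mean_p))
    return 2 * covariance / (var_t + var_p + (mean_t - mean_p) ** 2)


def change_ccc(truth, pred_original, pred_degraded):
    """CCC on degraded audio minus CCC on original audio.

    Assumption: negative values indicate that the degradation hurts.
    """
    return ccc(truth, pred_degraded) - ccc(truth, pred_original)


# Toy data, not taken from the benchmark
truth = [0.1, 0.4, 0.6, 0.9]
pred_original = [0.15, 0.35, 0.65, 0.85]
pred_degraded = [0.30, 0.40, 0.55, 0.60]
delta = change_ccc(truth, pred_original, pred_degraded)
print(f"change in CCC: {delta:.2f}, passes: {delta > -0.05}")
```

With the threshold of -0.05 stated above, a model would pass as long as its CCC drops by less than 0.05 on the degraded audio.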

Percentage Unchanged Predictions Low Quality Phone

Threshold: 0.5

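Percentage Unchanged Predictions Low Quality Phone

========================================================  ======  ========  =====  ========
Data                                                      w2v2-L  hubert-L  wavlm  data2vec
========================================================  ======  ========  =====  ========
iemocap-2.3.0-emotion.dimensions.test.gold_standard       0.39    0.75      0.82   0.52
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard  0.39    0.74      0.76   0.42
mean                                                      0.39    0.74      0.79   0.47
========================================================  ======  ========  =====  ========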
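One plausible reading of this metric is the fraction of samples whose prediction moves by less than the allowed difference of 0.05 when the audio is degraded. The sketch below follows that reading. The function name `fraction_unchanged` and its `max_delta` parameter are hypothetical, and the toy values are not benchmark data.

```python
import numpy as np


def fraction_unchanged(pred_original, pred_degraded, max_delta=0.05):
    """Fraction of samples with |original - degraded| < max_delta."""
    pred_original = np.asarray(pred_original, dtype=float)
    pred_degraded = np.asarray(pred_degraded, dtype=float)
    unchanged = np.abs(pred_original - pred_degraded) < max_delta
    return float(unchanged.mean())


# Toy predictions: three of the four samples stay within the allowed difference
pred_original = [0.10, 0.50, 0.90, 0.30]
pred_degraded = [0.12, 0.70, 0.88, 0.30]
score = fraction_unchanged(pred_original, pred_degraded)
print(f"unchanged: {score:.2f}, passes: {score > 0.5}")
```

With the threshold of 0.5 given above, a model would pass when more than half of its predictions remain within the allowed difference.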

Visualization

Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

Models (columns, left to right): w2v2-L, hubert-L, wavlm, data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard68.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard82.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard83.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard84.png

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard106.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard126.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard127.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard128.png