Robustness low quality phone

Overall scores

w2v2-L

hubert-L

wavlm

data2vec

Overall Score

75.0% passed tests (3 passed / 1 failed).

50.0% passed tests (2 passed / 2 failed).

75.0% passed tests (3 passed / 1 failed).

50.0% passed tests (2 passed / 2 failed).

Change Ccc Low Quality Phone

Threshold: -0.05

Data

Change CCC Low Quality Phone

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

-0.02

-0.05

-0.05

-0.05

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

-0.05

-0.05

0.01

-0.05

mean

-0.04

-0.05

-0.02

-0.05

Percentage Unchanged Predictions Low Quality Phone

Threshold: 0.5

Data

Percentage Unchanged Predictions Low Quality Phone

w2v2-L

hubert-L

wavlm

data2vec

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.64

0.62

0.58

0.64

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.53

0.68

0.68

0.65

mean

0.58

0.65

0.63

0.65

Visualization

Difference of predictions for original audio and low quality phone audio. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.

w2v2-L

hubert-L

wavlm

data2vec

../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard112.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard126.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard127.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard128.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard172.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard192.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard193.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard194.png