Robustness small changes

Overall scores

CNN14

w2v2-b

hubert-b

axlstm

Overall Score

75.0% passed tests (15 passed / 5 failed).

90.0% passed tests (18 passed / 2 failed).

80.0% passed tests (16 passed / 4 failed).

40.0% passed tests (8 passed / 12 failed).

Percentage Unchanged Predictions Additive Tone

Threshold: 0.95

Data

Percent Unchanged Pred Additive Tone

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.95

0.97

0.93

0.75

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.92

0.98

0.98

0.80

mean

0.94

0.97

0.96

0.78

Percentage Unchanged Predictions Append Zeros

Threshold: 0.95

Data

Percent Unchanged Pred Append Zeros

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

1.00

1.00

0.87

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

0.92

mean

0.99

1.00

1.00

0.90

Percentage Unchanged Predictions Clip

Threshold: 0.95

Data

Percent Unchanged Pred Clip

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

1.00

0.99

0.98

0.93

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

0.98

mean

1.00

0.99

0.99

0.96

Percentage Unchanged Predictions Crop Beginning

Threshold: 0.95

Data

Percent Unchanged Pred Crop Beginning

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

0.97

0.96

0.64

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

0.99

0.68

mean

0.99

0.98

0.97

0.66

Percentage Unchanged Predictions Crop End

Threshold: 0.95

Data

Percent Unchanged Pred Crop End

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

1.00

1.00

0.94

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

0.97

mean

0.99

1.00

1.00

0.95

Percentage Unchanged Predictions Gain

Threshold: 0.95

Data

Percent Unchanged Pred Gain

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

1.00

1.00

1.00

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.98

1.00

1.00

1.00

mean

0.98

1.00

1.00

1.00

Percentage Unchanged Predictions Highpass Filter

Threshold: 0.95

Data

Percent Unchanged Pred Highpass Filter

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

0.99

0.99

0.99

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.99

1.00

1.00

1.00

mean

0.99

0.99

0.99

0.99

Percentage Unchanged Predictions Lowpass Filter

Threshold: 0.95

Data

Percent Unchanged Pred Lowpass Filter

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

1.00

1.00

1.00

1.00

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.94

1.00

1.00

0.97

mean

0.97

1.00

1.00

0.98

Percentage Unchanged Predictions Prepend Zeros

Threshold: 0.95

Data

Percent Unchanged Pred Prepend Zeros

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

0.96

0.94

0.61

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

0.99

0.67

mean

0.99

0.98

0.96

0.64

Percentage Unchanged Predictions White Noise

Threshold: 0.95

Data

Percent Unchanged Pred White Noise

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.72

0.93

0.88

0.62

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.73

0.83

0.86

0.69

mean

0.72

0.88

0.87

0.66

Visualization

CNN14

w2v2-b

hubert-b

axlstm

../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard12.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard13.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard14.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard15.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard20.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard21.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard22.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard23.png