Robustness small changes

Overall scores

CNN14

w2v2-b

hubert-b

axlstm

Overall Score

80.0% passed tests (16 passed / 4 failed).

90.0% passed tests (18 passed / 2 failed).

80.0% passed tests (16 passed / 4 failed).

40.0% passed tests (8 passed / 12 failed).

Percentage Unchanged Predictions Additive Tone

Threshold: 0.95

Data

Percent Unchanged Pred Additive Tone

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.87

0.97

0.94

0.81

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.95

0.99

0.98

0.84

mean

0.91

0.98

0.96

0.82

Percentage Unchanged Predictions Append Zeros

Threshold: 0.95

Data

Percent Unchanged Pred Append Zeros

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

1.00

1.00

0.90

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

0.93

mean

0.99

1.00

1.00

0.92

Percentage Unchanged Predictions Clip

Threshold: 0.95

Data

Percent Unchanged Pred Clip

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

1.00

0.99

0.99

0.94

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

0.99

mean

1.00

0.99

0.99

0.96

Percentage Unchanged Predictions Crop Beginning

Threshold: 0.95

Data

Percent Unchanged Pred Crop Beginning

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

0.99

0.97

0.71

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

0.72

mean

0.99

0.99

0.98

0.71

Percentage Unchanged Predictions Crop End

Threshold: 0.95

Data

Percent Unchanged Pred Crop End

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

1.00

1.00

0.95

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

0.98

mean

0.99

1.00

1.00

0.96

Percentage Unchanged Predictions Gain

Threshold: 0.95

Data

Percent Unchanged Pred Gain

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.97

1.00

1.00

1.00

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.99

1.00

1.00

1.00

mean

0.98

1.00

1.00

1.00

Percentage Unchanged Predictions Highpass Filter

Threshold: 0.95

Data

Percent Unchanged Pred Highpass Filter

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

0.99

0.99

1.00

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

1.00

1.00

mean

0.99

0.99

0.99

1.00

Percentage Unchanged Predictions Lowpass Filter

Threshold: 0.95

Data

Percent Unchanged Pred Lowpass Filter

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

1.00

1.00

1.00

1.00

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.98

1.00

1.00

0.98

mean

0.99

1.00

1.00

0.99

Percentage Unchanged Predictions Prepend Zeros

Threshold: 0.95

Data

Percent Unchanged Pred Prepend Zeros

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.99

0.98

0.95

0.67

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

1.00

1.00

0.99

0.71

mean

0.99

0.99

0.97

0.69

Percentage Unchanged Predictions White Noise

Threshold: 0.95

Data

Percent Unchanged Pred White Noise

CNN14

w2v2-b

hubert-b

axlstm

iemocap-2.3.0-emotion.dimensions.test.gold_standard

0.71

0.91

0.90

0.75

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

0.80

0.80

0.84

0.62

mean

0.76

0.85

0.87

0.69

Visualization

CNN14

w2v2-b

hubert-b

axlstm

../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard56.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard57.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard58.png
../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard59.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard86.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard87.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard88.png
../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard89.png