Robustness small changes

Overall scores

| Model | Overall Score |
|---|---|
| w2v2-b | 75.0% passed tests (15 passed / 5 failed) |
| w2v2-L | 80.0% passed tests (16 passed / 4 failed) |
| w2v2-L-robust | 90.0% passed tests (18 passed / 2 failed) |
| w2v2-L-xls-r | 60.0% passed tests (12 passed / 8 failed) |
| w2v2-L-vox | 80.0% passed tests (16 passed / 4 failed) |
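The overall scores can be cross-checked against the per-test values reported in the rest of this section. The sketch below is not the evaluation code behind the report; it simply re-tallies the published numbers. It assumes that each test/dataset combination counts as one test (10 perturbations x 2 datasets = 20) and that a test passes only when the tabulated value is strictly greater than the threshold of 0.95. The strict comparison is an assumption: the tables show values rounded to two decimals, and the reported counts are only reproduced if a displayed 0.95 counts as a failure (presumably because the unrounded value lies just below the threshold). The names `THRESHOLD`, `MODELS`, `RESULTS` and `tally` are hypothetical.

```python
THRESHOLD = 0.95
MODELS = ["w2v2-b", "w2v2-L", "w2v2-L-robust", "w2v2-L-xls-r", "w2v2-L-vox"]

# Perturbation -> (iemocap row, msppodcast row), values in MODELS order.
# Copied from the tables in this section (rounded to two decimals).
RESULTS = {
    "Additive Tone": ([0.92, 0.99, 0.99, 0.95, 0.99], [0.96, 0.96, 0.99, 0.94, 0.99]),
    "Append Zeros": ([0.99, 1.00, 1.00, 1.00, 1.00], [1.00, 1.00, 1.00, 1.00, 1.00]),
    "Clip": ([0.98, 1.00, 1.00, 0.99, 1.00], [1.00, 1.00, 1.00, 1.00, 1.00]),
    "Crop Beginning": ([0.92, 0.95, 0.96, 0.91, 0.95], [0.97, 0.97, 0.97, 0.91, 0.98]),
    "Crop End": ([0.99, 1.00, 0.99, 0.99, 0.99], [1.00, 1.00, 1.00, 1.00, 1.00]),
    "Gain": ([1.00, 1.00, 1.00, 1.00, 1.00], [1.00, 1.00, 1.00, 1.00, 1.00]),
    "Highpass Filter": ([0.99, 1.00, 0.99, 0.99, 0.97], [0.99, 1.00, 1.00, 0.98, 0.96]),
    "Lowpass Filter": ([1.00, 1.00, 1.00, 1.00, 1.00], [1.00, 1.00, 1.00, 0.99, 0.99]),
    "Prepend Zeros": ([0.92, 0.94, 0.91, 0.87, 0.94], [0.97, 0.97, 0.96, 0.88, 0.97]),
    "White Noise": ([0.83, 0.92, 0.82, 0.76, 0.83], [0.83, 0.88, 0.97, 0.68, 0.84]),
}


def tally(results, threshold):
    """Count passed tests per model; return (passed_per_model, tests_per_model)."""
    passed = {model: 0 for model in MODELS}
    total = 0
    for dataset_rows in results.values():
        for row in dataset_rows:
            total += 1  # one test per perturbation and dataset
            for model, value in zip(MODELS, row):
                # Assumption: strict comparison on the rounded value.
                if value > threshold:
                    passed[model] += 1
    return passed, total


passed, total = tally(RESULTS, THRESHOLD)
for model in MODELS:
    share = 100 * passed[model] / total
    failed = total - passed[model]
    print(f"{model}: {share:.1f}% passed tests ({passed[model]} passed / {failed} failed)")
```

Under these assumptions the tally matches the reported counts for all five models. The `mean` rows of the per-test tables are not counted as separate tests.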

Percentage Unchanged Predictions Additive Tone

Threshold: 0.95

Percent Unchanged Pred Additive Tone

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.92 | 0.99 | 0.99 | 0.95 | 0.99 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.96 | 0.96 | 0.99 | 0.94 | 0.99 |
| mean | 0.94 | 0.97 | 0.99 | 0.94 | 0.99 |
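The report does not spell out how a "percentage unchanged predictions" value is obtained. One plausible reading, sketched below, is the share of samples whose predicted dimensional value moves by less than some tolerance when the input audio is perturbed. Both the function `percent_unchanged` and the tolerance of 0.05 are assumptions made for illustration and are not taken from the source; the prediction values are made up.

```python
def percent_unchanged(original, perturbed, tolerance=0.05):
    """Share of samples whose prediction changes by less than `tolerance`.

    `original` and `perturbed` hold one prediction per sample, for the clean
    and the perturbed audio respectively. The tolerance is a hypothetical
    value chosen for this sketch.
    """
    if len(original) != len(perturbed):
        raise ValueError("Both prediction lists must have the same length.")
    if not original:
        raise ValueError("At least one prediction is required.")
    unchanged = sum(
        1 for clean, noisy in zip(original, perturbed)
        if abs(clean - noisy) < tolerance
    )
    return unchanged / len(original)


# Made-up arousal predictions for four samples, before and after perturbation.
clean_predictions = [0.50, 0.62, 0.31, 0.80]
perturbed_predictions = [0.51, 0.60, 0.45, 0.79]

score = percent_unchanged(clean_predictions, perturbed_predictions)
print(f"{score:.2f}")
```

In this made-up example three of the four predictions stay within the tolerance, so the score is 0.75, which would fall below the threshold of 0.95 used in this section.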

Percentage Unchanged Predictions Append Zeros

Threshold: 0.95

Percent Unchanged Pred Append Zeros

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.99 | 1.00 | 1.00 | 1.00 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 0.99 | 1.00 | 1.00 | 1.00 | 1.00 |

Percentage Unchanged Predictions Clip

Threshold: 0.95

Percent Unchanged Pred Clip

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.98 | 1.00 | 1.00 | 0.99 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 0.99 | 1.00 | 1.00 | 0.99 | 1.00 |

Percentage Unchanged Predictions Crop Beginning

Threshold: 0.95

Percent Unchanged Pred Crop Beginning

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.92 | 0.95 | 0.96 | 0.91 | 0.95 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.97 | 0.97 | 0.97 | 0.91 | 0.98 |
| mean | 0.95 | 0.96 | 0.96 | 0.91 | 0.96 |

Percentage Unchanged Predictions Crop End

Threshold: 0.95

Percent Unchanged Pred Crop End

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.99 | 1.00 | 0.99 | 0.99 | 0.99 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 0.99 | 1.00 | 0.99 | 0.99 | 0.99 |

Percentage Unchanged Predictions Gain

Threshold: 0.95

Percent Unchanged Pred Gain

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |

Percentage Unchanged Predictions Highpass Filter

Threshold: 0.95

Percent Unchanged Pred Highpass Filter

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.99 | 1.00 | 0.99 | 0.99 | 0.97 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.99 | 1.00 | 1.00 | 0.98 | 0.96 |
| mean | 0.99 | 1.00 | 0.99 | 0.98 | 0.96 |

Percentage Unchanged Predictions Lowpass Filter

Threshold: 0.95

Percent Unchanged Pred Lowpass Filter

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 0.99 | 0.99 |
| mean | 1.00 | 1.00 | 1.00 | 0.99 | 0.99 |

Percentage Unchanged Predictions Prepend Zeros

Threshold: 0.95

Percent Unchanged Pred Prepend Zeros

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.92 | 0.94 | 0.91 | 0.87 | 0.94 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.97 | 0.97 | 0.96 | 0.88 | 0.97 |
| mean | 0.95 | 0.95 | 0.94 | 0.88 | 0.95 |

Percentage Unchanged Predictions White Noise

Threshold: 0.95

Percent Unchanged Pred White Noise

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
|---|---|---|---|---|---|
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.83 | 0.92 | 0.82 | 0.76 | 0.83 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.83 | 0.88 | 0.97 | 0.68 | 0.84 |
| mean | 0.83 | 0.90 | 0.90 | 0.72 | 0.83 |

Visualization

iemocap-2.3.0-emotion.dimensions.test.gold_standard

| Model | Image |
|---|---|
| w2v2-b | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard101.png |
| w2v2-L | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard116.png |
| w2v2-L-robust | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard117.png |
| w2v2-L-xls-r | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard118.png |
| w2v2-L-vox | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard119.png |

msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard

| Model | Image |
|---|---|
| w2v2-b | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard153.png |
| w2v2-L | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard176.png |
| w2v2-L-robust | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard177.png |
| w2v2-L-xls-r | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard178.png |
| w2v2-L-vox | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard179.png |