Robustness small changes

Overall scores

| | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| Overall Score | 90.0% passed tests (18 passed / 2 failed). | 90.0% passed tests (18 passed / 2 failed). | 95.0% passed tests (19 passed / 1 failed). | 90.0% passed tests (18 passed / 2 failed). | 90.0% passed tests (18 passed / 2 failed). |

Percentage Unchanged Predictions Additive Tone

Threshold: 0.95

Percent Unchanged Pred Additive Tone

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.97 | 1.00 | 1.00 | 0.99 | 0.98 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.99 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 0.98 | 1.00 | 1.00 | 0.99 | 0.99 |
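The page does not spell out how a prediction is judged to be "unchanged". The sketch below shows one plausible reading, assuming that a prediction counts as unchanged when it moves by less than a fixed tolerance after the perturbation. The function name `percentage_unchanged`, the tolerance of 0.05, and the sample values are all hypothetical and are not taken from the benchmark itself; only the threshold of 0.95 comes from this page.

```python
def percentage_unchanged(original, perturbed, tolerance=0.05):
    """Return the fraction of predictions that move by less than `tolerance`.

    `tolerance` is an assumed value; the report does not state the criterion.
    """
    if len(original) != len(perturbed):
        raise ValueError("both prediction lists must have the same length")
    if not original:
        raise ValueError("at least one prediction is required")
    unchanged = sum(
        1
        for before, after in zip(original, perturbed)
        if abs(before - after) < tolerance
    )
    return unchanged / len(original)


# Threshold used by every test on this page.
THRESHOLD = 0.95

# Hypothetical predictions before and after a perturbation.
original = [0.30, 0.52, 0.71, 0.45]
perturbed = [0.31, 0.50, 0.80, 0.46]

score = percentage_unchanged(original, perturbed)
passed = score >= THRESHOLD  # assumed comparison: pass at or above threshold
print(f"{score:.2f}", "passed" if passed else "failed")
```

In this toy example three of the four predictions stay within the tolerance, so the score falls below the threshold and the test would count as failed.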

Percentage Unchanged Predictions Append Zeros

Threshold: 0.95

Percent Unchanged Pred Append Zeros

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |

Percentage Unchanged Predictions Clip

Threshold: 0.95

Percent Unchanged Pred Clip

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.99 | 0.99 | 1.00 | 0.99 | 0.99 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 0.99 | 0.99 | 1.00 | 0.99 | 0.99 |

Percentage Unchanged Predictions Crop Beginning

Threshold: 0.95

Percent Unchanged Pred Crop Beginning

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 |

Percentage Unchanged Predictions Crop End

Threshold: 0.95

Percent Unchanged Pred Crop End

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |

Percentage Unchanged Predictions Gain

Threshold: 0.95

Percent Unchanged Pred Gain

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |

Percentage Unchanged Predictions Highpass Filter

Threshold: 0.95

Percent Unchanged Pred Highpass Filter

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.99 | 1.00 | 1.00 | 1.00 | 0.99 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 0.99 | 1.00 | 1.00 | 1.00 | 0.99 |

Percentage Unchanged Predictions Lowpass Filter

Threshold: 0.95

Percent Unchanged Pred Lowpass Filter

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| mean | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |

Percentage Unchanged Predictions Prepend Zeros

Threshold: 0.95

Percent Unchanged Pred Prepend Zeros

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.98 | 0.99 | 0.98 | 0.98 | 0.98 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 1.00 | 1.00 | 1.00 | 0.99 | 1.00 |
| mean | 0.99 | 0.99 | 0.99 | 0.98 | 0.99 |

Percentage Unchanged Predictions White Noise

Threshold: 0.95

Percent Unchanged Pred White Noise

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | 0.91 | 0.91 | 0.99 | 0.87 | 0.77 |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | 0.80 | 0.72 | 0.90 | 0.74 | 0.76 |
| mean | 0.85 | 0.81 | 0.95 | 0.80 | 0.77 |
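The overall scores at the top of the page can be cross-checked against the per-test values. The sketch below assumes that each of the ten perturbations counts as one test per dataset (20 tests per model), that the `mean` rows are not tests themselves, and that a test passes when its value is at or above the threshold of 0.95. It uses the rounded values printed on this page, so it is only a consistency check, not the benchmark's own scoring code.

```python
MODELS = ["w2v2-b", "w2v2-L", "w2v2-L-robust", "w2v2-L-xls-r", "w2v2-L-vox"]
THRESHOLD = 0.95

# Per test: [iemocap row, msppodcast row], one value per model in MODELS order.
RESULTS = {
    "Additive Tone": [[0.97, 1.00, 1.00, 0.99, 0.98], [0.99, 1.00, 1.00, 1.00, 1.00]],
    "Append Zeros": [[1.00, 1.00, 1.00, 1.00, 1.00], [1.00, 1.00, 1.00, 1.00, 1.00]],
    "Clip": [[0.99, 0.99, 1.00, 0.99, 0.99], [1.00, 1.00, 1.00, 1.00, 1.00]],
    "Crop Beginning": [[0.99, 0.99, 0.99, 0.99, 0.99], [1.00, 1.00, 1.00, 1.00, 1.00]],
    "Crop End": [[1.00, 1.00, 1.00, 1.00, 1.00], [1.00, 1.00, 1.00, 1.00, 1.00]],
    "Gain": [[1.00, 1.00, 1.00, 1.00, 1.00], [1.00, 1.00, 1.00, 1.00, 1.00]],
    "Highpass Filter": [[0.99, 1.00, 1.00, 1.00, 0.99], [1.00, 1.00, 1.00, 1.00, 1.00]],
    "Lowpass Filter": [[1.00, 1.00, 1.00, 1.00, 1.00], [1.00, 1.00, 1.00, 1.00, 1.00]],
    "Prepend Zeros": [[0.98, 0.99, 0.98, 0.98, 0.98], [1.00, 1.00, 1.00, 0.99, 1.00]],
    "White Noise": [[0.91, 0.91, 0.99, 0.87, 0.77], [0.80, 0.72, 0.90, 0.74, 0.76]],
}


def overall_scores(results=RESULTS, threshold=THRESHOLD):
    """Return {model: (passed, failed)} counted over all test/dataset pairs."""
    scores = {}
    for column, model in enumerate(MODELS):
        values = [row[column] for rows in results.values() for row in rows]
        passed = sum(1 for value in values if value >= threshold)
        scores[model] = (passed, len(values) - passed)
    return scores


for model, (passed, failed) in overall_scores().items():
    total = passed + failed
    print(
        f"{model}: {100 * passed / total:.1f}% passed tests "
        f"({passed} passed / {failed} failed)."
    )
```

Under these assumptions the counts match the reported overall scores: the only failures come from the white noise test, where w2v2-L-robust fails on one dataset and every other model fails on both.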

Visualization

| Data | w2v2-b | w2v2-L | w2v2-L-robust | w2v2-L-xls-r | w2v2-L-vox |
| --- | --- | --- | --- | --- | --- |
| iemocap-2.3.0-emotion.dimensions.test.gold_standard | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard57.png | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard72.png | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard73.png | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard74.png | ../../../_images/visualization_iemocap-2.3.0-emotion.dimensions.test.gold_standard75.png |
| msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard87.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard110.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard111.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard112.png | ../../../_images/visualization_msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard113.png |