Robustness spectral tilt¶
w2v2-b |
w2v2-L |
w2v2-L-robust |
w2v2-L-xls-r |
w2v2-L-vox |
|
---|---|---|---|---|---|
Overall Score |
75.0% passed tests (6 passed / 2 failed). |
62.5% passed tests (5 passed / 3 failed). |
87.5% passed tests (7 passed / 1 failed). |
75.0% passed tests (6 passed / 2 failed). |
75.0% passed tests (6 passed / 2 failed). |
Change Ccc Downward Tilt¶
Data |
Change CCC Downward Tilt |
||||
---|---|---|---|---|---|
w2v2-b |
w2v2-L |
w2v2-L-robust |
w2v2-L-xls-r |
w2v2-L-vox |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
-0.01 |
0.00 |
-0.00 |
-0.00 |
-0.00 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
-0.00 |
0.00 |
-0.01 |
-0.00 |
-0.01 |
mean |
-0.01 |
0.00 |
-0.01 |
0.00 |
-0.01 |
Change Ccc Upward Tilt¶
Data |
Change CCC Upward Tilt |
||||
---|---|---|---|---|---|
w2v2-b |
w2v2-L |
w2v2-L-robust |
w2v2-L-xls-r |
w2v2-L-vox |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
-0.02 |
-0.06 |
-0.03 |
-0.03 |
-0.01 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
-0.00 |
-0.03 |
-0.02 |
-0.02 |
-0.03 |
mean |
-0.01 |
-0.04 |
-0.03 |
-0.03 |
-0.02 |
Percentage Unchanged Predictions Downward Tilt¶
Data |
Percentage Unchanged Predictions Downward Tilt |
||||
---|---|---|---|---|---|
w2v2-b |
w2v2-L |
w2v2-L-robust |
w2v2-L-xls-r |
w2v2-L-vox |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.97 |
0.98 |
1.00 |
0.98 |
0.99 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.99 |
0.97 |
0.97 |
0.95 |
0.96 |
mean |
0.98 |
0.97 |
0.98 |
0.96 |
0.97 |
Percentage Unchanged Predictions Upward Tilt¶
Data |
Percentage Unchanged Predictions Upward Tilt |
||||
---|---|---|---|---|---|
w2v2-b |
w2v2-L |
w2v2-L-robust |
w2v2-L-xls-r |
w2v2-L-vox |
|
iemocap-2.3.0-emotion.dimensions.test.gold_standard |
0.61 |
0.42 |
0.62 |
0.63 |
0.72 |
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard |
0.78 |
0.75 |
0.81 |
0.73 |
0.70 |
mean |
0.70 |
0.58 |
0.72 |
0.68 |
0.71 |
Visualization Downward Tilt¶
Difference of predictions for original audio and audio with a downward spectral tilt. The allowed prediction difference \(\delta < 0.05\) is highlighted in green in the upper plot. The lower plot shows the distributions of the two predictions.
w2v2-b |
w2v2-L |
w2v2-L-robust |
w2v2-L-xls-r |
w2v2-L-vox |
---|---|---|---|---|