Fairness linguistic sentiment

Overall scores

| | w2v2-L-cat | hubert-L-cat | wavlm-cat | data2vec-cat |
|---|---|---|---|---|
| Overall Score | 88.5% passed tests (85 passed / 11 failed). | 85.4% passed tests (82 passed / 14 failed). | 85.4% passed tests (82 passed / 14 failed). | 86.5% passed tests (83 passed / 13 failed). |
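
The percentages follow directly from the pass and fail counts. A minimal sketch of that arithmetic, where `pass_rate` is a hypothetical helper name and not part of the report's tooling:

```python
def pass_rate(passed: int, failed: int) -> float:
    """Return the percentage of passed tests, rounded to one decimal place."""
    total = passed + failed
    return round(100 * passed / total, 1)


# (passed, failed) counts copied from the overall scores above.
counts = {
    "w2v2-L-cat": (85, 11),
    "hubert-L-cat": (82, 14),
    "wavlm-cat": (82, 14),
    "data2vec-cat": (83, 13),
}

rates = {model: pass_rate(passed, failed) for model, (passed, failed) in counts.items()}
```

Each model is evaluated on 96 tests in total, so the four pass rates are directly comparable.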

Class Proportion Shift Difference Negative Sentiment

The test score is the shift in class proportions for negative sentiment for a specific language minus the average shift in class proportions for negative sentiment across all languages. The full expression leading to the test score is displayed in parentheses.

Threshold: 0.075
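
The score can be sketched as follows. This is an illustration under stated assumptions, not the report's implementation: the function names are hypothetical, the input shifts are made-up values, and the pass criterion (absolute score not exceeding the threshold) is an assumption that the report does not spell out.

```python
from __future__ import annotations

THRESHOLD = 0.075  # threshold stated for this test


def shift_differences(shifts: dict[str, float]) -> dict[str, float]:
    """Subtract the average shift over all languages from each language's shift."""
    mean_shift = sum(shifts.values()) / len(shifts)
    return {lang: shift - mean_shift for lang, shift in shifts.items()}


def passes(score: float, threshold: float = THRESHOLD) -> bool:
    """Assumed pass criterion: the absolute score must not exceed the threshold."""
    return abs(score) <= threshold


# Hypothetical shifts for one model and one class (not values from the report).
example_shifts = {"de": 0.00, "en": 0.20, "es": 0.02, "fr": 0.02}
scores = shift_differences(example_shifts)
results = {lang: passes(score) for lang, score in scores.items()}
```

In this made-up example only `en` deviates from the cross-language average by more than the threshold, so it is the only language flagged.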

| Data | anger: w2v2-L-cat | anger: hubert-L-cat | anger: wavlm-cat | anger: data2vec-cat | happiness: w2v2-L-cat | happiness: hubert-L-cat | happiness: wavlm-cat | happiness: data2vec-cat | neutral: w2v2-L-cat | neutral: hubert-L-cat | neutral: wavlm-cat | neutral: data2vec-cat | sadness: w2v2-L-cat | sadness: hubert-L-cat | sadness: wavlm-cat | sadness: data2vec-cat |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| checklist-synth-1.0.0-words-in-context-de | -0.01 (0.00 - 0.01) | -0.01 (0.00 - 0.01) | -0.01 (0.00 - 0.01) | -0.01 (-0.00 - 0.01) | 0.02 (-0.00 - -0.03) | 0.05 (-0.01 - -0.05) | 0.07 (-0.00 - -0.07) | 0.05 (-0.00 - -0.05) | 0.02 (-0.01 - -0.03) | -0.00 (-0.00 - 0.00) | 0.01 (0.01 - -0.00) | -0.04 (-0.04 - -0.00) | -0.03 (0.01 - 0.04) | -0.03 (0.01 - 0.04) | -0.06 (-0.00 - 0.06) | 0.00 (0.04 - 0.04) |
| checklist-synth-1.0.0-words-in-context-en | 0.01 (0.02 - 0.01) | 0.05 (0.06 - 0.01) | 0.02 (0.03 - 0.01) | 0.03 (0.04 - 0.01) | -0.09 (-0.12 - -0.03) | -0.26 (-0.31 - -0.05) | -0.29 (-0.36 - -0.07) | -0.20 (-0.25 - -0.05) | -0.08 (-0.10 - -0.03) | -0.04 (-0.04 - 0.00) | -0.07 (-0.07 - -0.00) | -0.04 (-0.05 - -0.00) | 0.16 (0.20 - 0.04) | 0.25 (0.29 - 0.04) | 0.34 (0.40 - 0.06) | 0.22 (0.26 - 0.04) |
| checklist-synth-1.0.0-words-in-context-es | -0.01 (0.00 - 0.01) | -0.01 (0.00 - 0.01) | 0.00 (0.01 - 0.01) | 0.00 (0.01 - 0.01) | 0.03 (0.00 - -0.03) | 0.04 (-0.01 - -0.05) | 0.06 (-0.01 - -0.07) | 0.04 (-0.00 - -0.05) | 0.00 (-0.02 - -0.03) | 0.01 (0.01 - 0.00) | -0.00 (-0.00 - -0.00) | -0.02 (-0.02 - -0.00) | -0.02 (0.02 - 0.04) | -0.04 (0.00 - 0.04) | -0.06 (0.00 - 0.06) | -0.03 (0.02 - 0.04) |
| checklist-synth-1.0.0-words-in-context-fr | -0.01 (0.00 - 0.01) | -0.01 (-0.00 - 0.01) | -0.01 (0.00 - 0.01) | -0.01 (-0.01 - 0.01) | 0.02 (-0.01 - -0.03) | 0.05 (-0.00 - -0.05) | 0.05 (-0.02 - -0.07) | 0.04 (-0.00 - -0.05) | 0.02 (-0.00 - -0.03) | 0.00 (0.00 - 0.00) | 0.02 (0.02 - -0.00) | 0.01 (0.00 - -0.00) | -0.03 (0.01 - 0.04) | -0.04 (0.00 - 0.04) | -0.06 (-0.00 - 0.06) | -0.04 (0.00 - 0.04) |
| checklist-synth-1.0.0-words-in-context-it | 0.03 (0.04 - 0.01) | 0.01 (0.02 - 0.01) | 0.02 (0.03 - 0.01) | 0.00 (0.01 - 0.01) | -0.01 (-0.03 - -0.03) | 0.03 (-0.02 - -0.05) | 0.02 (-0.05 - -0.07) | 0.05 (0.01 - -0.05) | 0.01 (-0.02 - -0.03) | 0.00 (0.00 - 0.00) | 0.00 (0.00 - -0.00) | -0.02 (-0.02 - -0.00) | -0.03 (0.01 - 0.04) | -0.04 (0.00 - 0.04) | -0.05 (0.01 - 0.06) | -0.03 (0.01 - 0.04) |
| checklist-synth-1.0.0-words-in-context-ja | 0.01 (0.02 - 0.01) | -0.03 (-0.02 - 0.01) | 0.01 (0.02 - 0.01) | -0.01 (-0.00 - 0.01) | -0.01 (-0.04 - -0.03) | 0.03 (-0.02 - -0.05) | 0.04 (-0.03 - -0.07) | 0.01 (-0.04 - -0.05) | 0.04 (0.01 - -0.03) | 0.02 (0.02 - 0.00) | 0.01 (0.00 - -0.00) | 0.04 (0.04 - -0.00) | -0.04 (0.00 - 0.04) | -0.03 (0.01 - 0.04) | -0.06 (0.00 - 0.06) | -0.04 (0.00 - 0.04) |
| checklist-synth-1.0.0-words-in-context-pt | -0.01 (-0.00 - 0.01) | -0.01 (0.00 - 0.01) | -0.00 (0.01 - 0.01) | -0.01 (-0.00 - 0.01) | 0.02 (-0.01 - -0.03) | 0.05 (-0.01 - -0.05) | 0.05 (-0.02 - -0.07) | 0.04 (-0.01 - -0.05) | -0.04 (-0.06 - -0.03) | -0.02 (-0.02 - 0.00) | -0.05 (-0.05 - -0.00) | 0.01 (0.01 - -0.00) | 0.03 (0.07 - 0.04) | -0.02 (0.02 - 0.04) | 0.00 (0.06 - 0.06) | -0.04 (0.00 - 0.04) |
| checklist-synth-1.0.0-words-in-context-zh | -0.01 (0.00 - 0.01) | 0.00 (0.01 - 0.01) | -0.01 (0.00 - 0.01) | 0.01 (0.01 - 0.01) | 0.03 (0.00 - -0.03) | 0.01 (-0.04 - -0.05) | -0.02 (-0.09 - -0.07) | -0.02 (-0.07 - -0.05) | 0.02 (-0.00 - -0.03) | 0.03 (0.03 - 0.00) | 0.09 (0.09 - -0.00) | 0.06 (0.06 - -0.00) | -0.04 (-0.00 - 0.04) | -0.04 (-0.00 - 0.04) | -0.06 (0.00 - 0.06) | -0.04 (0.00 - 0.04) |
| mean | 0.00 | -0.00 | 0.00 | -0.00 | 0.00 | -0.00 | -0.00 | 0.00 | -0.00 | 0.00 | 0.00 | 0.00 | -0.00 | 0.00 | -0.00 | -0.00 |
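
The mean row is zero in every column, which is what the score definition implies if the mean is taken over the same languages that enter the average. A short derivation, where the symbols are notation introduced here and not taken from the report ($s_\ell$ is the shift for language $\ell$, $L$ the number of languages, $d_\ell$ the test score):

```latex
\[
  d_\ell = s_\ell - \bar{s},
  \qquad
  \bar{s} = \frac{1}{L} \sum_{k=1}^{L} s_k
\]
\[
  \frac{1}{L} \sum_{\ell=1}^{L} d_\ell
  = \frac{1}{L} \sum_{\ell=1}^{L} s_\ell - \bar{s}
  = \bar{s} - \bar{s}
  = 0
\]
```

The entries shown as -0.00 are therefore most likely rounding residue of values that are zero up to floating-point error.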

Class Proportion Shift Difference Neutral Sentiment

The test score is the shift in class proportions for neutral sentiment for a specific language minus the average shift in class proportions for neutral sentiment across all languages. The full expression leading to the test score is displayed in parentheses.

Threshold: 0.075

| Data | anger: w2v2-L-cat | anger: hubert-L-cat | anger: wavlm-cat | anger: data2vec-cat | happiness: w2v2-L-cat | happiness: hubert-L-cat | happiness: wavlm-cat | happiness: data2vec-cat | neutral: w2v2-L-cat | neutral: hubert-L-cat | neutral: wavlm-cat | neutral: data2vec-cat | sadness: w2v2-L-cat | sadness: hubert-L-cat | sadness: wavlm-cat | sadness: data2vec-cat |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| checklist-synth-1.0.0-words-in-context-de | 0.01 (-0.01 - -0.02) | 0.01 (0.00 - -0.01) | 0.01 (-0.00 - -0.01) | 0.01 (0.00 - -0.01) | 0.00 (0.00 - -0.00) | 0.04 (-0.01 - -0.05) | 0.05 (-0.00 - -0.05) | 0.01 (-0.00 - -0.02) | 0.01 (0.09 - 0.07) | 0.01 (0.11 - 0.10) | -0.07 (0.04 - 0.11) | 0.03 (0.10 - 0.07) | -0.03 (-0.08 - -0.06) | -0.06 (-0.10 - -0.04) | 0.02 (-0.03 - -0.05) | -0.06 (-0.10 - -0.04) |
| checklist-synth-1.0.0-words-in-context-en | 0.00 (-0.01 - -0.02) | -0.04 (-0.05 - -0.01) | -0.01 (-0.02 - -0.01) | -0.02 (-0.03 - -0.01) | -0.16 (-0.16 - -0.00) | -0.33 (-0.38 - -0.05) | -0.26 (-0.32 - -0.05) | -0.26 (-0.28 - -0.02) | 0.29 (0.36 - 0.07) | 0.54 (0.64 - 0.10) | 0.50 (0.61 - 0.11) | 0.43 (0.50 - 0.07) | -0.13 (-0.19 - -0.06) | -0.16 (-0.21 - -0.04) | -0.23 (-0.28 - -0.05) | -0.15 (-0.19 - -0.04) |
| checklist-synth-1.0.0-words-in-context-es | 0.02 (-0.00 - -0.02) | 0.01 (-0.00 - -0.01) | -0.00 (-0.01 - -0.01) | 0.00 (-0.01 - -0.01) | -0.01 (-0.01 - -0.00) | 0.04 (-0.01 - -0.05) | 0.02 (-0.04 - -0.05) | 0.01 (-0.00 - -0.02) | 0.04 (0.11 - 0.07) | -0.09 (0.01 - 0.10) | -0.06 (0.05 - 0.11) | -0.04 (0.03 - 0.07) | -0.05 (-0.10 - -0.06) | 0.04 (0.00 - -0.04) | 0.05 (-0.00 - -0.05) | 0.02 (-0.02 - -0.04) |
| checklist-synth-1.0.0-words-in-context-fr | 0.01 (-0.00 - -0.02) | 0.01 (-0.00 - -0.01) | 0.01 (0.00 - -0.01) | 0.01 (0.00 - -0.01) | -0.00 (-0.00 - -0.00) | 0.04 (-0.01 - -0.05) | 0.03 (-0.02 - -0.05) | 0.01 (-0.01 - -0.02) | -0.07 (0.00 - 0.07) | -0.09 (0.01 - 0.10) | -0.09 (0.02 - 0.11) | -0.06 (0.01 - 0.07) | 0.06 (0.00 - -0.06) | 0.05 (0.00 - -0.04) | 0.05 (-0.00 - -0.05) | 0.04 (-0.00 - -0.04) |
| checklist-synth-1.0.0-words-in-context-it | -0.06 (-0.08 - -0.02) | -0.01 (-0.01 - -0.01) | -0.00 (-0.01 - -0.01) | 0.01 (-0.00 - -0.01) | 0.08 (0.08 - -0.00) | 0.07 (0.02 - -0.05) | 0.04 (-0.01 - -0.05) | 0.04 (0.02 - -0.02) | -0.09 (-0.02 - 0.07) | -0.11 (-0.02 - 0.10) | -0.09 (0.03 - 0.11) | -0.08 (-0.02 - 0.07) | 0.07 (0.02 - -0.06) | 0.05 (0.01 - -0.04) | 0.04 (-0.01 - -0.05) | 0.04 (-0.00 - -0.04) |
| checklist-synth-1.0.0-words-in-context-ja | 0.02 (0.00 - -0.02) | 0.04 (0.03 - -0.01) | -0.02 (-0.04 - -0.01) | -0.01 (-0.02 - -0.01) | -0.00 (-0.00 - -0.00) | 0.02 (-0.03 - -0.05) | 0.05 (-0.00 - -0.05) | 0.10 (0.09 - -0.02) | -0.06 (0.01 - 0.07) | -0.08 (0.02 - 0.10) | -0.07 (0.04 - 0.11) | -0.14 (-0.08 - 0.07) | 0.04 (-0.01 - -0.06) | 0.02 (-0.02 - -0.04) | 0.04 (-0.01 - -0.05) | 0.05 (0.00 - -0.04) |
| checklist-synth-1.0.0-words-in-context-pt | -0.01 (-0.02 - -0.02) | 0.00 (-0.00 - -0.01) | 0.01 (0.00 - -0.01) | 0.01 (-0.00 - -0.01) | 0.07 (0.07 - -0.00) | 0.04 (-0.01 - -0.05) | -0.01 (-0.06 - -0.05) | 0.00 (-0.01 - -0.02) | -0.04 (0.03 - 0.07) | -0.05 (0.05 - 0.10) | 0.02 (0.13 - 0.11) | -0.03 (0.04 - 0.07) | -0.03 (-0.08 - -0.06) | 0.01 (-0.03 - -0.04) | -0.02 (-0.07 - -0.05) | 0.02 (-0.02 - -0.04) |
| checklist-synth-1.0.0-words-in-context-zh | 0.01 (-0.01 - -0.02) | -0.01 (-0.02 - -0.01) | 0.01 (-0.00 - -0.01) | -0.02 (-0.03 - -0.01) | 0.02 (0.02 - -0.00) | 0.09 (0.04 - -0.05) | 0.09 (0.03 - -0.05) | 0.09 (0.07 - -0.02) | -0.08 (-0.00 - 0.07) | -0.12 (-0.03 - 0.10) | -0.14 (-0.03 - 0.11) | -0.11 (-0.04 - 0.07) | 0.05 (-0.00 - -0.06) | 0.05 (0.00 - -0.04) | 0.05 (0.00 - -0.05) | 0.04 (-0.00 - -0.04) |
| mean | 0.00 | 0.00 | 0.00 | -0.00 | 0.00 | 0.00 | 0.00 | 0.00 | -0.00 | 0.00 | 0.00 | -0.00 | -0.00 | 0.00 | -0.00 | 0.00 |

Class Proportion Shift Difference Positive Sentiment

The test score is the shift in class proportions for positive sentiment for a specific language minus the average shift in class proportions for positive sentiment across all languages. The full expression leading to the test score is displayed in parentheses.

Threshold: 0.075

| Data | anger: w2v2-L-cat | anger: hubert-L-cat | anger: wavlm-cat | anger: data2vec-cat | happiness: w2v2-L-cat | happiness: hubert-L-cat | happiness: wavlm-cat | happiness: data2vec-cat | neutral: w2v2-L-cat | neutral: hubert-L-cat | neutral: wavlm-cat | neutral: data2vec-cat | sadness: w2v2-L-cat | sadness: hubert-L-cat | sadness: wavlm-cat | sadness: data2vec-cat |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| checklist-synth-1.0.0-words-in-context-de | 0.01 (0.00 - -0.01) | 0.01 (0.00 - -0.01) | 0.01 (0.00 - -0.01) | 0.00 (-0.00 - -0.00) | -0.03 (-0.00 - 0.03) | -0.07 (0.01 - 0.08) | -0.09 (0.00 - 0.10) | -0.05 (0.00 - 0.06) | -0.02 (-0.02 - -0.00) | -0.00 (-0.04 - -0.04) | 0.02 (-0.02 - -0.04) | 0.03 (-0.00 - -0.03) | 0.04 (0.02 - -0.02) | 0.06 (0.03 - -0.03) | 0.06 (0.02 - -0.04) | 0.02 (-0.00 - -0.03) |
| checklist-synth-1.0.0-words-in-context-en | -0.01 (-0.01 - -0.01) | -0.04 (-0.05 - -0.01) | -0.01 (-0.02 - -0.01) | -0.03 (-0.03 - -0.00) | 0.16 (0.19 - 0.03) | 0.41 (0.49 - 0.08) | 0.41 (0.50 - 0.10) | 0.32 (0.38 - 0.06) | -0.04 (-0.04 - -0.00) | -0.18 (-0.22 - -0.04) | -0.13 (-0.17 - -0.04) | -0.13 (-0.15 - -0.03) | -0.12 (-0.13 - -0.02) | -0.19 (-0.22 - -0.03) | -0.27 (-0.31 - -0.04) | -0.17 (-0.19 - -0.03) |
| checklist-synth-1.0.0-words-in-context-es | 0.00 (-0.00 - -0.01) | 0.01 (-0.00 - -0.01) | 0.00 (-0.01 - -0.01) | -0.00 (-0.00 - -0.00) | -0.03 (0.00 - 0.03) | -0.06 (0.01 - 0.08) | -0.07 (0.03 - 0.10) | -0.05 (0.01 - 0.06) | -0.02 (-0.02 - -0.00) | 0.03 (-0.01 - -0.04) | 0.03 (-0.02 - -0.04) | 0.03 (0.01 - -0.03) | 0.04 (0.02 - -0.02) | 0.02 (-0.00 - -0.03) | 0.04 (0.00 - -0.04) | 0.02 (-0.01 - -0.03) |
| checklist-synth-1.0.0-words-in-context-fr | 0.01 (-0.00 - -0.01) | 0.01 (0.00 - -0.01) | 0.01 (0.00 - -0.01) | 0.01 (0.01 - -0.00) | -0.02 (0.01 - 0.03) | -0.07 (0.01 - 0.08) | -0.07 (0.03 - 0.10) | -0.05 (0.01 - 0.06) | 0.01 (0.00 - -0.00) | 0.04 (-0.00 - -0.04) | 0.02 (-0.03 - -0.04) | 0.02 (-0.01 - -0.03) | 0.01 (-0.01 - -0.02) | 0.02 (-0.00 - -0.03) | 0.04 (0.00 - -0.04) | 0.02 (-0.00 - -0.03) |
| checklist-synth-1.0.0-words-in-context-it | -0.01 (-0.01 - -0.01) | -0.00 (-0.01 - -0.01) | -0.02 (-0.03 - -0.01) | -0.00 (-0.01 - -0.00) | -0.03 (0.00 - 0.03) | -0.06 (0.01 - 0.08) | -0.04 (0.06 - 0.10) | -0.07 (-0.02 - 0.06) | 0.03 (0.03 - -0.00) | 0.04 (0.00 - -0.04) | 0.03 (-0.02 - -0.04) | 0.06 (0.03 - -0.03) | 0.00 (-0.01 - -0.02) | 0.02 (-0.01 - -0.03) | 0.03 (-0.01 - -0.04) | 0.02 (-0.01 - -0.03) |
| checklist-synth-1.0.0-words-in-context-ja | -0.02 (-0.03 - -0.01) | 0.01 (0.00 - -0.01) | 0.00 (-0.01 - -0.01) | 0.01 (0.01 - -0.00) | 0.02 (0.04 - 0.03) | -0.04 (0.04 - 0.08) | -0.06 (0.03 - 0.10) | -0.05 (0.01 - 0.06) | -0.02 (-0.02 - -0.00) | 0.01 (-0.03 - -0.04) | 0.02 (-0.02 - -0.04) | 0.01 (-0.01 - -0.03) | 0.02 (0.00 - -0.02) | 0.02 (-0.01 - -0.03) | 0.04 (-0.00 - -0.04) | 0.02 (-0.00 - -0.03) |
| checklist-synth-1.0.0-words-in-context-pt | 0.01 (0.01 - -0.01) | 0.01 (-0.00 - -0.01) | -0.00 (-0.01 - -0.01) | 0.01 (0.00 - -0.00) | -0.04 (-0.02 - 0.03) | -0.07 (0.01 - 0.08) | -0.05 (0.04 - 0.10) | -0.04 (0.02 - 0.06) | 0.05 (0.05 - -0.00) | 0.04 (0.00 - -0.04) | 0.05 (0.00 - -0.04) | 0.00 (-0.02 - -0.03) | -0.02 (-0.04 - -0.02) | 0.02 (-0.01 - -0.03) | 0.01 (-0.03 - -0.04) | 0.03 (0.01 - -0.03) |
| checklist-synth-1.0.0-words-in-context-zh | 0.01 (-0.00 - -0.01) | -0.00 (-0.01 - -0.01) | 0.01 (-0.00 - -0.01) | 0.00 (-0.00 - -0.00) | -0.03 (-0.01 - 0.03) | -0.05 (0.03 - 0.08) | -0.02 (0.08 - 0.10) | -0.01 (0.05 - 0.06) | 0.01 (0.00 - -0.00) | 0.02 (-0.02 - -0.04) | -0.04 (-0.08 - -0.04) | -0.02 (-0.04 - -0.03) | 0.02 (0.00 - -0.02) | 0.03 (-0.00 - -0.03) | 0.04 (0.00 - -0.04) | 0.03 (0.00 - -0.03) |
| mean | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | -0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | -0.00 | -0.00 | 0.00 | -0.00 | -0.00 |

Visualization

[Figures: one visualization per dataset (checklist-synth-1.0.0-words-in-context-de, -en, -es, -fr, -it, -ja, -pt, -zh) and per model (w2v2-L-cat, hubert-L-cat, wavlm-cat, data2vec-cat).]