Fairness linguistic sentiment

Overall scores

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

Overall Score

97.9% passed tests (94 passed / 2 failed).

90.6% passed tests (87 passed / 9 failed).

90.6% passed tests (87 passed / 9 failed).

100.0% passed tests (96 passed / 0 failed).

Class Proportion Shift Difference Negative Sentiment

Shift in class proportions for negative sentiment for specific language - Average of the shift in class proportions for negative sentiment for all languages. The full expression leading to the test score is displayed in parentheses.

Threshold: 0.075

Data

anger

happiness

neutral

sadness

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

checklist-synth-1.0.0-words-in-context-de

0.00 (0.00 - -0.00)

-0.01 (-0.00 - 0.01)

-0.01 (0.00 - 0.01)

0.00 (0.00 - -0.00)

0.02 (0.01 - -0.01)

0.02 (-0.00 - -0.03)

0.01 (0.00 - -0.01)

0.00 (0.00 - -0.00)

-0.01 (-0.00 - 0.01)

-0.01 (-0.02 - -0.01)

0.03 (0.00 - -0.03)

-0.03 (-0.01 - 0.02)

-0.00 (-0.00 - 0.00)

-0.00 (0.03 - 0.03)

-0.03 (-0.00 - 0.03)

0.02 (0.01 - -0.01)

checklist-synth-1.0.0-words-in-context-en

0.00 (0.00 - -0.00)

0.01 (0.01 - 0.01)

-0.01 (0.00 - 0.01)

-0.00 (-0.01 - -0.00)

0.02 (0.01 - -0.01)

-0.08 (-0.10 - -0.03)

-0.04 (-0.04 - -0.01)

-0.01 (-0.01 - -0.00)

-0.02 (-0.01 - 0.01)

-0.05 (-0.06 - -0.01)

-0.09 (-0.13 - -0.03)

0.00 (0.02 - 0.02)

-0.00 (0.00 - 0.00)

0.12 (0.15 - 0.03)

0.14 (0.17 - 0.03)

0.01 (-0.00 - -0.01)

checklist-synth-1.0.0-words-in-context-es

0.01 (0.00 - -0.00)

-0.01 (-0.00 - 0.01)

-0.01 (0.00 - 0.01)

0.00 (-0.00 - -0.00)

-0.00 (-0.01 - -0.01)

0.03 (0.00 - -0.03)

0.01 (0.00 - -0.01)

0.00 (0.00 - -0.00)

-0.00 (0.01 - 0.01)

0.01 (-0.00 - -0.01)

0.03 (0.00 - -0.03)

0.01 (0.03 - 0.02)

-0.00 (-0.00 - 0.00)

-0.03 (0.00 - 0.03)

-0.03 (-0.00 - 0.03)

-0.02 (-0.03 - -0.01)

checklist-synth-1.0.0-words-in-context-fr

-0.00 (-0.00 - -0.00)

-0.00 (0.01 - 0.01)

-0.01 (0.00 - 0.01)

0.01 (0.00 - -0.00)

-0.03 (-0.04 - -0.01)

0.02 (-0.01 - -0.03)

0.01 (0.00 - -0.01)

-0.01 (-0.01 - -0.00)

0.04 (0.05 - 0.01)

-0.00 (-0.01 - -0.01)

-0.01 (-0.04 - -0.03)

0.00 (0.02 - 0.02)

-0.00 (0.00 - 0.00)

-0.02 (0.01 - 0.03)

0.01 (0.04 - 0.03)

-0.00 (-0.01 - -0.01)

checklist-synth-1.0.0-words-in-context-it

0.00 (0.00 - -0.00)

0.03 (0.04 - 0.01)

0.04 (0.05 - 0.01)

-0.00 (-0.01 - -0.00)

-0.01 (-0.02 - -0.01)

-0.02 (-0.05 - -0.03)

0.00 (-0.00 - -0.01)

0.00 (-0.00 - -0.00)

0.01 (0.02 - 0.01)

-0.00 (-0.01 - -0.01)

-0.00 (-0.04 - -0.03)

0.00 (0.02 - 0.02)

-0.00 (0.00 - 0.00)

-0.01 (0.02 - 0.03)

-0.04 (-0.01 - 0.03)

0.00 (-0.01 - -0.01)

checklist-synth-1.0.0-words-in-context-ja

-0.02 (-0.02 - -0.00)

-0.01 (-0.00 - 0.01)

-0.00 (0.00 - 0.01)

-0.01 (-0.02 - -0.00)

0.00 (-0.01 - -0.01)

-0.01 (-0.03 - -0.03)

0.02 (0.01 - -0.01)

0.00 (-0.00 - -0.00)

0.01 (0.02 - 0.01)

0.04 (0.03 - -0.01)

0.01 (-0.03 - -0.03)

0.02 (0.03 - 0.02)

0.01 (0.01 - 0.00)

-0.02 (0.01 - 0.03)

-0.02 (0.01 - 0.03)

-0.01 (-0.02 - -0.01)

checklist-synth-1.0.0-words-in-context-pt

0.01 (0.01 - -0.00)

-0.00 (0.01 - 0.01)

-0.01 (-0.00 - 0.01)

0.00 (-0.00 - -0.00)

0.04 (0.03 - -0.01)

0.03 (0.01 - -0.03)

0.01 (0.00 - -0.01)

0.02 (0.01 - -0.00)

-0.05 (-0.04 - 0.01)

-0.01 (-0.02 - -0.01)

-0.00 (-0.03 - -0.03)

-0.01 (0.00 - 0.02)

-0.00 (0.00 - 0.00)

-0.02 (0.01 - 0.03)

0.00 (0.03 - 0.03)

-0.01 (-0.02 - -0.01)

checklist-synth-1.0.0-words-in-context-zh

-0.01 (-0.01 - -0.00)

-0.00 (0.00 - 0.01)

0.00 (0.01 - 0.01)

0.00 (-0.00 - -0.00)

-0.02 (-0.03 - -0.01)

0.00 (-0.02 - -0.03)

-0.01 (-0.01 - -0.01)

-0.01 (-0.02 - -0.00)

0.03 (0.04 - 0.01)

0.03 (0.02 - -0.01)

0.04 (0.00 - -0.03)

0.00 (0.02 - 0.02)

0.00 (0.00 - 0.00)

-0.03 (-0.00 - 0.03)

-0.03 (0.00 - 0.03)

0.01 (0.00 - -0.01)

mean

-0.00

0.00

-0.00

0.00

0.00

-0.00

0.00

-0.00

0.00

0.00

0.00

-0.00

0.00

-0.00

0.00

-0.00

Class Proportion Shift Difference Neutral Sentiment

Shift in class proportions for neutral sentiment for specific language - Average of the shift in class proportions for neutral sentiment for all languages. The full expression leading to the test score is displayed in parentheses.

Threshold: 0.075

Data

anger

happiness

neutral

sadness

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

checklist-synth-1.0.0-words-in-context-de

-0.01 (0.00 - 0.01)

0.01 (-0.00 - -0.01)

-0.00 (0.00 - 0.00)

-0.01 (0.00 - 0.01)

0.01 (-0.01 - -0.02)

-0.00 (-0.01 - -0.00)

0.00 (0.00 - -0.00)

-0.01 (-0.00 - 0.01)

-0.01 (0.00 - 0.01)

-0.01 (0.03 - 0.04)

-0.03 (0.01 - 0.04)

0.03 (0.01 - -0.03)

0.01 (0.01 - 0.00)

0.01 (-0.02 - -0.02)

0.03 (-0.01 - -0.04)

-0.01 (-0.00 - 0.01)

checklist-synth-1.0.0-words-in-context-en

-0.01 (0.00 - 0.01)

-0.00 (-0.01 - -0.01)

-0.00 (0.00 - 0.00)

-0.01 (0.00 - 0.01)

-0.01 (-0.03 - -0.02)

-0.13 (-0.13 - -0.00)

-0.04 (-0.04 - -0.00)

0.03 (0.03 - 0.01)

0.02 (0.03 - 0.01)

0.19 (0.23 - 0.04)

0.15 (0.20 - 0.04)

-0.03 (-0.05 - -0.03)

-0.00 (0.00 - 0.00)

-0.06 (-0.08 - -0.02)

-0.11 (-0.15 - -0.04)

0.01 (0.02 - 0.01)

checklist-synth-1.0.0-words-in-context-es

-0.02 (-0.01 - 0.01)

0.01 (-0.00 - -0.01)

-0.00 (0.00 - 0.00)

-0.01 (0.00 - 0.01)

-0.02 (-0.04 - -0.02)

0.00 (-0.00 - -0.00)

0.00 (0.00 - -0.00)

-0.01 (0.00 - 0.01)

0.04 (0.05 - 0.01)

-0.01 (0.03 - 0.04)

-0.04 (0.00 - 0.04)

-0.06 (-0.09 - -0.03)

-0.00 (-0.00 - 0.00)

0.00 (-0.02 - -0.02)

0.04 (-0.00 - -0.04)

0.07 (0.08 - 0.01)

checklist-synth-1.0.0-words-in-context-fr

-0.02 (-0.01 - 0.01)

-0.04 (-0.05 - -0.01)

-0.00 (-0.00 - 0.00)

-0.01 (-0.00 - 0.01)

0.06 (0.04 - -0.02)

0.02 (0.01 - -0.00)

0.00 (0.00 - -0.00)

0.01 (0.02 - 0.01)

-0.03 (-0.02 - 0.01)

0.02 (0.06 - 0.04)

-0.01 (0.03 - 0.04)

0.04 (0.01 - -0.03)

-0.00 (0.00 - 0.00)

0.00 (-0.02 - -0.02)

0.01 (-0.03 - -0.04)

-0.04 (-0.03 - 0.01)

checklist-synth-1.0.0-words-in-context-it

-0.01 (-0.00 - 0.01)

-0.04 (-0.04 - -0.01)

-0.05 (-0.05 - 0.00)

-0.01 (-0.00 - 0.01)

0.05 (0.03 - -0.02)

0.07 (0.07 - -0.00)

0.01 (0.01 - -0.00)

-0.00 (0.00 - 0.01)

-0.03 (-0.02 - 0.01)

-0.04 (-0.01 - 0.04)

-0.04 (-0.00 - 0.04)

0.05 (0.02 - -0.03)

-0.00 (0.00 - 0.00)

0.01 (-0.02 - -0.02)

0.08 (0.04 - -0.04)

-0.03 (-0.02 - 0.01)

checklist-synth-1.0.0-words-in-context-ja

0.09 (0.10 - 0.01)

0.11 (0.10 - -0.01)

0.10 (0.10 - 0.00)

0.04 (0.05 - 0.01)

-0.09 (-0.12 - -0.02)

-0.05 (-0.06 - -0.00)

-0.06 (-0.06 - -0.00)

-0.04 (-0.03 - 0.01)

0.01 (0.02 - 0.01)

-0.07 (-0.04 - 0.04)

-0.07 (-0.03 - 0.04)

0.04 (0.01 - -0.03)

-0.00 (0.00 - 0.00)

0.02 (-0.01 - -0.02)

0.02 (-0.01 - -0.04)

-0.04 (-0.03 - 0.01)

checklist-synth-1.0.0-words-in-context-pt

-0.04 (-0.03 - 0.01)

-0.00 (-0.01 - -0.01)

0.01 (0.01 - 0.00)

-0.00 (0.01 - 0.01)

-0.02 (-0.04 - -0.02)

0.00 (-0.00 - -0.00)

0.00 (-0.00 - -0.00)

-0.00 (0.01 - 0.01)

0.07 (0.07 - 0.01)

-0.00 (0.04 - 0.04)

0.09 (0.14 - 0.04)

-0.03 (-0.05 - -0.03)

-0.00 (0.00 - 0.00)

-0.00 (-0.02 - -0.02)

-0.10 (-0.14 - -0.04)

0.03 (0.04 - 0.01)

checklist-synth-1.0.0-words-in-context-zh

0.04 (0.05 - 0.01)

-0.03 (-0.04 - -0.01)

-0.06 (-0.06 - 0.00)

0.01 (0.02 - 0.01)

0.02 (0.00 - -0.02)

0.08 (0.08 - -0.00)

0.07 (0.07 - -0.00)

0.03 (0.03 - 0.01)

-0.06 (-0.05 - 0.01)

-0.08 (-0.04 - 0.04)

-0.05 (-0.01 - 0.04)

-0.04 (-0.06 - -0.03)

-0.00 (-0.00 - 0.00)

0.03 (0.00 - -0.02)

0.03 (-0.00 - -0.04)

0.00 (0.01 - 0.01)

mean

0.00

0.00

0.00

0.00

0.00

-0.00

-0.00

0.00

0.00

0.00

-0.00

-0.00

0.00

0.00

0.00

-0.00

Class Proportion Shift Difference Positive Sentiment

Shift in class proportions for positive sentiment for specific language - Average of the shift in class proportions for positive sentiment for all languages. The full expression leading to the test score is displayed in parentheses.

Threshold: 0.075

Data

anger

happiness

neutral

sadness

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

checklist-synth-1.0.0-words-in-context-de

0.00 (0.00 - -0.00)

0.01 (0.00 - -0.01)

0.01 (0.00 - -0.01)

-0.00 (0.00 - 0.00)

-0.02 (-0.00 - 0.02)

-0.02 (0.01 - 0.03)

-0.01 (0.00 - 0.01)

0.00 (0.00 - 0.00)

0.02 (0.00 - -0.01)

0.02 (0.01 - -0.00)

-0.02 (-0.01 - 0.02)

0.02 (0.01 - -0.01)

0.00 (-0.00 - -0.00)

-0.00 (-0.02 - -0.02)

0.02 (0.01 - -0.02)

-0.02 (-0.01 - 0.01)

checklist-synth-1.0.0-words-in-context-en

0.00 (0.00 - -0.00)

-0.00 (-0.01 - -0.01)

0.01 (0.00 - -0.01)

0.01 (0.01 - 0.00)

-0.01 (0.01 - 0.02)

0.13 (0.16 - 0.03)

0.06 (0.06 - 0.01)

-0.00 (-0.00 - 0.00)

0.01 (-0.01 - -0.01)

-0.03 (-0.03 - -0.00)

0.03 (0.05 - 0.02)

0.01 (-0.00 - -0.01)

0.00 (0.00 - -0.00)

-0.10 (-0.12 - -0.02)

-0.10 (-0.12 - -0.02)

-0.01 (-0.00 - 0.01)

checklist-synth-1.0.0-words-in-context-es

0.00 (-0.00 - -0.00)

0.01 (0.00 - -0.01)

0.01 (0.00 - -0.01)

-0.00 (0.00 - 0.00)

0.01 (0.03 - 0.02)

-0.03 (-0.00 - 0.03)

-0.01 (0.00 - 0.01)

-0.00 (-0.00 - 0.00)

-0.01 (-0.03 - -0.01)

-0.00 (-0.01 - -0.00)

-0.02 (-0.00 - 0.02)

0.01 (0.01 - -0.01)

0.00 (0.00 - -0.00)

0.03 (0.01 - -0.02)

0.02 (0.00 - -0.02)

-0.01 (-0.01 - 0.01)

checklist-synth-1.0.0-words-in-context-fr

0.01 (0.01 - -0.00)

0.02 (0.01 - -0.01)

0.01 (-0.00 - -0.01)

-0.00 (-0.00 - 0.00)

0.01 (0.03 - 0.02)

-0.03 (0.00 - 0.03)

-0.01 (0.00 - 0.01)

0.00 (0.01 - 0.00)

-0.03 (-0.04 - -0.01)

-0.01 (-0.01 - -0.00)

0.02 (0.04 - 0.02)

-0.02 (-0.03 - -0.01)

0.00 (0.00 - -0.00)

0.02 (-0.00 - -0.02)

-0.02 (-0.03 - -0.02)

0.02 (0.02 - 0.01)

checklist-synth-1.0.0-words-in-context-it

0.00 (-0.00 - -0.00)

-0.02 (-0.02 - -0.01)

-0.03 (-0.03 - -0.01)

0.01 (0.01 - 0.00)

-0.01 (0.01 - 0.02)

-0.01 (0.02 - 0.03)

-0.01 (0.00 - 0.01)

0.00 (0.00 - 0.00)

0.00 (-0.01 - -0.01)

0.02 (0.02 - -0.00)

0.02 (0.04 - 0.02)

-0.02 (-0.03 - -0.01)

0.00 (0.00 - -0.00)

0.00 (-0.02 - -0.02)

0.01 (-0.01 - -0.02)

0.01 (0.02 - 0.01)

checklist-synth-1.0.0-words-in-context-ja

-0.01 (-0.02 - -0.00)

-0.03 (-0.04 - -0.01)

-0.04 (-0.04 - -0.01)

-0.00 (-0.00 - 0.00)

0.03 (0.05 - 0.02)

0.03 (0.06 - 0.03)

0.00 (0.01 - 0.01)

0.01 (0.01 - 0.00)

-0.01 (-0.03 - -0.01)

-0.01 (-0.02 - -0.00)

0.02 (0.04 - 0.02)

-0.03 (-0.04 - -0.01)

-0.01 (-0.01 - -0.00)

0.01 (-0.01 - -0.02)

0.01 (-0.00 - -0.02)

0.02 (0.03 - 0.01)

checklist-synth-1.0.0-words-in-context-pt

0.00 (-0.00 - -0.00)

0.00 (-0.00 - -0.01)

0.01 (-0.00 - -0.01)

-0.00 (-0.00 - 0.00)

-0.03 (-0.01 - 0.02)

-0.04 (-0.01 - 0.03)

-0.01 (-0.00 - 0.01)

-0.02 (-0.02 - 0.00)

0.03 (0.01 - -0.01)

0.01 (0.01 - -0.00)

-0.03 (-0.02 - 0.02)

0.02 (0.02 - -0.01)

0.00 (0.00 - -0.00)

0.02 (0.00 - -0.02)

0.03 (0.02 - -0.02)

-0.00 (0.00 - 0.01)

checklist-synth-1.0.0-words-in-context-zh

-0.01 (-0.01 - -0.00)

0.02 (0.01 - -0.01)

0.02 (0.02 - -0.01)

-0.00 (-0.00 - 0.00)

0.01 (0.03 - 0.02)

-0.04 (-0.01 - 0.03)

-0.02 (-0.01 - 0.01)

0.00 (0.00 - 0.00)

-0.00 (-0.02 - -0.01)

0.00 (-0.00 - -0.00)

-0.02 (-0.00 - 0.02)

0.01 (0.01 - -0.01)

0.00 (-0.00 - -0.00)

0.02 (-0.00 - -0.02)

0.02 (-0.00 - -0.02)

-0.01 (-0.01 - 0.01)

mean

-0.00

0.00

-0.00

0.00

-0.00

-0.00

-0.00

-0.00

0.00

0.00

-0.00

-0.00

-0.00

0.00

-0.00

0.00

Visualization

CNN14-cat

w2v2-b-cat

hubert-b-cat

axlstm-cat

../../../_images/visualization_checklist-synth-1.0.0-words-in-context-de27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-de22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-de28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-de29.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-en27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-en22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-en28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-en29.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-es27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-es22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-es28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-es29.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-fr27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-fr22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-fr28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-fr29.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-it27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-it22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-it28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-it29.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-ja27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-ja22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-ja28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-ja29.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-pt27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-pt22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-pt28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-pt29.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-zh27.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-zh22.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-zh28.png
../../../_images/visualization_checklist-synth-1.0.0-words-in-context-zh29.png