Correctness regression

Overall scores
	CNN14	w2v2-b	hubert-b	axlstm
Overall Score	44.4% passed tests (4 passed / 5 failed).	66.7% passed tests (6 passed / 3 failed).	66.7% passed tests (6 passed / 3 failed).	44.4% passed tests (4 passed / 5 failed).

Concordance Correlation Coeff

Threshold: 0.5
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.24	0.51	0.52	0.39
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.56	0.63	0.62	0.52
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard	0.36	0.44	0.42	0.35
mean	0.39	0.53	0.52	0.42
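
For reference, the concordance correlation coefficient can be computed as in the sketch below. It uses the standard definition (twice the covariance divided by the sum of both variances plus the squared difference of the means), and it is not taken from the code that produced this report. The function name `concordance_cc` and the sample values are hypothetical.

```python
from statistics import fmean


def concordance_cc(truth, prediction):
    """Concordance correlation coefficient (population statistics).

    Hypothetical helper for illustration; not the report's own code.
    """
    if len(truth) != len(prediction) or not truth:
        raise ValueError("inputs must be non-empty and of equal length")

    mean_t = fmean(truth)
    mean_p = fmean(prediction)

    # Population variances and covariance (divide by N, not N - 1).
    var_t = fmean([(t - mean_t) ** 2 for t in truth])
    var_p = fmean([(p - mean_p) ** 2 for p in prediction])
    cov = fmean([(t - mean_t) * (p - mean_p) for t, p in zip(truth, prediction)])

    denominator = var_t + var_p + (mean_t - mean_p) ** 2
    if denominator == 0:
        # Both inputs are constant and equal; the coefficient is undefined.
        return float("nan")
    return 2 * cov / denominator


# Made-up values: the second prediction is perfectly correlated with the
# truth but shifted by a constant offset of 0.2.
truth = [0.1, 0.5, 0.9]
ccc_identical = concordance_cc(truth, [0.1, 0.5, 0.9])
ccc_shifted = concordance_cc(truth, [0.3, 0.7, 1.1])
```

Unlike the Pearson correlation, which would stay at 1 for the shifted prediction, the concordance value drops below 1 because the metric also penalizes a bias between prediction and truth. That is consistent with the concordance scores in this report never exceeding the corresponding Pearson scores.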

Mean Absolute Error

Threshold: 0.1
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.18	0.14	0.15	0.14
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.09	0.09	0.09	0.09
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard	0.10	0.09	0.10	0.09
mean	0.12	0.11	0.11	0.11

Pearson Correlation Coeff

Threshold: 0.5
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.25	0.52	0.54	0.45
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.56	0.64	0.63	0.52
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard	0.39	0.45	0.44	0.36
mean	0.40	0.54	0.53	0.44
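
The overall scores at the top of this section can be reproduced from the three tables with the sketch below. It rests on assumptions inferred from the numbers and not stated in the report: each dataset and metric pair counts as one test (nine per model), the two correlation metrics pass when the value is at or above the threshold, the mean absolute error passes when it is at or below the threshold, and the `mean` rows are not tests. The tables show rounded values, so a displayed 0.10 may have been strictly below 0.1 in the original run. The names `THRESHOLDS`, `SCORES`, `passes`, and `tally` are hypothetical.

```python
# Threshold and pass direction per metric (direction is an assumption).
THRESHOLDS = {
    "ccc": (0.5, "min"),  # pass if value >= 0.5
    "mae": (0.1, "max"),  # pass if value <= 0.1
    "pcc": (0.5, "min"),  # pass if value >= 0.5
}

# Values copied from the tables, in the order:
# iemocap test, msppodcast test-1, msppodcast test-2.
SCORES = {
    "CNN14": {"ccc": [0.24, 0.56, 0.36], "mae": [0.18, 0.09, 0.10], "pcc": [0.25, 0.56, 0.39]},
    "w2v2-b": {"ccc": [0.51, 0.63, 0.44], "mae": [0.14, 0.09, 0.09], "pcc": [0.52, 0.64, 0.45]},
    "hubert-b": {"ccc": [0.52, 0.62, 0.42], "mae": [0.15, 0.09, 0.10], "pcc": [0.54, 0.63, 0.44]},
    "axlstm": {"ccc": [0.39, 0.52, 0.35], "mae": [0.14, 0.09, 0.09], "pcc": [0.45, 0.52, 0.36]},
}


def passes(metric, value):
    """Return True if a single test value meets its threshold."""
    threshold, kind = THRESHOLDS[metric]
    return value >= threshold if kind == "min" else value <= threshold


def tally(model):
    """Return (passed, failed) over all tests of one model."""
    results = [
        passes(metric, value)
        for metric, values in SCORES[model].items()
        for value in values
    ]
    passed = sum(results)
    return passed, len(results) - passed


for model in SCORES:
    passed, failed = tally(model)
    share = 100 * passed / (passed + failed)
    print(f"{model}: {share:.1f}% passed tests ({passed} passed / {failed} failed).")
```

Under these assumptions the tallies match the reported overall scores: 4 of 9 tests pass for CNN14 and axlstm, and 6 of 9 for w2v2-b and hubert-b.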