Correctness regression¶

Overall scores¶
	CNN14	w2v2-b	hubert-b	axlstm
Overall Score	22.2% passed tests (2 passed / 7 failed).	66.7% passed tests (6 passed / 3 failed).	55.6% passed tests (5 passed / 4 failed).	44.4% passed tests (4 passed / 5 failed).

Concordance Correlation Coeff¶

Threshold: 0.5¶
Data	Concordance Correlation Coeff
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.42	0.60	0.64	0.50
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.66	0.72	0.71	0.63
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard	0.43	0.47	0.47	0.38
mean	0.50	0.60	0.60	0.50

Threshold: 0.1¶
Data	Mean Absolute Error
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.14	0.14	0.12	0.12
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.10	0.10	0.10	0.10
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard	0.11	0.11	0.11	0.10
mean	0.12	0.12	0.11	0.11

Threshold: 0.5¶
Data	Pearson Correlation Coeff
Data	CNN14	w2v2-b	hubert-b	axlstm
iemocap-2.3.0-emotion.dimensions.test.gold_standard	0.42	0.62	0.64	0.51
msppodcast-2.6.1-emotion.dimensions.test-1.gold_standard	0.66	0.73	0.71	0.63
msppodcast-2.6.1-emotion.dimensions.test-2.gold_standard	0.45	0.51	0.49	0.38
mean	0.51	0.62	0.61	0.51