Answers A, B, C, D are not all equally likely - is it really accurate to use random baseline as comparison? #11

bmosaicml · 2023-04-19T19:43:49Z

I pulled the test data linked in the README, and I am noticing within each category there is basically never an even 25% split between A, B, C, and D..

The most imbalanced category is high school statistics, for which 47% of the answers are D.

I have two Qs: Is my analysis correct? I was using the test data downloadable from the main repo. Furthermore, if my analysis is correct wouldn't random baseline not be a fair comparison, since majority vote would do much better?

I used the data here: https://people.eecs.berkeley.edu/~hendrycks/data.tar

bmosaicml changed the title ~~Answers A, B, C, D are not all equally likely - why would a random baseline get 25%?~~ Answers A, B, C, D are not all equally likely - is it really accurate to use random baseline as comparison? Apr 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Answers A, B, C, D are not all equally likely - is it really accurate to use random baseline as comparison? #11

Answers A, B, C, D are not all equally likely - is it really accurate to use random baseline as comparison? #11

bmosaicml commented Apr 19, 2023 •

edited

Loading

Answers A, B, C, D are not all equally likely - is it really accurate to use random baseline as comparison? #11

Answers A, B, C, D are not all equally likely - is it really accurate to use random baseline as comparison? #11

Comments

bmosaicml commented Apr 19, 2023 • edited Loading

bmosaicml commented Apr 19, 2023 •

edited

Loading