Comparison of % correct answers between ChatGPT (one attempt) and users out of 509 eligible questions
% Correct by ChatGPT (one attempt/iteration) | Expected % correct by users | Statistical significance p value |
335 (65.82%) | 369.70 (72.63%) | 0.0014 |
P value <0.05 is statistically significant.