Comparison of % correct answers between ChatGPT (three attempts) and users out of 509 eligible questions
% Correct by ChatGPT (three attempts/iteration) | Expected % correct by users | Statistical significance p value |
383 (75.25%) | 369.70 (72.63%) | 0.295 |
P value <0.05 is statistically significant.