Table 3

Comparison of % correct answers between ChatGPT (one attempt) and users out of 509 eligible questions

% Correct by ChatGPT (one attempt/iteration)Expected % correct by usersStatistical significance p value
335 (65.82%)369.70 (72.63%)0.0014
  • P value <0.05 is statistically significant.