Table 4

Comparison of % correct answers between ChatGPT (three attempts) and users out of 509 eligible questions

% Correct by ChatGPT (three attempts/iteration)Expected % correct by usersStatistical significance p value
383 (75.25%)369.70 (72.63%)0.295
  • P value <0.05 is statistically significant.