Discussion about this post

Jono

Great read, Carl! We recently conducted a small study comparing the models to human markers on AQA GCSE English Language Papers 1 and 2, and found accuracy lifts with the newer models. (GPT-5 hadn't been released when we did the study, but we are in the process of evaluating it.)

One great point you raise is that humans (even highly trained ones) are themselves inconsistent, which complicates things.

The study, if you're interested: https://marking.ai/blog/how-close-is-ai-to-human-marking-accuracy
