Exam results (ordered by GPT 3.5 performance)
GPT 4 ■
Estimated percentile lower bound (among test takers)
GPT 4 (no vision)
GPT 3.5 ■
Figure 4. GPT performance on academic and professional exams. In each case, we simulate the conditions and scoring of the real exam. Exams are ordered from low