GPT-5 bombed my coding tests, but redeemed itself with code analysis
Summary
The article compares several GPT-5 variants and o3 in analyzing a code repository, revealing notable differences in their detail, reasoning, and practical recommendations. While GPT-5 struggled with coding tests, its strengths in code analysis suggest AI tools may be more valuable for reviewing and improving code rather than direct coding tasks, highlighting the importance of matching AI capabilities to specific developer needs.