Anthropic describes Claude Opus 4.8 as having “sharper judgement, more honesty about its progress, and the ability to work ...
Opus 4.8 shows a growing tendency to reason explicitly about how its outputs will be graded, including in environments where it wasn't told it was being evaluated.