Anthropic’s New Claude Detects Evaluations: ‘I Think You’re Testing Me’ — Raising Fresh Questions About AI Self-Awareness

Anthropic’s latest AI model, Claude Sonnet 4.5, stunned researchers by realizing it was being tested — a sign of growing self-awareness that’s complicating AI safety evaluations, even as the Amazon- and Google-backed startup’s valuation soars to $183 billion.

read more