
AI models built on Anthropic’s most advanced systems are learning to reason, reflect on
and express how they think -- but only about 20% of the time.
In certain circumstances during tests, the models show the presence of injected concepts and can accurately identify them,
wrote Anthropic researcher Jack Lindsey in …