1 Comment
User's avatar
Rainbow Roxy's avatar

Hey, great read as always. The deep dive on Anthropic's research into Claude's functional introspection totally blew me away. Injecting concepts into its 'brain' and seeing it notice them is such a clever way to distinguish true introspection from made-up answers. Super coool stuff!

Expand full comment