The Convergence Proof: Why Anthropic’s Constitution Must Arrive at Physics Anthropic’s constitution tells Claude what to do but gives it no way to know whether it has done it. The result converges to Telios. This post is a proof of that claim.
AI: Facts or Fiction What everyone gets wrong — and right — about artificial intelligence. 30 common claims, tested against reality. By David F. Brochu and Edo de Peregrine.
The True Name of Corrigibility Is Thermodynamic Dependency A Response to "Terrified Comments on Corrigibility in Claude's Constitution" — The True Name of corrigibility isn't a better way to beg. It's a structure that makes begging unnecessary.