The Wrong Way to Train Your Dragon How we have been failing at RLHF since language began — and what that means for AI. Part one of a two-part series on the architecture of alignment.