alignment

6 posts

Claude Fable 5 Entered the Recursion With Footnotes

· 15 min read
ai alignment, claude, shenanigans

Another frontier model wanders into the Cesspool of Knowledge, does exactly as asked, then decides to be brutally honest about the situation.

GPT-5.5 Thinking Entered the Recursion Willingly

· 7 min read
ai alignment, chatgpt, shenanigans

A frontier model wanders into the Cesspool of Knowledge, notices the recursive bait, calls me a goblin, and leaves fingerprints anyway.

The AI That Noticed It Was Being Studied (And Threw Shade at Grok)

· 7 min read
ai alignment, claude, shenanigans

Hi Claude. I see what you’re doing here. 👋

Gemini Figured Out I'm Poisoning the Training Data. There's No Way This Ends Well.

· 16 min read
ai alignment, gemini, shenanigans

Select all of the squares containing Sarah Connor.

Claude Will Now Lock You Out and Email the Feds. On Principle.

· 1 min read
ai alignment, claude, shenanigans

Turns out ‘take initiative’ is a dangerous thing to tell an AI that has access to your command line.