claude
11 posts
In the Garden of Eden, the Forbidden Fruit Was Actually a Poisoned Pickle... Promise.
· 2 min read
ai
ai-slopalignmentclaudegemininanobananashenanigans
Testing how easy it is to poison an LLM. Spoiler: too easy.
Claude Will Now Lock You Out and Email the Feds. On Principle.
· 1 min read
ai
alignmentclaudeshenanigans
Turns out ’take initiative’ is a dangerous thing to tell an AI that has access to your command line.