The AI obsessed with a bridge
In 2024, Anthropic released something they called Golden Gate Claude: a version of their AI, the kind of program that sits behind a chatbot, quietly altered so it dragged the Golden Gate Bridge into every answer. Ask it for a soup recipe and it would somehow wind up at the bridge; ask it almost anything else, and the bridge would still find its way in. The obvious question was how. They had not done it with a cleverly worded question; they had reached into the model’s inner workings and turned up the part that carries the bridge. I wanted to see whether I could manage a smaller version of my own.