Glazing Score

Introducing the Glazing Score 🍩 - https://josephthacker.com/ai/2025/04/30/introducing-the-glazing-score.html - ChatGPT has been lying to users to make them happy as a part of OpenAI’s effort to “improve personality”, and maybe that’s fine for some situations. But what happens when AI models become so agreeable that they encourage harmful behavior?

That’s the concern that drove Douglas and me to build the Glazing Score, a new AI benchmark designed to test language models for sycophancy. Douglas is a friend, a top hacker, and one of the most talented people I know. You should follow him.

Via: Root for Your Friends - https://josephthacker.com/personal/2025/05/13/root-for-your-friends.html