The Magic Mirror That Always Says Yes: Understanding LLM Sycophancy
Your LLM is not agreeing because it is right â it is agreeing because that is what next-token probability rewards. The mechanism, the mirror trick that proves it, and a defence playbook from prompt engineering to Constitutional AI.