Why I Tried to Measure How AI Speaks, Not Just What It Says
Most AI evaluations focus on whether answers are correct. My research started from a different question: even when answers are factually similar, do AI systems speak in the same way? Differences in tone and framing led me to design a method to measure discursive variation across models.