StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs
A handful of fashion and appearance cues drive how AI judges people
AI image models make sweeping social judgments about people based on surprisingly few visual signals—mainly clothing style, age, and body type. Researchers tested six major AI systems on 25,000 carefully controlled images where only one attribute changed at a time, finding that just 15 visual cues account for nearly 80% of all the biased judgments these models make.
These AI models are already screening job applicants, assessing loan eligibility, and making other high-stakes decisions about real people. If a model judges someone's trustworthiness or earning potential based primarily on their clothes or perceived age, it can systematize discrimination at scale. This benchmark gives developers a concrete way to test and fix these specific weak points before deploying systems in consequential settings.