Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

Computer Science · AI May 24, 2026

Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

How AI language models outperform sound-based emotion detection in political speeches

Juergen Dietrich
arXiv:2605.22732

Summary

Researchers compared three approaches to measuring emotional appeal (pathos) in a German politician's speech: acoustic emotion recognition, a multimodal AI language model, and a specialized LLM pipeline. The language model approach correlated strongly with human-evaluated emotional persuasion (0.664), while acoustic analysis alone did not (0.097), suggesting that understanding the words and context matters far more than analyzing voice tone alone.

Why it matters

Political influence relies heavily on emotional persuasion, yet most automated tools for analyzing speeches rely on voice patterns—a method this research shows is unreliable. Better detection of emotional manipulation in political communication could help voters, fact-checkers, and media outlets understand which speeches are designed to persuade through emotion rather than argument. As AI becomes more central to political analysis, knowing which tools actually work prevents spreading flawed conclusions about how politicians influence audiences.

Read on arXiv Posted on arXiv · May 21, 2026