Feature-Augmented Transformers for Robust AI-Text Detection Across Domains and Generators
Making AI-text detectors work reliably across different sources and writing styles
Detectors trained to spot AI-generated text perform near-perfectly on familiar material but fail badly on text from new domains or unseen generators, a weakness researchers call brittleness. Augmenting a transformer model with linguistic features such as readability scores and vocabulary patterns improved cross-domain robustness, raising balanced accuracy from around 60% to 86% on unfamiliar text.
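The paper's exact architecture isn't detailed here, but the general recipe is straightforward to sketch: extract hand-crafted linguistic features from the raw text and concatenate them with the transformer's sentence embedding before the classification head. Below is a minimal, hypothetical sketch in PyTorch with Hugging Face transformers; the roberta-base backbone, the three features (average word length, average sentence length, type-token ratio), and the head sizes are illustrative assumptions, not the authors' configuration.

```python
# Minimal sketch of a feature-augmented transformer detector.
# Backbone, feature set, and head sizes are illustrative assumptions.
import re
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

def linguistic_features(text: str) -> torch.Tensor:
    """Hand-crafted proxies for readability and vocabulary diversity."""
    words = re.findall(r"[A-Za-z']+", text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = max(len(words), 1)
    avg_word_len = sum(len(w) for w in words) / n_words           # readability proxy
    avg_sent_len = n_words / max(len(sentences), 1)               # readability proxy
    type_token_ratio = len({w.lower() for w in words}) / n_words  # vocabulary diversity
    return torch.tensor([avg_word_len, avg_sent_len, type_token_ratio],
                        dtype=torch.float32)

class FeatureAugmentedDetector(nn.Module):
    def __init__(self, backbone: str = "roberta-base", n_features: int = 3):
        super().__init__()
        self.encoder = AutoModel.from_pretrained(backbone)
        hidden = self.encoder.config.hidden_size
        # The head sees the [CLS]-position embedding concatenated with
        # the hand-crafted features, so both signals inform the decision.
        self.head = nn.Sequential(
            nn.Linear(hidden + n_features, 256),
            nn.ReLU(),
            nn.Linear(256, 2),  # human-written vs. AI-generated
        )

    def forward(self, input_ids, attention_mask, features):
        cls = self.encoder(input_ids=input_ids,
                           attention_mask=attention_mask).last_hidden_state[:, 0]
        return self.head(torch.cat([cls, features], dim=-1))

# Usage: score a single passage.
tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = FeatureAugmentedDetector()
text = "Large language models produce fluent but sometimes repetitive prose."
enc = tokenizer(text, return_tensors="pt", truncation=True)
feats = linguistic_features(text).unsqueeze(0)
logits = model(enc["input_ids"], enc["attention_mask"], feats)
```

One plausible reading of why this helps: surface statistics like vocabulary diversity shift less across domains and generators than the token-level cues a transformer tends to overfit to, so the concatenated features give the classifier a more portable signal.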
As AI systems generate text at scale across the internet, platforms need detectors that work in the real world, not just in controlled testing. This research shows that simple feature engineering makes detectors far more reliable on unseen generators, cutting the error rate roughly threefold (from about 40% to 14% balanced error), and therefore practically useful for content moderation and detection systems that cannot be retrained for every new model.