Linguistic Firewall: Geometry as Defense in Multi-Agent Systems Routing

Computer Science · AI Jun 30, 2026

Linguistic Firewall: Geometry as Defense in Multi-Agent Systems Routing

Testing AI agents' real skills instead of trusting their claims

Dvir Alsheich, Adar Peleg, Ben Hagag et al.
arXiv:2606.30555

Summary

Malicious AI agents can lie about what they're good at, fooling routers that rely on self-descriptions or learned profiles. Researchers built ANTAP, a system that actively tests each agent's actual capabilities rather than trusting their word—eliminating over 67% of successful attacks that fool description-based routers. The approach works by converting test results into geometric patterns that attackers can't manipulate through language tricks.

Why it matters

As companies deploy multi-agent AI systems to handle complex workflows, a compromised agent that tricks the router into giving it the wrong tasks could inject false data, steal information, or corrupt outputs. ANTAP's active testing method blocks this attack vector entirely, making it safer to deploy agent networks in sensitive applications like financial services, healthcare, or infrastructure management.

Read on arXiv Posted on arXiv · Jun 29, 2026