When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors

Computer Science · AI Jul 1, 2026

When LLMs Read Tables Carelessly: Measuring and Reducing Data Referencing Errors

Why AI chatbots misquote numbers from tables—and how to fix it

Yuqing Yang, Qi Zhu, Zhen Han et al.
arXiv:2606.32029

Summary

Large language models make mistakes when pulling numbers from tables, citing wrong values or skipping data entirely even when they understand the table structure. A new systematic study found these errors happen in all tested models, then showed that adding a specialized checking system—a "critic" model—can catch and correct these mistakes, boosting final answer accuracy by up to 12%.

Why it matters

When LLMs are used for real-world decisions—analyzing financial reports, medical data, or research findings—misquoting a single number can lead to wrong conclusions. The lightweight 4-billion-parameter critic described here can be added to existing AI systems to catch these mistakes before they propagate into reports or decisions, making AI tools more trustworthy for high-stakes applications without slowing them down significantly.

Read on arXiv Posted on arXiv · Jun 30, 2026