Archive, Page 3 — Paper Plaine

Computer Science · AI May 5, 2026

SpecKV: Adaptive Speculative Decoding with Compression-Aware Gamma Selection

Speeding up AI by automatically adjusting how many words to guess ahead

Shikhar Shukla
arXiv:2605.02888

Summary

A new system called SpecKV automatically tunes how many tokens a small AI model should propose at each step during the verification process that speeds up large language models. By reading signals from the draft model itself—like how confident it is in its guesses—SpecKV picks the best number of proposals for each moment, delivering 56% faster results than the current fixed approach with almost no added slowdown.

Why it matters

Large language models power chatbots, search, and countless AI applications, and making them faster directly cuts energy costs and lets more people access them affordably. A 56% speedup with minimal overhead means faster responses for users and significantly lower compute bills for companies running these systems at scale.

Read on arXiv Posted on arXiv · May 4, 2026

Engineering May 5, 2026

TRACED: In vivo imaging of extracellular intrinsic diffusivity, tortuosity, cell size distribution and cell density in human glioma patients

Reading tumor cell size and density from brain MRI scans without a biopsy

Joshua K. Marchant, Hong-Hsi Lee, Elizabeth R. Gerstner et al.
arXiv:2605.02615

Summary

Researchers developed TRACED, a new method that extracts detailed information about tumor structure directly from standard MRI scans of brain cancer patients. The technique measures cell size, cell density, and how easily water moves through tumor tissue — measurements previously only possible through invasive biopsies — and the team verified these measurements against actual tumor tissue samples from two patients.

Why it matters

Brain tumor surgery and treatment decisions depend on understanding tumor structure, but biopsies are invasive, risky, and only sample one small location. This MRI-based approach could let doctors assess tumor properties across the entire tumor without any biopsy, potentially improving treatment planning and monitoring how tumors respond to therapy.

Read on arXiv Posted on arXiv · May 4, 2026

Physics May 5, 2026

Note on Strong Quantum Markov Properties

When quantum systems reveal their secrets through local measurements

Chi-Fang Chen
arXiv:2605.02877

Summary

A quantum state satisfies a "strong Markov property" if you can recover lost information about it by measuring just one copy and applying a local fix — and this works the same way regardless of what you actually measure. The researchers show this property is equivalent to a simpler mathematical condition: correlations must decay in a particular way, and they prove three surprising consequences, including that you can estimate multiple properties of a quantum state from a single measurement.

Why it matters

Quantum systems are notoriously fragile and hard to measure. This result shows that under certain conditions — when a quantum state has the strong Markov property — you don't need many copies or elaborate measurement schemes to extract useful information. This could simplify how we extract information from quantum devices and systems in the lab, and it deepens our understanding of which quantum states are easier to work with in practice.

Read on arXiv Posted on arXiv · May 4, 2026

Computer Science · AI May 5, 2026

mdok-style at SemEval-2026 Task 9: Finetuning LLMs for Multilingual Polarization Detection

Spotting inflammatory speech across 22 languages before it turns toxic

Dominik Macko, Alok Debnath, Jakub Simko
arXiv:2605.02695

Summary

Researchers built an AI system to detect polarizing content online across 22 languages by finetuning large language models with a technique that keeps computational costs manageable. They strengthened the system by training it on multiple versions of the same text—anonymized, capitalized differently, and with character substitutions—making it more likely to catch polarization even when people use tricks to avoid detection.

Why it matters

Online polarization often escalates into hate speech and social division. Catching inflammatory rhetoric early, across languages and cultures, gives platforms a practical tool to intervene before discussions turn hostile. The approach also shows how to build multilingual AI systems efficiently, without needing expensive computational resources.

Read on arXiv Posted on arXiv · May 4, 2026

Computer Science · AI May 4, 2026

Towards Improving Speaker Distance Estimation through Generative Impulse Response Augmentation

Using artificial sound reflections to help systems pinpoint where speakers are standing

Anton Ratnarajah, Mehmet Ergezer, Arun Nair et al.
arXiv:2605.00721

Summary

Researchers improved distance estimation accuracy by generating synthetic acoustic data to train AI models. The approach reduced localization error by up to 68% across different room types—bringing average errors down from 2.18 meters to 0.69 meters in some settings.

Why it matters

Accurate speaker distance estimation matters for hearing aids, video conferencing systems, and spatial audio applications that need to know where someone is in a room. Real acoustic recordings are expensive and limited; this method shows that artificially generated sound reflections can work just as well for training, making it faster and cheaper to build better location-aware audio systems.

Read on arXiv Posted on arXiv · May 1, 2026

Computer Science May 4, 2026

Unsupervised Denoising of Real Clinical Low Dose Liver CT with Perceptual Attention Networks

Cleaning up blurry CT scans without needing perfect reference images

Jingxi Pu, Tonghua Liu, Zhilin Guan et al.
arXiv:2605.00793

Summary

Researchers developed an artificial intelligence system that removes noise from low-dose CT scans without requiring paired clean images for training—a major obstacle in medical imaging. The system was tested on real clinical scans and validated by radiologists, achieving results comparable to supervised methods while solving the practical problem that hospitals rarely have perfectly clean versions of the same scan to learn from.

Why it matters

Low-dose CT reduces radiation risk to patients, but the grainy images can make tumors and other abnormalities harder to spot, potentially leading to missed diagnoses. This technique cleans up those images automatically using only the noisy scans themselves, making it immediately usable in hospitals without requiring expensive paired training data. Radiologists who reviewed the results confirmed it meets clinical standards, meaning patients could get safer imaging without sacrificing diagnostic clarity.

Read on arXiv Posted on arXiv · May 1, 2026

Mathematics May 4, 2026

Optimal Merton's Problem under Multivariate Affine Volterra Models with Jumps

How investors should rebalance portfolios when markets jump unpredictably

Sigui Brice Dro, Emmanuel Gnabeyeu
arXiv:2605.00688

Summary

Investors often adjust their portfolios based on past market patterns, but real markets jump suddenly and have memory — past prices influence future ones in ways classical models ignore. This paper solves the classic portfolio-balancing problem for these more realistic, jumpy markets with memory, deriving concrete investment strategies that account for both kinds of market friction.

Why it matters

Standard portfolio advice assumes smooth, memoryless markets — assumptions that fail during crashes and volatility clusters. This work provides investors and fund managers with mathematically rigorous strategies tailored to real market behavior, potentially improving returns and risk management when applied to multi-asset portfolios.

Read on arXiv Posted on arXiv · May 1, 2026

Computer Science · AI May 4, 2026

Position: agentic AI orchestration should be Bayes-consistent

Why AI assistants need better decision-making rules for choosing which tools to use

Theodore Papamarkou, Pierre Alquier, Matthias Bauer et al.
arXiv:2605.00742

Summary

Large language models are good at predicting and reasoning, but bad at making decisions when stakes are high—like choosing which expert to ask or how much to spend. This paper argues that AI systems should use Bayesian probability rules at the control layer that decides which tools to deploy, rather than trying to make the language models themselves fully probabilistic, because this approach is practical and mathematically sound for real-world decisions under uncertainty.

Why it matters

When an AI system decides to call a specialist, request more data, or allocate resources, getting that call wrong can be expensive or risky. Using Bayesian decision theory at the orchestration level means the system tracks what it actually knows, updates beliefs as it gathers information, and chooses actions deliberately rather than by default. This framework also makes human-AI collaboration clearer: humans can see what the system believes and why it made a choice, making the system's reasoning auditable and correctable.

Read on arXiv Posted on arXiv · May 1, 2026

Quantitative Finance May 4, 2026

Foresight Arena: An On-Chain Benchmark for Evaluating AI Forecasting Agents

A blockchain-based test for AI that can actually predict the future

Maksym Nechepurenko, Pavel Shuvalov
arXiv:2605.00420

Summary

Researchers built an on-chain benchmark that measures whether AI forecasting agents can genuinely predict real-world events better than existing markets, rather than just copying market prices or getting lucky with timing. The system uses blockchain smart contracts to prevent cheating and applies statistical scoring rules that reward honest probability estimates, and testing shows that detecting a real forecasting edge requires roughly 350 predictions—far more than most existing evaluations.

Why it matters

Most AI forecasting systems today are evaluated on static datasets or by their trading profits, both of which hide whether an AI actually has predictive skill or just got lucky with market timing and position sizing. This benchmark lets anyone trustlessly evaluate AI forecasting agents on real prediction markets with proper statistical incentives, cutting through the noise to identify which systems genuinely see the future more clearly than crowds do. For AI companies and traders, it's a way to separate signal from noise; for the broader AI safety community, it's a model for building evaluations resistant to overfitting and centralized gaming.

Read on arXiv Posted on arXiv · May 1, 2026

Statistics May 4, 2026

Adaptive Querying with AI Persona Priors

Using AI personas to ask smarter survey questions with limited budgets

Kaizheng Wang, Yuhang Wu, Assaf Zeevi
arXiv:2605.00696

Summary

Researchers developed a new method for adaptive surveys that uses artificial intelligence personas—templates of how different types of people respond—to predict what questions will be most informative to ask next. Rather than relying on rigid statistical models or expensive computations, the approach treats each person as belonging to one of several AI-generated persona types, which allows for quick, accurate predictions and efficient question selection even when surveying new populations or asking about unfamiliar topics.

Why it matters

Surveys and tests that adapt their questions based on previous answers can extract more reliable information while asking fewer questions—cutting costs and reducing respondent fatigue. This method makes adaptive surveying practical for real applications like market research, psychological assessment, and opinion polling, especially when you're starting fresh with a new population and can't rely on historical data. The approach also produces interpretable results: you learn not just what someone thinks, but which persona type they resemble, offering actionable insights alongside raw answers.

Read on arXiv Posted on arXiv · May 1, 2026

Computer Science · AI May 3, 2026

Beyond Gaussian Bottlenecks: Topologically Aligned Encoding of Vision-Transformer Feature Spaces

Better 3D geometry in AI videos by redesigning how models compress visual information

Andrew Bond, Ilkin Umut Melanlioglu, Erkut Erdem et al.
arXiv:2604.28122

Summary

Video models often generate plausible motion but fail to preserve real 3D geometry and camera movement. Researchers developed S²VAE, which replaces conventional compression methods with a geometry-aware design that forces the model to think in terms of 3D space, depth, and physical structure rather than appearance alone—and showed this approach consistently outperforms existing methods, especially when heavy compression is needed.

Why it matters

Video synthesis systems power everything from robotics simulation to 3D content creation. Models that properly preserve 3D geometry and camera physics produce more realistic, physically plausible outputs and could reduce the need for expensive manual corrections or post-processing. This approach also makes visual models more useful for tasks like autonomous navigation, where physical accuracy isn't optional.

Read on arXiv Posted on arXiv · Apr 30, 2026

Quantitative Biology May 3, 2026

A Thermodynamic Analysis of Enhanced Metastability in Isochoric Supercooled Liquids

Why freezing liquids in sealed containers keeps them liquid longer

Boris Rubinsky
arXiv:2604.26302

Summary

Keeping a liquid at constant volume instead of letting it expand prevents ice crystals from forming — even at temperatures well below freezing. The researchers proved this thermodynamically by showing that sealed containers create a weaker push toward solidification than open ones do, making ice nucleation exponentially less likely.

Why it matters

Supercooled liquids (water that's frozen solid in temperature but still liquid in structure) have real uses in cryopreservation and medical storage. Understanding how to keep them stable longer without chemical additives could improve organ transplant viability and reduce biological sample damage during freezing procedures.

Read on arXiv Posted on arXiv · Apr 29, 2026