PAPER PLAINE

Fresh research, simply explained. Updates twice daily.

A Concentration Inequality for the Covariance Matrix of an Arbitrary Subset of Random Vectors

Measuring uncertainty when you choose which data points to analyze

When statisticians select which data to use based on what that data looks like, standard mathematical guarantees break down. This paper proves new rules for how reliable sample covariance matrices remain even after such data-dependent selection—and shows these new rules are much tighter and more practical than existing workarounds. The results extend to realistic scenarios with weakly dependent observations and apply directly to clustering problems.

Many real machine-learning algorithms pick or filter their data based on what they see, not randomly in advance. Without reliable guarantees for this setting, practitioners can't know whether their statistical conclusions are trustworthy. This work closes that gap, providing theoretical backing for algorithms that adaptively select subsets of data while maintaining provable recovery guarantees—particularly relevant for clustering tasks where picking the right groups is inherently data-dependent.

Diagonal Hessian Approximation Based on Conjugacy Condition for Noisy Derivative-Free Optimization Problems in High Dimensions

A cheaper way to optimize when noise drowns out the signal

When optimizing a complex system using only function values (not gradients), noise can fool the algorithm into trusting bad data points. Researchers developed a simpler scaling mechanism that ignores unreliable rankings and instead tracks the successful steps the algorithm has already taken, cutting computational cost while improving reliability in high-noise conditions.

Many real-world optimization problems—from tuning industrial processes to training AI models with limited data—can't measure gradients directly and must contend with noisy measurements. This method makes high-dimensional optimization faster and more stable when noise is severe, without requiring expensive matrix calculations or gradient estimation that doesn't work reliably anyway.

Agentic AutoResearch forSpace Autonomy: An Auditable, LLM-Driven Research Agent for Aerospace Control Problems

AI that can teach spacecraft to fly themselves—and prove the results are real

Researchers built an AI agent that automatically designs control policies for spacecraft by proposing and testing tweaks to training code, then checking whether improvements are genuine or just statistical noise. On two docking and rendezvous problems, the AI-designed policies outperformed random parameter searches so decisively that on one task, undirected search produced no working solution at all while the AI approach succeeded every time.

Spacecraft currently rely on hand-coded control systems or policies developed through labor-intensive manual research. This framework could compress that development cycle while building in built-in verification that results are trustworthy—crucial for safety-critical aerospace applications where false confidence in a control system could end in collision or mission failure.

CLUSTER: Derivative-free optimization of smooth functions with parameter-change costs

Speeding up lab experiments when moving between settings costs time and money

A new algorithm called CLUSTER optimizes laboratory experiments about 50% faster than existing methods when there's a penalty for adjusting each parameter or group of parameters—such as when a robot must physically reposition equipment. The approach works especially well for real-world lab setups like optics experiments, and outperforms popular alternatives like Bayesian optimization.

Robot-controlled labs waste time and resources repositioning equipment between every tiny parameter adjustment. CLUSTER reduces this waste by being smarter about which parameters to change together, cutting experiment time significantly. For labs running hundreds of optimization experiments—from drug discovery to materials science—this 50% speedup translates directly to faster results and lower costs.

An algorithm to exactly compute minimal upper bounds in the Loewner order

Finding the smallest matrix that bounds a collection of matrices

Researchers developed an algorithm that can exactly compute the smallest upper bound for any group of matrices—a problem that matters across optimization, quantum computing, and control theory. The method finishes in at most n iterations and works by finding what's called a minimal upper bound in the Loewner order, a mathematical framework for comparing matrices.

Many optimization and engineering problems require finding a single matrix that bounds multiple others, but unlike ordering regular numbers, matrices often have no unique smallest upper bound. This algorithm provides a guaranteed way to find one, enabling faster and more precise solutions in quantum information processing, control systems design, and numerical computations that rely on comparing matrices in this specific way.

A Statistical and Machine Learning Framework for Operational Threshold Detection and Deployable Dispatch Controller Development in Hydrogen Multi-Energy Systems

Machine learning reveals hidden patterns in hydrogen energy systems

Researchers analyzed a year of real operating data from a hydrogen energy system and found that solar power alone explains nearly half of hydrogen production variation—but wind's importance only became visible when they switched from traditional statistics to machine learning methods. This revealed that wind affects hydrogen production in complex, non-linear ways that simple correlation measures completely miss, suggesting that solar and wind interact in ways traditional analysis can't detect.

Hydrogen systems are being built now as part of the shift to renewable energy, but operators don't yet know how to run them efficiently. This framework provides a practical toolkit for predicting when to make hydrogen and when to sell it back to the grid, potentially reducing waste and improving revenue. The finding that machine learning uncovers real dynamics hidden from traditional statistics means energy operators need both approaches working together to actually optimize these systems.

Differential Geometric Conditions for Koopman Linearizability of Control-Affine Systems

When can curved control systems be transformed into straight-line ones?

Researchers identified mathematical conditions that determine whether a nonlinear control system can be converted into a simpler linear form using a technique called Koopman linearization. The conditions—based on the geometric properties of the system's equations—are both necessary and sufficient for this transformation to work, providing engineers with a practical checklist to assess whether linearization is possible before attempting it.

Control engineers routinely work with nonlinear systems (robots, aircraft, power grids) that are hard to analyze and control. If a system can be Koopman linearized, standard linear control techniques become available, making design faster and more reliable. These geometric conditions let engineers quickly determine whether linearization will work for their specific system, avoiding wasted effort on impossible transformations.

How Low Can You Go? Active Learning for Sparse Model Discovery in the Ultra-Low-Data Limit

Finding the fewest measurements needed to discover nature's hidden rules

Scientists often need to collect enormous amounts of data to reverse-engineer the equations that govern complex systems — but that data is expensive and time-consuming to gather. This work shows a smarter sampling strategy that identifies the right measurements to take, cutting the data requirement dramatically. By selectively measuring the most informative moments in a system's evolution rather than sampling randomly, the method reconstructs governing equations for both ordinary and partial differential equations with a fraction of the usual data cost.

Discovering the equations behind real-world systems — from weather patterns to turbulent flows to chemical reactions — often requires costly experiments or simulations. This approach could make equation discovery practical in fields where data collection is expensive or slow, allowing engineers and scientists to understand complex behavior with far fewer measurements. For systems where each experiment costs time or money, needing 5 measurements instead of 50 makes the difference between feasible and infeasible research.

Consecutive Support Matching Induced Parameter Tuning Accelerates Momentum Iterative Hard Thresholding

A smarter way to speed up finding hidden signals in noisy data

Mathematicians have designed a faster algorithm for recovering sparse signals from incomplete measurements — a problem central to compression, medical imaging, and radar. The breakthrough is an adaptive method that automatically switches from careful exploration to high-speed convergence once it zeros in on the right solution, avoiding the manual tuning that usually slows down momentum-accelerated algorithms.

Sparse signal recovery underpins everything from MRI scanners to compressed sensing applications where you need to reconstruct images or signals from far fewer measurements than classical theory says possible. By automating parameter tuning, this method gets to accurate answers faster without requiring engineers to hand-tune settings for each new problem—cutting computational time while maintaining accuracy in both clean and noisy data.

Iterative Thresholding Pursuit with Continuation for \ell_{1-2}-Regularized Sparse Recovery

A faster way to recover hidden patterns from incomplete data

Researchers developed a new algorithm that reconstructs sparse signals—patterns hidden in incomplete measurements—more efficiently than existing methods. By combining two complementary mathematical techniques, the method converges faster and requires no advance knowledge of how sparse the underlying pattern actually is.

Sparse recovery is fundamental to medical imaging, radar, and data compression. Faster, more reliable algorithms mean clearer MRI scans with less radiation, better quality images from fewer measurements, and quicker processing of real-time signals in communications and sensing systems.

Ensemble Kalman Inversion as an Inertial Interacting Particle System

Stopping an algorithm from getting stuck by adding momentum and particle repulsion

A widely used optimization method called Ensemble Kalman Inversion can collapse prematurely, losing the diversity of candidate solutions it needs to find good answers. Researchers added inertia (momentum) and a repulsive force between particles to keep them from bunching together, preventing this collapse while maintaining mathematical guarantees that the method converges to optimal solutions.

Ensemble Kalman Inversion is used across science and engineering to solve inverse problems—inferring unknown causes from observed effects—in fields like medical imaging, materials science, and climate modeling. By fixing its tendency to fail on certain problems, this improved version makes the method more reliable without requiring derivatives, which are often expensive or impossible to compute in real applications.

A No-Regret Framework for Adaptive Incentive Design

How to nudge strategic people toward collective good while learning their preferences

A new method lets a central authority (like a regulator or planner) design taxes, subsidies, or payments that steer self-interested agents toward socially beneficial choices—while simultaneously figuring out what those agents actually want through their repeated responses. The framework guarantees that estimation errors shrink predictably and the total social cost loss stays close to optimal, even when agents' preferences start out completely unknown.

Policy makers constantly struggle to design incentives—carbon taxes, congestion pricing, welfare programs—without knowing exactly how people will respond or what constraints they face. This framework provides a principled way to adjust incentives over time as you learn, ensuring you don't waste resources on poorly-tuned policies while fumbling in the dark. It trades short-term exploration (slightly suboptimal incentives that reveal preferences) for long-term efficiency, with proven mathematical guarantees on how much welfare you'll recover.

Generalized matrix nearness problems II

Finding the best matrix match under complicated constraints

When you need to find a matrix that best approximates a complicated expression, you can't always solve it directly—but this paper shows how to do it anyway. The researchers developed an algorithm that always finds the best answer, works for multiple types of matrix problems, and does so using only standard computational techniques without needing to calculate gradients.

Matrix nearness problems appear in signal processing, computer vision, and control systems—anywhere engineers need to find the closest match to data while respecting real-world constraints. This work makes it practical to solve versions of these problems that were previously unsolvable, expanding what's computationally feasible in applications from image compression to robotic control.

Relations between categorifications of higher-dimensional type A cluster combinatorics

How different ways of organizing abstract algebra turn out to be the same

Mathematicians proved that three seemingly different ways of categorizing algebraic structures in higher dimensions are actually connected: two of them are built-up versions of a third one, obtained by removing certain extraneous structure. This explains a decades-old mystery about why a count of simple objects in one type of algebra always matches a count in a related type.

This work bridges two competing models for organizing complex algebraic objects, letting mathematicians working in different corners of the field understand they're studying the same underlying landscape. By revealing these hidden connections, it provides a unified foundation for higher-dimensional algebra—a framework that increasingly underpins applications from representation theory to mathematical physics.

Optimization over the intersection of manifolds

A simpler way to optimize when solutions must satisfy multiple geometric constraints

Mathematicians solved a long-standing puzzle in optimization: when a solution must lie on the intersection of two curved surfaces, two different regularity conditions that seemed different are actually equivalent. Using this insight, they designed a practical algorithm that stays on one surface while systematically approaching the other, and proved it reliably finds optimal solutions across problems ranging from data compression to fitting embeddings.

Many real problems—from compressing high-dimensional data to fitting machine learning models—require finding the best solution subject to multiple geometric constraints that intersect in complex ways. This work removes a major computational barrier: instead of struggling with coupled constraints, practitioners can now use a straightforward algorithm with guaranteed convergence. This opens the door to faster, more reliable solutions in fields like signal processing, dimensionality reduction, and scientific computing.

Watts vs. Bytes: Turning Data Centers into Grid Assets via Storage Compute Co-Optimization

Making data centers help power grids by timing computing and batteries together

Data centers can slash operating costs and help stabilize power grids by coordinating when they run computer tasks with when their backup batteries charge and discharge. A new framework shows that when grids get tight and can't accept more peak power, this coordination doubles the value of the battery system while still completing computing work on time.

As data centers consume more electricity, grids face real capacity limits. This approach lets data centers become grid helpers instead of problems—they can absorb power at off-peak times and reduce demand during crunch hours. For grid operators, that means deferring expensive infrastructure upgrades. For data center operators, it means lower bills and new revenue from selling grid services. Under tight grid conditions, the value compounds: the same battery system becomes twice as valuable simply because computation and storage work as a team.

Conformal Rigidity of Graphs: Subdifferentials and Orbit-Isometries

A new way to spot when graph structures have hidden symmetries

Researchers found a framework using symmetry properties to determine when a network's edge weights are already optimal for controlling how its vibrations spread. The discovery lets them certify this optimality by checking a single eigenvector instead of numerically solving complex equations, making the verification much faster and more reliable.

Networks with these symmetries appear throughout engineering, physics, and computer science — from electrical grids to molecular structures to recommendation systems. Being able to verify optimal configurations algebraically instead of numerically means engineers can confidently design these systems without the computational bottlenecks and rounding errors that plague existing methods.

Cavity shape reconstruction with a homogeneous Robin condition via a constrained coupled complex boundary method with ADMM

Finding hidden boundaries inside objects using partial measurement data

Researchers developed a new mathematical method to reconstruct the shape of an unknown internal or hidden boundary in an object when they can only measure conditions on the accessible outer surface. The technique converts the problem into a complex-valued mathematical framework and uses an optimization algorithm to find the boundary shape that best matches the measured data, even when measurements are noisy or imperfect.

This could improve medical imaging (like ultrasound or tomography) where doctors need to identify internal boundaries or detect cavities without full access to the object. It also applies to materials testing and nondestructive inspection, where engineers need to locate internal flaws or structural features by measuring only from the surface. The constrained optimization approach makes the method more robust when real-world measurements contain errors.

Penalty-Based First-Order Methods for Bilevel Optimization with Minimax and Constrained Lower-Level Problems

Solving nested optimization problems where both levels play competing roles

Researchers developed new algorithms for a class of optimization problems where you're trying to optimize something that depends on the solution to another optimization problem—and both levels involve competing objectives rather than simple minimization. The method works without strong mathematical assumptions and achieves significantly faster performance than prior approaches, especially for constrained problems where existing methods were up to 1,000 times slower.

This type of nested optimization appears in machine learning applications like training robust AI models that resist adversarial attacks, game-playing systems, and fairness-aware machine learning. Faster algorithms mean these systems can be trained in hours instead of days, making it practical to deploy protective techniques that were previously too slow to be useful in real applications.

SNAPO: Smooth Neural Adjoint Policy Optimization for Optimal Control via Differentiable Simulation

Training AI to make better decisions while instantly measuring risk exposure

Researchers developed SNAPO, a method that trains neural networks to make sequential decisions in complex systems while simultaneously computing how sensitive those decisions are to different inputs and conditions. Unlike existing approaches that either solve small problems slowly or train fast but blind, SNAPO trains a policy in minutes while automatically generating thousands of sensitivity measurements at essentially no extra cost — a single backward pass produces both the training signal and all the risk metrics.

Real-world decision systems need both speed and accountability. Energy traders need to know how their storage decisions respond to price swings; pension fund managers need to measure exposure across dozens of risk factors; pharmaceutical manufacturers must document how process changes affect product quality for regulators. SNAPO delivers these sensitivities during training rather than afterward, cutting computation time by orders of magnitude — sensitivity analysis that took hours now takes milliseconds — while keeping the same training budget. This makes AI-driven optimization practical for industries where understanding risk isn't optional.

Symmetric Bessmertnyĭ Realizations and Field Extension Problems in Characteristic 2 - A Differential Algebra Approach

A simpler way to check when complex systems have valid mathematical structures

Mathematicians found a purely algebraic method to verify when certain matrix structures—called Symmetric Bessmertnyĭ realizations—can exist in characteristic 2 fields, a setting where ordinary arithmetic rules break down. The new approach uses calculus-like tools on rational functions to reduce the problem from checking entire matrices to checking just their diagonal entries, making verification much simpler.

Linear systems theory relies on these realizations to describe how systems behave, and the new algebraic proof works in characteristic 2 fields, which appear in coding theory and digital systems where all arithmetic happens modulo 2. The simpler method makes it practical to verify whether a given system has a valid mathematical representation without running complex algorithms, and also reveals new connections between realizability and field extensions that could inform future designs.

Optimal Merton's Problem under Multivariate Affine Volterra Models with Jumps

How investors should rebalance portfolios when markets jump unpredictably

Investors often adjust their portfolios based on past market patterns, but real markets jump suddenly and have memory — past prices influence future ones in ways classical models ignore. This paper solves the classic portfolio-balancing problem for these more realistic, jumpy markets with memory, deriving concrete investment strategies that account for both kinds of market friction.

Standard portfolio advice assumes smooth, memoryless markets — assumptions that fail during crashes and volatility clusters. This work provides investors and fund managers with mathematically rigorous strategies tailored to real market behavior, potentially improving returns and risk management when applied to multi-asset portfolios.

Robust Constrained Optimization via Sliding Mode Control

A control-theory approach that solves optimization problems faster and under messy conditions

Researchers developed a new method for solving constrained optimization problems—a common task in engineering and science—by borrowing techniques from control theory. The approach guarantees that constraints are satisfied exactly and reaches the optimal solution in finite time, even when the problem is non-convex or the system is buffeted by noise and disturbances.

Most classical optimization methods assume clean data and ideal conditions, but real-world problems involve measurement errors, uncertainty, and unexpected disturbances. This framework solves that problem by building robustness directly into the method, allowing engineers and scientists to find good solutions reliably in noisy, uncertain environments—from robotics to power systems to machine learning.

Hypergraph independence bounds: from maximum degree to average degree

When sparse networks hide large independent sets, how dense ones must too

Mathematicians proved that if you can guarantee a certain minimum size of non-connected nodes in networks with a strict upper limit on connections per node, then the same guarantee automatically holds for networks with that same average connection level. The result bridges two different ways of measuring network sparsity and applies to hypergraphs—the generalization of networks where edges can connect more than two nodes at once.

This theorem simplifies proofs across multiple network structures by eliminating the need to separately verify bounds under different sparsity conditions. Graph theorists and computer scientists studying network properties, coloring algorithms, and combinatorial optimization can now transfer known results between maximum-degree and average-degree settings, reducing redundant work and expanding what we know about when large independent sets must exist in sparse networks.

Extremal graphs for average size of maximal matchings in bicyclic graphs

Finding the graph shapes that give the smallest average matchings

Mathematicians determined the minimum possible average size of maximal matchings in bicyclic graphs — networks with exactly two cycles — and identified exactly which graph shape achieves this minimum. For any such graph with n vertices, the average matching size cannot drop below (4n−11)/(2n−5), with equality occurring only when two triangles share an edge and extra vertices hang off one corner.

This completes a research program started years ago on matching problems in increasingly complex graphs. The methods used here — breaking down the problem by identifying which small matchings drive the minimum — create a template for solving similar extremal problems on other graph families, potentially accelerating progress on open questions in combinatorics.

Cliques in minimally globally rigid graphs

Why the densest possible rigid structures must be complete and symmetric

Mathematicians have proven that certain rigid geometric structures—ones that can't be deformed without breaking their constraints—must actually be the simplest possible version if they contain a dense enough subgroup of connections. The finding confirms a 20-year-old prediction about how rigidity and connectivity relate in multidimensional space.

This result helps engineers and mathematicians understand the boundaries between minimal rigidity and redundancy. In applications like robot design, mechanical linkages, and structural analysis, knowing exactly when a structure must be completely symmetric versus when it can be sparser tells engineers how much flexibility they have in their designs without sacrificing stability.

Semidefinite and linear programming bounds for sum-rank-metric codes and non-existence results

Finding the limits of codes that protect data sent across networks

Researchers developed new mathematical tools to determine the maximum size of error-correcting codes designed for modern communication systems like distributed storage and network coding. Using optimization techniques including semidefinite programming, they found sharper upper limits on code size than previous methods and proved that certain theoretically perfect codes cannot actually exist.

Error-correcting codes are fundamental to reliable data transmission—from cloud storage to wireless communications. These tighter bounds help engineers understand what's theoretically possible and avoid wasting resources searching for codes that don't exist, while the new optimization methods could improve the design of more efficient communication systems.