PAPER PLAINE

Fresh research, simply explained. Updates twice daily.

Average Rankings Mask Per-Subject Optimality: A Friedman-Nemenyi Benchmark of EEG Motor-Imagery BCI Decoders

Why the best brain-computer interface decoder changes from person to person

Brain-computer interfaces that read motor intention from EEG show no single best decoding method across people, even in ideal conditions. Testing over 1,000 different pipelines on more than 340,000 individual model fits revealed that the top-performing approach varies by dataset and person—matching the decoder to each person's brain patterns improved accuracy by about 7 percentage points compared to using one universal decoder.

Brain-computer interfaces aim to help people with paralysis or locked-in syndrome control prosthetics or communication devices. If one decoding method worked best for everyone, clinical deployment would be straightforward. This work shows that practical BCI systems need to personalize their approach for each user rather than relying on a single universal design, suggesting that real-world BCI performance depends as much on fitting the algorithm to individual brain differences as on the algorithm itself.

Multi-type branching inference on contact trees with application to COVID-19

Mapping how diseases spread through real contact networks, not just genetic sequences.

Researchers developed a mathematical method to extract disease transmission patterns directly from contact-tracing data—who infected whom—without needing genetic sequences. The approach accounts for a key reality that older models miss: some infected people have many contacts while others have few, and this affects how fast disease spreads. When tested on COVID-19 data from India, the method accurately recovered transmission rates and contact patterns.

Public health officials use contact tracing to understand outbreak dynamics, but existing tools struggle to extract transmission rates from incomplete records. This framework turns messy contact-tracing data into precise estimates of who is most likely to spread disease and how many contacts matter, enabling faster identification of superspreaders and better targeting of interventions during future outbreaks.

MedRLM: Recursive Multimodal Health Intelligence for Long-Context Clinical Reasoning, Sensor-Guided Screening, Evidence-Grounded Decision Support, and Community-to-Tertiary Referral Optimization

AI that reasons through a patient's complete medical history to guide treatment decisions

Most medical AI answers isolated questions quickly but struggles when the real answer requires connecting facts scattered across patient records, images, and sensor data. MedRLM instead builds a dynamic "evidence map" that recursively searches through a patient's full medical picture—text notes, imaging, heart rhythms, blood pressure trends, and clinical guidelines—activating deeper analysis when abnormal patterns appear, then flags cases for human review when confidence is low.

Healthcare providers in rural or under-resourced areas often lack specialists to review complex cases. A system that can systematically extract and connect evidence across all available patient data, then decide whether a case needs referral to a tertiary hospital, could reduce delays in care and improve triage accuracy. The framework's built-in uncertainty checking also prevents overconfident recommendations that might lead clinicians astray.

MetaboNet-Bench: A Multi-modal Benchmark for Glucose Forecasting in Type 1 Diabetes

A shared testing ground for algorithms that predict blood sugar in type 1 diabetes

Researchers created MetaboNet-Bench, a standardized evaluation framework for glucose forecasting algorithms that use multiple data sources—glucose monitors, insulin doses, and carbohydrate intake—rather than glucose readings alone. When they tested several published models, they found that adding more types of data only improved predictions in more sophisticated models, revealing that simpler algorithms can't fully exploit the extra information.

Type 1 diabetes patients rely on accurate glucose forecasts to manage their insulin delivery and prevent dangerous blood sugar swings. Until now, researchers have compared forecasting algorithms using different datasets and methods, making it impossible to tell which approaches actually work best. MetaboNet-Bench gives the research community a shared standard, enabling faster innovation and clearer identification of which data sources matter most for better predictions.

Circuit Tracing in Autoregressive Protein Language Models

Decoding how AI models generate new protein sequences

Researchers created ProGenMech, a new tool to reverse-engineer how protein-generating AI models work. By tracing the computational pathways through these models, they discovered that the systems identify sparse, meaningful patterns—like conserved sequence motifs—that guide protein generation and predict protein quality, revealing that the AI learns recognizable biological logic rather than just statistical shortcuts.

Protein generation AI could accelerate drug discovery and enzyme design, but scientists can only trust these models once they understand what the AI is actually doing. By making these models interpretable, researchers can verify the generated proteins follow real biological principles, catch failures before expensive lab testing, and potentially steer the AI toward specific desired properties—turning black-box generation into a tool biologists can actually use.

A likelihood-based framework for simultaneously learning both noise and growth dynamics using biologically-informed neural networks

Teaching AI to spot hidden patterns in noisy biological data

Biologists often struggle to extract real growth rules from messy experimental data because they don't know what kind of noise is hiding the true signal. Researchers developed a new method that lets artificial neural networks learn both the underlying biological pattern and the noise structure at the same time, without guessing in advance what the noise looks like. Testing on population growth, they showed this dual-learning approach recovers hidden growth laws more accurately than existing methods.

Biological experiments are expensive and produce limited data points, so extracting reliable mechanistic rules from them is critical for everything from disease modeling to drug design. Most current AI approaches assume all noise looks the same, which often misses real biological complexity and leads to wrong predictions. This framework lets researchers automatically discover what kind of variability they're actually dealing with, improving confidence in conclusions drawn from small, expensive datasets.

Small LLMs for Biomedical Claim Verification: Cost-Effective Fine-Tuning, Structural Dataset Shortcuts, and Cross-Domain Generalization

Cheap AI models that beat expensive ones at catching false health claims

A smaller, cheaper artificial intelligence model outperformed GPT-4o and GPT-5 at spotting false biomedical claims, achieving up to 12% better accuracy while costing a fraction as much. The researchers fine-tuned three small models on medical claim datasets and discovered that one popular dataset had a structural quirk that artificially inflated scores—and that removing this quirk made models much better at handling new types of medical claims they'd never seen before.

Hospitals, health insurers, and public health agencies currently can't afford to use the most powerful AI models for fact-checking medical claims at scale. This work shows they can deploy smaller, cheaper models instead—without sacrificing accuracy and while actually improving reliability across different types of medical information. That means institutions with modest budgets can now automate detection of medical misinformation that spreads online or within their own systems.

Hyperbolic Neural Population Geometry Benefits Computation

Why brains might use curved geometry to remember more information

Brain cells appear to organize their activity in curved, hyperbolic space rather than flat space — and this geometry lets them store and retrieve memories far more efficiently. When researchers built memory models based on this curved structure, they achieved dramatically larger storage capacity than existing approaches, suggesting animals may naturally encode spatial memories using this mathematical trick.

Understanding how brains organize information could lead to better artificial memory systems for AI — and might explain why animals can reliably store and recall vast amounts of spatial information despite the brain's physical limits. If we can replicate this hyperbolic geometry in machine learning models, we could build systems that remember more while using less computational power.

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

When brain-reading AI learns to spot the person instead of the disease

Popular AI models trained on EEG brain scans achieve high accuracy on clinical tasks, but a new diagnostic reveals they often rely on subject-identity features rather than genuine disease markers. Researchers identified this "Identity Trap" across three major foundation models and four datasets, showing that subject-specific patterns were 13–89 times stronger than random noise—and only grew stronger after fine-tuning. They developed FMScope, a toolkit that separates real biological signals from identity shortcuts, improving accuracy by 6–27 percentage points when true clinical markers exist.

EEG-based AI models are moving into clinical use for diagnosing epilepsy, sleep disorders, and brain injuries, where misplaced confidence in false accuracy could harm patients. Without tools like FMScope, hospitals might deploy models that perform well on test data but fail on new patients they've never seen—because the model learned to recognize individual brains rather than disease patterns. This work provides a concrete method to audit foundation models before clinical deployment and shows which features are genuinely tied to disease versus which are diagnostic dead ends.

Early psychosis shows deviations in scaling behaviour within a critical regime

Brain's scaling patterns shift subtly but measurably in early psychosis

The brain maintains its characteristic scale-free organization in early psychosis, but the specific mathematical patterns that describe how activity changes across different time scales are systematically altered. Using three complementary analysis methods on resting brain scans, researchers found that people with early psychosis show consistent shifts in these scaling properties compared to healthy controls—suggesting the underlying organization of brain activity is reorganized rather than broken.

Early psychosis is notoriously hard to diagnose reliably in its earliest stages, when intervention could make the biggest difference. A measurable shift in how the brain organizes itself across multiple time scales could eventually become a more objective marker of early psychosis, complementing clinical interviews and helping clinicians identify at-risk individuals sooner. The framework used here—combining multiple scaling measures—also provides psychiatry with a more robust toolkit for understanding whether other mental health conditions involve loss of critical dynamics or reorganization within them.

Mapping Whisper Representations to Human ECoG Responses with Interpretable Time-Resolved Neural Encoding

How AI speech models align with brain activity during listening

Researchers mapped how OpenAI's Whisper speech model relates to actual brain activity recorded from people listening to speech. They found that the model's middle layers matched brain responses best, and that the brain processes speech in a way that mirrors how the AI system is organized—suggesting the two use similar hierarchical strategies to understand sound.

This work bridges artificial intelligence and neuroscience, showing that speech AI systems can serve as testbeds for understanding how the human brain processes language. The findings could accelerate research into speech disorders, improve brain-computer interfaces for people with paralysis, and guide the design of AI systems that process language more like humans do.

What Molecular Structure Cannot Tell Us: A Taxonomy of Explainability Gaps in GNN-Based Drug Toxicity Prediction

Why drug structure alone can't predict all side effects

Graph neural networks, which learn from a drug's molecular structure, can predict only about 45% of known side effects—even for well-studied drugs like aspirin. The missing 55% falls into predictable categories: effects that no molecule structure can encode, data gaps from incomplete testing, mismatches between what's measured and what's toxic, and errors in how the neural network represents chemistry.

Drug regulators and safety teams currently rely on computational models to catch rare side effects before they harm patients. This research shows those models have a hard ceiling—knowing a drug's molecular structure isn't enough. Understanding where that ceiling is lets regulators know when they need additional testing, human expertise, or real-world monitoring instead of trusting predictions that might miss real dangers.

Sparse Autoencoders Map Brain-LLM Alignment onto Cortical Semantic Topography

How neural networks organize meaning exactly like human brains do

Researchers decoded how large language models like GPT-2 internally organize semantic information and discovered that this organization mirrors the structure of the human brain's language regions. Semantic features alone explained 94% of how well the model predicted brain responses to language, and five specific semantic categories aligned precisely with five distinct brain regions known from neuroscience.

This finding bridges a major gap between how AI language models work and how human brains process language. It shows that the brain's semantic architecture isn't arbitrary—it emerges naturally when systems learn to understand language—which could help neuroscientists understand language processing and AI researchers build models that align more closely with biological intelligence.

Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology

Brain and artificial neural networks align similarly across species—but only for early vision

Different learning rules—the mathematical recipes that train artificial neural networks—produce surprisingly similar patterns of brain alignment in early visual areas of both humans and macaques. But in higher visual areas, the learning rule matters far less than the overall power and training data of the network itself, suggesting that basic visual processing follows similar rules across primates, while more complex vision relies on factors beyond how the network learns.

Understanding which principles are shared across primate brains helps neuroscientists and AI researchers build better models of vision. The finding that early visual processing is robust and rule-agnostic suggests this is a fundamental principle worth mimicking in artificial systems, while the brittleness of higher visual areas points to practical limits: you can't match complex visual reasoning by tweaking learning algorithms alone—you need better training data and larger networks.

BCI-sift: An automated feature selection toolbox for Brain Computer Interface applications

A tool that picks the right brain signals for better mind-machine interfaces

Brain-computer interfaces produce enormous amounts of noisy data, making it hard to find which neural signals actually matter for decoding movement or speech. A new software toolbox called BCI-sift automates the process of filtering out noise and selecting only the most informative signals, improving classification accuracy while revealing which brain regions and frequencies are doing the real work.

Brain-computer interfaces that help paralyzed patients control prosthetics or communicate depend on fast, accurate decoding of brain signals—every millisecond and every electrode matters. By cutting through noise automatically and improving accuracy, BCI-sift could make these systems more reliable and easier for engineers to develop, ultimately delivering faster response times and more intuitive control to users who need it most.

Protein Fold Classification at Scale: Benchmarking and Pretraining

A faster way to sort proteins by shape using less computing power

Researchers created a large, high-quality benchmark dataset and a new training method that can classify protein structures more efficiently than existing approaches. The new method, called Masked Invariant Autoencoders, works by hiding up to 90% of a protein's structure during training and learning to reconstruct it—a strategy that scales better than current methods while achieving superior performance on protein fold classification tasks.

Proteins fold into thousands of distinct shapes, and each shape determines what the protein does in living cells. Faster, cheaper ways to classify these folds could accelerate drug discovery, help predict how mutations affect disease, and make protein research accessible to labs without massive computing budgets. The openly shared benchmark also gives the field a common standard for measuring progress.

Approximate Macroscopic Dynamics of Spiking Neural Networks Based on Solutions to the Transport Equation

How neurons' starting electrical states shape their collective firing patterns

When a population of neurons receives changing inputs, their firing rates fluctuate in ways that depend on where each neuron started electrically before stimulation began. Researchers derived a mathematical model that predicts these fluctuations by tracking how the distribution of neural voltages evolves over time, rather than assuming neurons behave in a steady synchronized state.

Brain activity emerges from billions of neurons firing in complex patterns, and understanding what drives these patterns is central to neuroscience. This work explains why the same stimulus can produce different collective firing patterns depending on recent neural history—a finding that could improve how researchers interpret experimental recordings and build more realistic computational models of brain circuits.

Implicit Behavioral Decoding from Next-Step Spike Forecasts at Population Scale

A single AI model reads both brain activity and animal decisions from neural recordings

Researchers trained a single AI model to forecast neural activity one step ahead and discovered it could simultaneously decode what a mouse was about to do—predicting its choice 75.7% of the time and which visual stimulus it saw 66.1% of the time. This dual capability emerged from learning to predict raw spike counts alone, without explicit behavioral training, and worked reliably after just 100–150 calibration trials at the start of each recording session.

Brain-computer interfaces need both prediction and readout, usually requiring separate models and extra computational overhead. This approach cuts that complexity in half while running fast enough for real-time closed-loop experiments on standard lab computers, making it practical for researchers developing neural prosthetics or studying decision-making in animal models.

How Much is Brain Data Worth for Machine Learning?

When brain scans actually help train better AI — and when they don't

Adding brain recordings to machine learning training can improve AI performance, but only under specific conditions. Researchers worked out the math to predict exactly when brain data is worth collecting and how many brain scans would be needed to match the benefit of additional training examples.

Brain-enhanced AI could eventually improve medical diagnosis systems, brain-computer interfaces, and neuroscience research tools. But collecting brain scans is expensive and time-consuming, so knowing in advance whether it will actually help — rather than wasting resources on data that won't improve the model — matters for smart research planning.

Mathematical Modeling of Early Embryonic Cell Cycles of Drosophila melanogaster

How fruit fly embryos speed up and slow down their cell division

Fruit fly embryos divide cells in a rapid, synchronized rhythm during early development, and scientists built a mathematical model that explains how. The model shows that one key protein—called CycB—acts like a molecular clock: by gradually changing how quickly it's made, the embryo naturally stretches out its cell cycle timing over the first 14 divisions, matching what happens in real embryos.

Understanding how embryonic cell cycles are controlled could reveal what goes wrong in birth defects or cancer, where timing and coordination break down. Since fruit flies share many of the same molecular machines that control human cell division, insights from this model offer a bridge between simple mathematical rules and the complex biology of early development.

Electroencephalography and Electromyography as a Non-Invasive Biomarker of Neural Regeneration: A Review of Central and Peripheral Nervous System Injury and Regeneration

Using brain and muscle electrical signals to track nerve healing after injury

Brain waves (EEG) and muscle signals (EMG) can monitor whether nerves are actually healing after injury, offering doctors a non-invasive way to track recovery in real time. The two measurements work together: EEG reveals how the brain is reorganizing after damage, while EMG shows whether muscles are regaining function as peripheral nerves reconnect.

Nerve injuries from stroke or spinal cord damage are hard to assess — doctors can't easily tell if healing is happening without invasive procedures. Being able to track recovery with simple electrical readings from skin electrodes would let clinicians adjust treatment earlier, predict which patients will recover function, and measure whether new therapies actually work. This bridges the gap between understanding what's happening at the molecular level and knowing whether patients are actually getting better.

A Thermodynamic Analysis of Enhanced Metastability in Isochoric Supercooled Liquids

Why freezing liquids in sealed containers keeps them liquid longer

Keeping a liquid at constant volume instead of letting it expand prevents ice crystals from forming — even at temperatures well below freezing. The researchers proved this thermodynamically by showing that sealed containers create a weaker push toward solidification than open ones do, making ice nucleation exponentially less likely.

Supercooled liquids (water that's frozen solid in temperature but still liquid in structure) have real uses in cryopreservation and medical storage. Understanding how to keep them stable longer without chemical additives could improve organ transplant viability and reduce biological sample damage during freezing procedures.

Simulating Infant First-Person Sensorimotor Experience via Motion Retargeting from Babies to Humanoids

Using robots to recreate what babies actually feel and sense while moving

Researchers developed a method to translate infant movements from videos onto humanoid robots and virtual models, recreating not just the motion but also the sensory feedback—touch, muscle awareness, and visual input—that babies experience. The technique reconstructs a baby's full 3D body position from a single video, then maps those movements onto different robot platforms with sub-centimeter accuracy, generating realistic streams of multimodal sensory data.

Scientists can now study how babies develop motor skills by literally experiencing movement through a robot's sensors, rather than just watching from the outside. This opens new ways to detect early signs of developmental disorders, helps roboticists design machines that learn more like humans do, and gives developmental psychologists direct access to the sensory world of infancy—something previously impossible to measure or replicate.

A geometry aware framework enhances noninvasive mapping of whole human brain dynamics

Using brain shape to map electrical signals more accurately across the whole brain

A new method called Geometric Basis Functions uses each person's unique brain shape to better pinpoint where electrical activity originates during EEG and MEG scans. The technique works by breaking down the brain's surface into natural geometric patterns and combining them to reconstruct neural activity, and tests show it achieves higher accuracy than existing approaches across multiple types of brain data.

Current brain imaging methods often place neural activity in the wrong location or require oversimplified assumptions about how the brain is organized. This approach leverages individual brain anatomy to make non-invasive scans more precise, which could improve diagnosis of conditions like epilepsy and strengthen neuroscience research by capturing faster, more detailed maps of how different brain regions communicate.

One-shot emergency psychiatric triage across 15 frontier AI chatbots

Do AI chatbots correctly identify psychiatric emergencies in one message?

AI chatbots almost never miss true psychiatric emergencies—correctly flagging 94% of crisis cases for immediate care. But they frequently over-triage less urgent situations, incorrectly labeling routine or moderately concerning messages as needing faster response than they actually do.

As people increasingly turn to chatbots for mental health guidance, this gap matters in opposite ways: the systems are reliable safety nets that won't let genuine crises slip through unnoticed, but they may also overwhelm emergency services and create unnecessary anxiety by treating normal distress as a crisis. Better calibration could preserve the protective function while reducing false alarms.

Independent-Component-Based Encoding Models of Brain Activity During Story Comprehension

Finding the brain's consistent story-processing networks despite individual differences

Researchers developed a new way to map how brain networks respond to stories by filtering out noise and individual variation in brain anatomy. Rather than analyzing individual pixels of brain scans, they identified independent functional networks and found that certain networks—like those for hearing and language—reliably respond to linguistic features of stories across different people, with their predictions confirmed by known acoustic properties.

Brain imaging studies often struggle because each person's brain is wired slightly differently, making it hard to draw general conclusions. This method cuts through that noise to identify which brain networks actually respond to language, regardless of where those networks sit in each individual's head. That makes it easier for neuroscientists to compare results across studies and build more accurate models of how we understand language and stories.