CaMBRAIN introduces a novel Mamba-based State Space Model (SSM) for real-time, continuous EEG inference, achieving state-of-the-art results with significantly higher throughput than existing methods.
Electroencephalography (EEG) is a critical, non-invasive method to monitor electrical brain activity. EEGs can span anywhere from a couple seconds to multiple hours, posing a major hurdle for existing deep learning methods due to two major factors: (1) existing EEG models are predominantly built upon the attention mechanism, incurring quadratic scaling as the sequence length increases, and (2) raw EEG signals must be processed in a sliding-window fashion due to fixed-length input requirements, preventing global understanding of the entire signal. To this extent, we propose CaMBRAIN - the first Causal, Mamba-based state space model (SSM) capable of real-time inference of EEG signals, arguing that bidirectional approaches are needlessly expensive given the causal, unidirectional nature of EEG. However, training such a model is non-trivial, as crucial EEG events can be extremely brief - within fractions of a second - yet separated by long intervals spanning minutes. Current EEG methods use self-supervised objectives that optimize for signal reconstruction, but these are not well suited for streaming SSMs; they fail to explicitly train the hidden state to retain the salient long-range context needed for streaming inference. We therefore introduce a multi-stage self-supervised training pipeline specifically tailored to encourage long-range memory retention and strong performance on EEG signals, while preserving the linear-time complexity of state space models. CaMBRAIN achieves state-of-the-art (SOTA) results across 3 different EEG datasets with >10x higher throughput than existing models, enabling the first model capable of long-range, continuous inference of variable-length EEG signals.
Benchmarking Positional Encoding Strategies for Transformer-Based EEG Foundation Models
This paper benchmarks five positional encoding strategies for transformer-based…
A Multi-dimensional Framework for Evaluating Generalization in EEG Foundation Models
The paper proposes a multi-dimensional evaluation framework to assess EEG founda…
Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detect…
This study compares multiple post-hoc explainable AI methods (e.g., DeepSHAP, Gr…
A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity
The paper demonstrates that the valence structure learned by modern LLMs aligns…
Test Time Training for Supervised Causal Learning
The paper proposes Test-Time Training for Supervised Causal Learning (TTT-SCL),…
Predicting Causal Effects from Natural Language Queries using Structured Representations
The paper introduces Query2Effect, a large-scale benchmark, and a two-step frame…
Certified Policy Optimisation for Nested Causal Bandits via PAC-Bayes Risk
The paper introduces Nested Contextual Causal Bandits (NCCBs) to model multi-tim…
Task Structure Reverses Layerwise State Encoding in Sequence Models
The paper demonstrates that the location and nature of state encoding in sequenc…