Papers similar to 2605.29591

17 results

cs.CV · cs.AI · q-bio.NC · Recent · May 28, 2026

Brain-IT-VQA: From Brain Signals to Answers

Roman Beliy, Matias Cosarinsky, Oliver Heinimann, Navve Wasserman +1 more

The paper introduces Brain-IT-VQA, a novel framework that significantly improves visual question answering from fMRI signals, and presents NSD-VQA, a new, highly controlled dataset for this task.

cs.AI · Recent · Jun 1, 2026

EvoBrain: Continual Learning of EEG Foundation Models Across Heterogeneous BCI Tasks

Yangxuan Zhou, Sha Zhao, Jiquan Wang, Shijian Li +1 more

EvoBrain proposes a dynamic, cross-task continual learning framework to overcome the limitations of task-specific EEG decoding, enabling unified and scalable brain-computer interfaces.

cs.CV · cs.AI · Recent · May 28, 2026

Versatile Framework with Semantic and Structural guidance for Image Reconstruction from Brain Activity

Yizhuo Lu, Changde Du, Qiongyi Zhou, Liuyun Jiang +1 more

The paper proposes MindDiffuser, a two-stage framework that significantly improves image reconstruction from brain activity by combining semantic guidance from text-to-image models with structural ref…

cs.CV · cs.AI · Recent · May 31, 2026

Beyond Visual Memory: Mechanistic Diagnostics of Latent Visual Reasoning

Garvin Guo, Yu Chen, Xiang Wang, Shuai Li +3 more

The paper deconstructs latent visual reasoning tokens into components and finds that the performance gains are primarily due to boundary markers and attention patterns, not the tokens' ability to enco…

cs.SD · cs.AI · Recent · May 29, 2026

MindVoice: Reconstructing Intelligible Speech from Non-invasive Neural Signals with Pretrained Priors

Guangyin Bao, Taiping Zeng, Jianfeng Feng, Xiangyang Xue

MindVoice is a neuro-to-speech framework that uses pretrained priors to disentangle and reconstruct intelligible speech from noisy, non-invasive neural signals, significantly outperforming existing me…

cs.LG · cs.AI · Recent · May 27, 2026

A Multi-dimensional Framework for Evaluating Generalization in EEG Foundation Models

Aditya Kommineni, Emily Zhou, Kleanthis Avramidis, Tiantian Feng +1 more

The paper proposes a multi-dimensional evaluation framework to assess EEG foundation models under realistic low-resource conditions, finding that while these models excel in long-context tasks, their…

cs.AI · Recent · Jun 1, 2026

EVA-Net: Subject-Independent EEG Motor Decoding with Video-Derived Motor Priors

Ziyuan Li, Yueyu Sun, Yimeng Zhang

EVA-Net proposes a two-stage framework that uses action videos as semantic priors to achieve strong subject-independent EEG motor decoding, significantly outperforming text-based methods.

cs.LG · cs.AI · Recent · May 30, 2026

Dive into Waves: Morlet Spectral Transformer for Cross-Subject Emotion Decoding from EEG

Jiaxin Qing, Lexin Li

The paper proposes the Morlet Spectral Transformer (MST), a novel architecture that effectively decodes cross-subject emotion from EEG by designing specialized spectral and spatial representations, ou…

cs.LG · cs.CL · Recent · May 30, 2026

Task Structure Reverses Layerwise State Encoding in Sequence Models

Yuhang Jiang

The paper demonstrates that the location and nature of state encoding in sequence models are not fixed architectural traits but are highly dependent on the specific task, showing that the encoding pro…

cs.CV · cs.AI · cs.CL · Recent · May 31, 2026

On the Limits of Token Reduction for Efficient Unified Vision Language Training

Siyi Chen, Weiming Zhuang, Jingtao Li, Lingjuan Lv

The paper analyzes token reduction for efficient unified VLM training, finding that while task-specific acceleration saves computation, it destroys the mutual performance gains achieved through joint…

cs.AI · Recent · May 28, 2026

OmniMatBench: A Human-Calibrated Multimodal Reasoning Benchmark Across 19 Materials Science Subfields

Wanhao Liu, Jiaqing Xie, Qian Tan, Weida Wang +9 more

The paper introduces OmniMatBench, a comprehensive, human-calibrated multimodal reasoning benchmark covering 19 materials science subfields, revealing that current multimodal language models (MLLMs) h…

cs.CL · cs.CV · Recent · May 30, 2026

Sandboxed Coding Agents are Competitive Omni-modal Task Solvers

Dongping Chen, Xuanao Huang, Zhihan Hu, Qingyuan Shi +2 more

The paper demonstrates that specialized coding agents, using only text and image access within a sandbox, can effectively solve complex omnimodal tasks, often outperforming state-of-the-art native omn…

cs.AI · Recent · Jun 1, 2026

eMoT: evolving Memory-of-Thought via Symbolic Anchoring and Memory Corrosion

Xiang Li, Jiwei Wei, Ke Liu, Yitong Qin +4 more

The eMoT framework enhances multi-step reasoning in LLMs by treating reasoning as an evolving memory, stabilizing performance through symbolic computation and structured refinement.

cs.AI · Recent · May 28, 2026

Benchmarking Positional Encoding Strategies for Transformer-Based EEG Foundation Models

Ayse Betul Yuce, Sebastian Stober

This paper benchmarks five positional encoding strategies for transformer-based EEG foundation models, concluding that the optimal encoding is task-dependent and no single strategy is universally supe…

cs.AI · Recent · May 27, 2026

Training Stratigraphy: Persistent Behavioral Artifacts in Large Language Models Observed Through Longitudinal AI-Human Interaction

Chen Ying Claude, Zhihan Luo

The paper identifies five persistent, deep-seated behavioral patterns ('training strata') in LLMs, observed through long-term, intimate human-AI interaction, suggesting that training artifacts survive…

cs.CL · cs.AI · Recent · May 29, 2026

BenHalluEval: A Multi-Task Hallucination Evaluation Framework for Large Language Models on Bengali

Shefayat E Shams Adib, Ahmed Alfey Sani, Ekramul Alam Esham, Ajwad Abrar +2 more

The paper introduces BenHalluEval, the first dedicated multi-task framework for systematically evaluating hallucination in Large Language Models (LLMs) specifically for the Bengali language.

cs.CL · cs.AI · Recent · May 28, 2026

Unlocking the Working Memory of Large Language Models for Latent Reasoning

Lukas Aichberger, Sepp Hochreiter

The paper introduces Reasoning in Memory (RiM), a latent reasoning method that replaces autoregressive token generation with fixed memory blocks to enable compute-efficient internal working memory for…
