"Mamba SSM" | ArxivCSExplorer

20 results for “Mamba SSM”

CS papers only

Hybrid search: Keyword + semantic, ranked by combined score.ⓘ

Want pure semantic search? Try claim verification →

cs.AIcs.CLcs.LGRecentJun 1, 2026

Forget Attention: Importance-Aware Attention Is All You Need

The paper proposes SISA (SSM-Informed Softmax Attention), a novel hybrid attention mechanism that integrates state-space model (SSM) importance signals directly into the attention score, achieving sta…

View →

cs.CVcs.AIRecentMay 29, 2026

Zamba2-VL Technical Report

Hassan Shapourian, Kasra Hejazi, Olabode M. Sule, Beren Millidge

Zamba2-VL is a new suite of vision-language models built on the Zamba2 hybrid architecture, achieving state-of-the-art performance and significantly improved inference efficiency compared to leading T…

View →

cs.CVEmpiricalRecentJul 24, 2026

IR275K: A Benchmark for Infrared Multi-Frame Super-Resolution Toward Efficient Remote Sensing

Jie Deng, Heyang Wang, Changxin Wang, Junkai Shen +5 more

This paper introduces IR275K, a curated benchmark for multi-frame super-resolution in infrared remote sensing, and evaluates CGMamba, a lightweight state-space model, achieving state-of-the-art perfor…

View →

cs.CVcs.CRcs.SIRecentMay 14, 2026

Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation

Mamadou Keita, Wassim Hamidouche, Hessen Bougueffa Eutamene, Abdelmalik Taleb-Ahmed +2 more

This study systematically evaluates Vision Mamba models for detecting AI-generated images, finding that while they show promise, their current strengths and limitations must be understood relative to…

View →

cs.AIRecentJun 1, 2026

Physically-Constrained Mamba-SDE for Remaining Useful Life Prediction under Irregular Observations

Deyu Zhuang, Peiliang Gong, Yang Shao, Liyuan Shu +3 more

The paper proposes PC-MambaSDE, a physically-constrained continuous-time framework that accurately predicts Remaining Useful Life (RUL) despite irregular sensor observations and ensures physically pla…

View →

cs.CVcs.AIRecentJun 1, 2026

RPCASSM: Robust PCA State Space Model For Infrared Small Target Detection

Pingping Liu, Aohua Li, Yubing Lu, Jin Kuang +2 more

The paper proposes RPCASSM, a novel state space model leveraging Robust PCA (RPCA) to accurately detect and segment infrared small targets by separately modeling background and target information base…

View →

cs.ARcs.DCEmpiricalRecentJun 29, 2026

COSM: A Cooperative Scheduling Framework for Concurrent PIM and CPU Execution on Mobile Devices

Yilong Zhao, Fangxin Liu, Onur Mutlu, Mingyu Gao +3 more

The paper introduces COSM, a cooperative scheduling framework to facilitate concurrent operation of Processing-in-Memory (PIM) and CPU tasks on mobile platforms, improving PIM throughput by up to 2.8x…

View →

cs.CLcs.AIcs.LGRecentMay 30, 2026

Detection vs. Execution: Single-Bucket Probes Miss Half the Mamba-2 State Sink

Yuhang Jiang

The paper demonstrates that in Mamba-2, single-bucket probes can detect a large functional signature (detection layer) that is not fully responsible for the actual computation (execution layer), chall…

View →

cs.CLRecentMay 31, 2026

Benchmarking Local LLMs for Natural-Language-to-SQL Querying in Biopharmaceutical Manufacturing: An Empirical Benchmark on Consumer-Grade Hardware

Sagar Bhetwal, Rajan Bastakoti, Nirajan Acharya, Gaurav Kumar Gupta

This study benchmarks four local LLMs for natural-language-to-SQL querying in biopharma manufacturing, finding that general-purpose code-tuned models like Llama 3.1 8B and Qwen 2.5 Coder 7B outperform…

View →

cs.AIcs.LGRecentMay 30, 2026

EnergyMamba: An Uncertainty-Aware Graph-Enhanced Selective State Space Model for Energy Consumption Prediction

Dahai Yu, Rongchao Xu, Lin Jiang, Guang Wang

EnergyMamba proposes an uncertainty-aware, graph-enhanced selective state space model to significantly improve both the accuracy and reliability of energy consumption prediction by explicitly modeling…

View →

cs.CVcs.AIRecentMay 29, 2026

MyoSem: Aligning Electromyography to Natural-Language Action Semantics for Hand Action Understanding

Chiyue Wang, Dong She, Yang Gao, Zhanpeng Jin

MyoSem introduces an EMG-action semantic alignment framework that transforms low-level muscle signals into a shared semantic space, enabling bidirectional retrieval between EMG data and natural langua…

View →

cs.LGcs.MAEmpiricalRecentJun 22, 2026

MAS-PromptBench: When Does Prompt Optimization Improve Multi-Agent LLM Systems?

Juyang Bai, Laixi Shi

This paper systematically studies the potential of prompt optimization in multi-agent systems (MAS) across various setups, revealing significant gains but also open challenges.

View →

cs.IRcs.AIEmpiricalRecentJul 1, 2026

MemSyco-Bench: Benchmarking Sycophancy in Agent Memory

Zhishang Xiang, Zerui Chen, Yunbo Tang, Zhimin Wei +4 more

Proposed MemSyco-Bench benchmark for evaluating memory-induced sycophancy in agent systems, measuring when and how valid memories should be used.

View →

cs.ROEmpiricalRecentJul 23, 2026

FORGE-plus: Force-Budgeted Recovery for Contact-Rich Assembly with a Frozen LLM Supervisor

Kyupaeck Jeff Rah, Midum Oh

A two-layer framework using a large language model for force-conditioned reinforce learning with recovery maneuvers and force signatures.

View →

cs.SEcs.AIRecentMay 27, 2026

DeltaMCP: Incremental Regeneration via Spec-Aware Transformation for MCP servers

Aditya Pujara, Xiaogang Zhu, Hsiang-Ting Chen

DeltaMCP is a specification-aware, incremental regeneration tool that efficiently updates Model Context Protocol (MCP) servers by only modifying affected tooling when a service's OpenAPI specification…

View →

cs.DScs.LGstat.MLRecentJun 3, 2026

A General Framework for Dynamic Consistent Submodular Maximization

Paul Dütting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard +2 more

The paper develops a general framework for dynamic consistent submodular maximization, achieving constant-factor approximations with sublinear consistency for both cardinality and rank-$k$ matroid con…

View →

cs.LGcs.AIRecentMay 28, 2026

LLMs Without Deep Neural Networks: New Architecture, Benefits and Case Study

Vincent Granville

The paper introduces a novel, non-deep neural network architecture that achieves the performance of LLMs by finding the global optimum of the loss function in a single, closed-form iteration, eliminat…

View →

cs.ROcs.AIEmpiricalRecentJul 1, 2026

FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model

Chenyang Ma, Yue Yang, Radu Corcodel, Siddarth Jain +3 more

This paper introduces FurnitureVLA, a systematic study of real-scale bimanual furniture assembly using Vision-Language-Action models, improving simulation success from 48% to 80% and reducing errors.

View →

cs.AIcs.CLcs.IRRecentMay 31, 2026

Don't Ask the LLM to Track Freshness: A Deterministic Recipe for Memory Conflict Resolution

Vikas Reddy, Sumanth Challaram

The paper proposes a deterministic, version-aware aggregation method that significantly outperforms existing LLM-based systems for resolving memory conflicts in fact consolidation tasks.

View →

cs.CRcs.AIcs.LGRecentMay 11, 2026

MambaNetBurst: Direct Byte-level Network Traffic Classification without Tokenization or Pretraining

Gayan K. Kulatilleke, Siamak Layeghy, Mahsa Baktashmotlagh, Marius Portmann

MambaNetBurst introduces a compact, tokenizer-free byte-level classifier using a Mamba-2 backbone to achieve strong network traffic classification without requiring pre-training or complex data prepro…

View →