ArXivCSExplorer
Built by Teycir Ben Soltane

20 results for “Mamba SSM”

CS papers only

Hybrid search: keyword + semantic, ranked by combined score.

cs.AI, cs.CL, cs.LG • Recent • Jun 1, 2026

Forget Attention: Importance-Aware Attention Is All You Need

Soohyeong Shin, Yeongwook Yang

The paper proposes SISA (SSM-Informed Softmax Attention), a novel hybrid attention mechanism that integrates state-space model (SSM) importance signals directly into the attention score, achieving sta…

cs.CV, cs.AI • Recent • May 29, 2026

Zamba2-VL Technical Report

Hassan Shapourian, Kasra Hejazi, Olabode M. Sule, Beren Millidge

Zamba2-VL is a new suite of vision-language models built on the Zamba2 hybrid architecture, achieving state-of-the-art performance and significantly improved inference efficiency compared to leading T…

cs.CV, cs.CR, cs.SI • Recent • May 14, 2026

Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation

Mamadou Keita, Wassim Hamidouche, Hessen Bougueffa Eutamene, Abdelmalik Taleb-Ahmed +2 more

This study systematically evaluates Vision Mamba models for detecting AI-generated images, finding that while they show promise, their current strengths and limitations must be understood relative to…

cs.AI • Recent • Jun 1, 2026

Physically-Constrained Mamba-SDE for Remaining Useful Life Prediction under Irregular Observations

Deyu Zhuang, Peiliang Gong, Yang Shao, Liyuan Shu +3 more

The paper proposes PC-MambaSDE, a physically-constrained continuous-time framework that accurately predicts Remaining Useful Life (RUL) despite irregular sensor observations and ensures physically pla…

cs.CV, cs.AI • Recent • Jun 1, 2026

RPCASSM: Robust PCA State Space Model For Infrared Small Target Detection

Pingping Liu, Aohua Li, Yubing Lu, Jin Kuang +2 more

The paper proposes RPCASSM, a novel state space model leveraging Robust PCA (RPCA) to accurately detect and segment infrared small targets by separately modeling background and target information base…

cs.CL, cs.AI, cs.LG • Recent • May 30, 2026

Detection vs. Execution: Single-Bucket Probes Miss Half the Mamba-2 State Sink

Yuhang Jiang

The paper demonstrates that in Mamba-2, single-bucket probes can detect a large functional signature (detection layer) that is not fully responsible for the actual computation (execution layer), chall…

cs.CL • Recent • May 31, 2026

Benchmarking Local LLMs for Natural-Language-to-SQL Querying in Biopharmaceutical Manufacturing: An Empirical Benchmark on Consumer-Grade Hardware

Sagar Bhetwal, Rajan Bastakoti, Nirajan Acharya, Gaurav Kumar Gupta

This study benchmarks four local LLMs for natural-language-to-SQL querying in biopharma manufacturing, finding that general-purpose code-tuned models like Llama 3.1 8B and Qwen 2.5 Coder 7B outperform…

cs.AI, cs.LG • Recent • May 30, 2026

EnergyMamba: An Uncertainty-Aware Graph-Enhanced Selective State Space Model for Energy Consumption Prediction

Dahai Yu, Rongchao Xu, Lin Jiang, Guang Wang

EnergyMamba proposes an uncertainty-aware, graph-enhanced selective state space model to significantly improve both the accuracy and reliability of energy consumption prediction by explicitly modeling…

cs.CR • Recent • Apr 13, 2026

Short Message Service (SMS) Phishing Attacks and Defenses: A Systematic Review

Mir Mehedi A. Pritom, Seyed Mohammad Sanjari, Maraz Mia, Ashfak Md Shibli +3 more

This systematic review analyzes the current state of SMS phishing (smishing) attacks and defenses, organizing existing research into four pillars to identify gaps and propose future mitigation strateg…

cs.CV, cs.AI • Recent • May 29, 2026

MyoSem: Aligning Electromyography to Natural-Language Action Semantics for Hand Action Understanding

Chiyue Wang, Dong She, Yang Gao, Zhanpeng Jin

MyoSem introduces an EMG-action semantic alignment framework that transforms low-level muscle signals into a shared semantic space, enabling bidirectional retrieval between EMG data and natural langua…

cs.SE, cs.AI • Recent • May 27, 2026

DeltaMCP: Incremental Regeneration via Spec-Aware Transformation for MCP servers

Aditya Pujara, Xiaogang Zhu, Hsiang-Ting Chen

DeltaMCP is a specification-aware, incremental regeneration tool that efficiently updates Model Context Protocol (MCP) servers by only modifying affected tooling when a service's OpenAPI specification…

cs.DS, cs.LG, stat.ML • Recent • Jun 3, 2026

A General Framework for Dynamic Consistent Submodular Maximization

Paul Dütting, Federico Fusco, Silvio Lattanzi, Ashkan Norouzi-Fard +2 more

The paper develops a general framework for dynamic consistent submodular maximization, achieving constant-factor approximations with sublinear consistency for both cardinality and rank-$k$ matroid con…

cs.LG, cs.AI • Recent • May 28, 2026

LLMs Without Deep Neural Networks: New Architecture, Benefits and Case Study

Vincent Granville

The paper introduces a novel, non-deep neural network architecture that achieves the performance of LLMs by finding the global optimum of the loss function in a single, closed-form iteration, eliminat…

cs.AI, cs.CL, cs.IR • Recent • May 31, 2026

Don't Ask the LLM to Track Freshness: A Deterministic Recipe for Memory Conflict Resolution

Vikas Reddy, Sumanth Challaram

The paper proposes a deterministic, version-aware aggregation method that significantly outperforms existing LLM-based systems for resolving memory conflicts in fact consolidation tasks.

cs.CR, cs.AI, cs.LG • Recent • May 11, 2026

MambaNetBurst: Direct Byte-level Network Traffic Classification without Tokenization or Pretraining

Gayan K. Kulatilleke, Siamak Layeghy, Mahsa Baktashmotlagh, Marius Portmann

MambaNetBurst introduces a compact, tokenizer-free byte-level classifier using a Mamba-2 backbone to achieve strong network traffic classification without requiring pre-training or complex data prepro…

cs.CL, cs.AI, cs.CY • Recent • May 31, 2026

Implicit Geographic Inference in LLM Medical Triage: Language-Driven Disparities in Emergency Recommendations

Qi Han Wong

The study demonstrates that LLMs exhibit significant, language-driven disparities in medical triage recommendations, recommending emergency care more frequently for English and Arabic prompts, even wh…

cs.CC, q-bio.QM • Recent • Jun 1, 2026

Structure-Informed Multiple Sequence Alignment: A Formal Model and Hardness Results

Yoshiki Kanazawa, Naphan Benchasattabuse, Michal Hajdušek, Rodney Van Meter

The paper formally models structure-informed multiple sequence alignment (MSA-S) as an NP-complete optimization problem, establishing a strong computational complexity baseline for the field.

cs.LO, cs.AI • Recent • May 27, 2026

Token Optimization Strategies for LLM-Based Oracle-to-PostgreSQL Migration

Oleg Grynets, Dmytro Babarytskyi, Vasyl Lyashkevych

This paper formalizes token optimization as a multi-objective constrained transformation problem for LLM-based Oracle-to-PostgreSQL migration, demonstrating that adaptive routing offers the best balan…

cs.AI, cs.CL • Recent • May 28, 2026

Demystifying Data Organization for Enhanced LLM Training

Yalun Dai, Yangyu Huang, Tongshen Yang, Yonghan Wang +7 more

This paper proposes four guidelines and two novel data ordering methods (STR and SAW) to systematically optimize data organization, significantly enhancing the stability and performance of LLM trainin…

cs.CR • Recent • May 7, 2026

ClawGuard: Out-of-Band Detection of LLM Agent Workflow Hijacking via EM Side Channel

Leo Linqian Gan, Jeffery Wu, Longyuan Ge, Lanqing Yang +5 more

ClawGuard introduces a passive, out-of-band security monitor that detects LLM agent workflow hijacking by analyzing unique electromagnetic (EM) emanations generated during agent skill execution.
