ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:

20 results for “Backpropagation through time (BPTT)”

CS papers only

Hybrid search: Keyword + semantic, ranked by combined score.ⓘ

Want pure semantic search? Try claim verification →

cs.LGcs.AIEmpiricalComprehensiveRecentJun 4, 2026

Pretraining Recurrent Networks without Recurrence

Akarsh Kumar, Phillip Isola

This paper proposes Supervised Memory Training (SMT), a method for training nonlinear RNNs that sidesteps recurrent credit propagation entirely.

View →
q-bio.NCcs.AIRecentMay 27, 2026

Misalignment Between Backpropagation and the Hierarchy of Brain Responses to Images

Joséphine Raugel, Maximilian Seitzer, Marc Szafraniec, Huy V. Vo +5 more

While backpropagated gradients can predict human brain activity in the visual cortex, their spatial and temporal organization fundamentally diverges from the expected patterns of a biologically plausi…

View →
cs.LGcs.AIRecentMay 27, 2026

On the Learnability of Test-Time Adaptation: A Recovery Complexity Perspective

Zhi Zhou, Ming Yang, Shi-Yu Tian, Kun-Yang Yu +2 more

The paper establishes the first theoretical framework for analyzing the learnability of Test-Time Adaptation (TTA) under non-stationary data streams by introducing Recovery Complexity, which quantifie…

View →
cs.LGcs.AIRecentMay 28, 2026

Test Time Training for Supervised Causal Learning

Zizhen Deng, Jiaru Zhang, Rui Ding, Huang Bojun +4 more

The paper proposes Test-Time Training for Supervised Causal Learning (TTT-SCL), a novel framework that dynamically generates training data aligned with specific test instances to significantly improve…

View →
cs.LGcs.AIcs.CLRecentJun 3, 2026

Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)

Nizar Islah, Istabrak Abbes, Irina Rish, Sarath Chandar +1 more

This paper proposes a method to recover recoverability structure from failed traces of post-trained language models, enabling test-time routing and post-training analysis.

View →
cs.AIRecentMay 27, 2026

The Shape of Overthinking: Backtracking Bursts in Long Reasoning Traces

Navid Rezazadeh, Arash Gholami Davoodi

The paper analyzes backtracking dynamics in long reasoning traces to distinguish between useful self-correction and unproductive revision, finding that correct reasoning exhibits early, isolated repai…

View →
cs.CLcs.AIRecentMay 31, 2026

TimeSage-MT: A Multi-Turn Benchmark for Evaluating Agentic Time Series Reasoning

Yaxuan Kong, Qingren Yao, Yuqi Nie, Yichen Li +6 more

The paper introduces TimeSage-MT, a comprehensive multi-turn benchmark designed to rigorously test an LLM agent's ability to perform complex, evolving time series analysis, revealing critical gaps in…

View →
cs.LGcs.AIRecentMay 27, 2026

TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints

Abhijit Chakraborty, Suddhasvatta Das, Yash Shah, Vivek Gupta +1 more

TIMEGATE introduces a resource-aware policy layer that manages continual ML adaptation by dynamically budgeting time and evaluation resources, achieving significant compute and energy savings without…

View →
cs.AIcs.HCcs.LGRecentMay 27, 2026

CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models

Abhilash Durgam, Nyle Siddiqui, Jeffrey A. Chan-Santiago, Qiushi Fu +2 more

CaMBRAIN introduces a novel Mamba-based State Space Model (SSM) for real-time, continuous EEG inference, achieving state-of-the-art results with significantly higher throughput than existing methods.

View →
cs.CRcs.LGRecentApr 24, 2026

Self-Supervised Learning for Android Malware Detection on a Time-Stamped Dataset

Annan Fu, Hao Pei, Maryam Tanha

The paper proposes a time-aware self-supervised learning framework using BYOL to improve Android malware detection robustness by accurately accounting for app release times.

View →
cs.AIcs.LGRecentMay 30, 2026

SHARP: Sleep-based Hierarchical Accelerated Replay for Long Range Non-Stationary Temporal Pattern Recognition

Jayanta Dey, Shikhar Srivastava, Itamar Lerner, Christopher Kanan +1 more

SHARP proposes a novel sleep-based hierarchical replay framework to efficiently learn long-range non-stationary temporal patterns in streaming data, achieving improved context retention and predictive…

View →
cs.LGcs.AIRecentMay 28, 2026

The Little Book of Generative AI Foundations: An Intuitive Mathematical Primer

Tianhua Chen

This book provides a compact, derivation-oriented mathematical primer that connects major families of generative AI models, showing their underlying structural relationships.

View →
cs.CRcs.AIcs.LGRecentMay 21, 2026

TimeGuard: Channel-wise Pool Training for Backdoor Defense in Time Series Forecasting

Quang Duc Nguyen, Siyuan Liang, Yiming Li, Fushuo Huo +1 more

The paper proposes TimeGuard, a novel channel-wise pool training defense, to significantly improve the robustness of time series forecasting against backdoor attacks by addressing signal dilution and…

View →
eess.SYcs.LGRecentJun 1, 2026

Physics-Guided Recurrent State-Space Neural Networks for Multi-Step Prediction

Ruiyuan Li, Ajay Seth, Manon Kok

The paper proposes PG-RSSNN, a physics-guided recurrent state-space neural network that improves multi-step prediction stability and accuracy compared to both pure black-box and pure physics models, e…

View →
cs.LGcs.AIRecentJun 1, 2026

Why Do Time Series Models Need Long Context Windows?

Luca Butera, Giovanni De Felice, Andrea Cini, Cesare Alippi

The paper argues that long context windows are necessary for time series forecasting not just to capture long-range dependencies, but primarily to reduce uncertainty about the underlying data-generati…

View →
cs.LGcs.AIRecentMay 27, 2026

QuITE: Query-Based Irregular Time Series Embedding

JungHoon Lim

The paper introduces QuITE, a plug-and-play embedding module that uses learnable query tokens to effectively embed irregular multivariate time series data into latent representations compatible with e…

View →
cs.AIRecentMay 28, 2026

BitTP: The Lightweight Trajectory Prediction Model with BitLLM for Edge-Devices

Mincheol Kang, Hyunjin Lim, Bomin Kang, Daehee Park

The paper proposes BitTP, a lightweight bitlinear architecture that quantizes LLM-based trajectory predictors to 1.58-bit weights while keeping activations full-precision, enabling high-performance de…

View →
cs.CRRecentApr 2, 2026

Spike-PTSD: A Bio-Plausible Adversarial Example Attack on Spiking Neural Networks via PTSD-Inspired Spike Scaling

Lingxin Jin, Wei Jiang, Maregu Assefa Habtie, Letian Chen +4 more

The paper introduces Spike-PTSD, a novel, biologically inspired adversarial attack framework that successfully compromises the robustness of Spiking Neural Networks (SNNs) by modeling abnormal neural…

View →
cs.LGcs.AIRecentMay 31, 2026

ChronosAD: Leveraging Time Series Foundation Models for Accurate Anomaly Detection

Uzair Khan, Luigi Capogrosso, Francesco Biondani, Michele Magno +3 more

ChronosAD introduces a novel architecture that uses time series foundation models and a custom Temporal Block to achieve robust and highly accurate anomaly detection across diverse domains.

View →
cs.CLcs.AIRecentJun 1, 2026

A Primer in Post-Training Reasoning Data: What We Know About How It Works

Yaoming Li, Guangxiang Zhao, Qilong Shi, Lin Sun +2 more

This paper synthesizes over 150 scattered studies and reports to provide the first comprehensive primer on post-training reasoning data, organizing the field around data objects, utility, construction…

View →