Papers similar to 2606.01339

~ similar to 2606.01339· 18 results

cs.LGcs.AIRecentJun 1, 2026

Why Do Time Series Models Need Long Context Windows?

Luca Butera, Giovanni De Felice, Andrea Cini, Cesare Alippi

The paper argues that long context windows are necessary for time series forecasting not just to capture long-range dependencies, but primarily to reduce uncertainty about the underlying data-generati…

View →

cs.LGcs.AIRecentMay 27, 2026

Online Irregular Multivariate Time Series Forecasting via Uncertainty-Driven Dual-Expert Calibration

Haonan Wen, Hanyang Chen, Songhe Feng

The paper proposes Under-Cali, an uncertainty-driven dual-expert calibration framework, to achieve stable and efficient online forecasting for irregularly sampled multivariate time series.

View →

cs.LGcs.AIRecentMay 27, 2026

QuITE: Query-Based Irregular Time Series Embedding

JungHoon Lim

The paper introduces QuITE, a plug-and-play embedding module that uses learnable query tokens to effectively embed irregular multivariate time series data into latent representations compatible with e…

View →

cs.CLcs.AIRecentJun 1, 2026

SimSD: Simple Speculative Decoding in Diffusion Language Models

Junxia Cui, Haotian Ye, Runchu Tian, Hongcan Guo +8 more

The paper proposes SimSD, a plug-and-play speculative decoding algorithm that adapts diffusion language models (dLLMs) to achieve fast, token-level acceleration by restoring causal masking capabilitie…

View →

cs.AIRecentMay 28, 2026

NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs

Shuaidi Wang, Zhan Zhuang, Ruping Huang, Yu Zhang

The paper introduces NaRA, a noise-aware LoRA technique that dynamically adapts fine-tuning parameters based on the noise level during diffusion, significantly improving the performance of Diffusion L…

View →

cs.AIRecentJun 1, 2026

Bridging the Last Mile of Time Series Forecasting with LLM Agents

Yuhua Liao, Zetian Wang, Qiangqiang Nie, Zhenhua Zhang

The paper introduces an LLM-agent framework to solve the 'last-mile forecasting' problem, bridging the gap between raw statistical predictions and business-ready forecasts by incorporating weakly stru…

View →

cs.LGcs.AIRecentMay 28, 2026

Beyond MSE: Improving Precipitation Nowcasting with Multi-Quantile Regression

Gijs van Nieuwkoop, Siamak Mehrkanoon

The paper demonstrates that replacing standard pointwise losses (like MSE) with multi-quantile regression significantly improves precipitation nowcasting accuracy and provides valuable risk estimates…

View →

cs.LGcs.AIRecentMay 28, 2026

HARP: Hadamard-Preconditioned Adaptive Rotation Processor for Extreme LLM Quantization

Artur Zagitov, Gleb Molodtsov, Aleksandr Beznosikov

HARP introduces a novel, adaptive, learnable orthogonal processor that significantly improves the robustness and accuracy of extreme low-bit LLM quantization compared to fixed methods.

View →

cs.LGcs.AIRecentMay 29, 2026

GPU Forecasters: Language Models as Selective Surrogates for Kernel Runtime Optimization

Zaid Khan, Justin Chih-Yao Chen, Jaemin Cho, Elias Stengel-Eskin +1 more

This paper demonstrates that Large Language Models (LLMs) can serve as accurate and selective surrogates for costly GPU kernel performance measurements, significantly expanding the search space for op…

View →

cs.LGcs.AIRecentJun 1, 2026

VLBM: Variational Latent Basis Modeling for OOD Robust Multivariate Time Series Forecasting

Xudong Zhang, Jierui Lei, Jiacheng Li, Lingdong Shen +2 more

The paper proposes VLBM, a latent basis modeling framework, to achieve state-of-the-art robustness in multivariate time series forecasting, particularly when facing rare but high-impact out-of-distrib…

View →

cs.AIcs.LGRecentMay 30, 2026

SHARP: Sleep-based Hierarchical Accelerated Replay for Long Range Non-Stationary Temporal Pattern Recognition

Jayanta Dey, Shikhar Srivastava, Itamar Lerner, Christopher Kanan +1 more

SHARP proposes a novel sleep-based hierarchical replay framework to efficiently learn long-range non-stationary temporal patterns in streaming data, achieving improved context retention and predictive…

View →

cs.CRcs.LGRecentMay 21, 2026

Decision-Aware Quadratic ReLU Replacement for HE-Friendly Inference

Rui Li, Wenyuan Wu, Weijie Miao

The paper proposes a decision-aware quadratic replacement for the ReLU activation function, enabling low-degree, calibration-lossless polynomial approximations for neural network inference under Fully…

View →

cs.CLcs.LGRecentMay 29, 2026

Cognitive Fatigue in Autoregressive Transformers: Formalization and Measurement

Riju Marwah, Ritvik Garimella, Vishal Pallagani, Atishay Jain +2 more

The paper formalizes LLM degradation during long generation as 'cognitive fatigue' and introduces the Fatigue Index (FI), a measurable, model-agnostic diagnostic tool for real-time monitoring.

View →

cs.LGcs.AIstat.MLRecentJun 3, 2026

AdaKoop: Efficient Modeling of Nonlinear Dynamics from Nonstationary Data Streams with Koopman Operator Regression

Naoki Chihara, Ren Fujiwara, Yasuko Matsubara, Yasushi Sakurai

AdaKoop introduces an efficient streaming algorithm that models complex nonlinear dynamics from nonstationary data streams by leveraging the Koopman operator theory, achieving state-of-the-art accurac…

View →

cs.LGcs.AIRecentJun 1, 2026

FLARE: Diffusion for Hybrid Language Model

Yuchen Zhu, Jing Shi, Chongjian Ge, Hao Tan +8 more

FLARE is a systematic conversion framework that enables a single checkpoint to support both autoregressive (AR) and diffusion-style parallel decoding for hybrid-attention large language models, achiev…

View →

cs.LGcs.AIRecentMay 29, 2026

BudgetDraft: Acceptance-Aware Multi-View Training for Sparse-KV Speculative Decoding

Liang He, Jingbo Wen, Qishi Zhan, Yixiong Chen +3 more

BudgetDraft introduces an acceptance-aware multi-view training method that trains a sparse-KV speculative decoder to maintain high acceptance rates across varying context lengths and sparsity levels,…

View →

stat.MLcs.AIcs.LGRecentMay 29, 2026

Entropic Projection Alignment: Estimating, Explaining, and Improving Model Performance Under Distribution Shift

Salim I. Amoukou, Emanuele Albini, Tom Bewley, Saumitra Mishra +1 more

The paper introduces Entropic Projection Alignment (EPA), a unified framework that estimates, explains, and improves model performance under distribution shift by aligning source and target distributi…

View →

cs.AIcs.LGRecentMay 27, 2026

Adaptive Reservoir Computing for Multi-Scenario Chaotic System Forecasting

Shadmehr Zaregarizi, Khashayar Yavari

The paper introduces an adaptive reservoir computing framework that tailors Echo State Networks (ESNs) to specific evaluation scenarios, achieving a high score on the CTF-4-Science Lorenz benchmark fo…

View →