Papers similar to 2605.29467

~ similar to 2605.29467· 18 results

cs.LG · math.ST · stat.ME · Recent · Jun 1, 2026

Network Learning with Semi-relaxed Gromov-Wasserstein

Charles Dufour, Ulysse Naepels, Leonardo V. Santoro

The paper proposes a semi-relaxed Gromov-Wasserstein objective to estimate the latent connectivity structure of large-scale networks, achieving statistically consistent and efficient recovery of the u…

cs.LG · cs.AI · Recent · May 31, 2026

Beyond Task-Agnostic: Task-Aware Grouping for Communication-Efficient Multi-Task MoE Inference

Zhiyao Xu, Aoxue Liu, Zhanjie Ding, Dan Zhao +2 more

The paper proposes Task-Aware Coactivation Grouping (TACG) to significantly reduce communication costs in multi-task MoE inference by grouping experts based on task-specific co-activation patterns, ou…

cs.LG · cs.AI · Recent · Jun 1, 2026

Variational Learning for Insertion-based Generation

Yangtian Zhang, Zhe Wang, Arthur Gretton, Rex Ying +3 more

The paper introduces the Insertion Process (IP), a novel stochastic generative model that learns variable-length, non-monotonic sequence generation by explicitly modeling the insertion order of tokens…

cs.LG · cs.AI · Recent · May 28, 2026

Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting

Amirhossein Ghaffari, Saeid Sheikhi, Ekaterina Gilman

The paper proposes GC-MoE, a graph-conditioned Mixture of Experts framework, to improve traffic forecasting by assigning personalized, specialized forecasting experts to individual road segments.

cs.LG · cs.AI · Recent · Jun 1, 2026

VLBM: Variational Latent Basis Modeling for OOD Robust Multivariate Time Series Forecasting

Xudong Zhang, Jierui Lei, Jiacheng Li, Lingdong Shen +2 more

The paper proposes VLBM, a latent basis modeling framework, to achieve state-of-the-art robustness in multivariate time series forecasting, particularly when facing rare but high-impact out-of-distrib…

cs.LG · cs.AI · Recent · May 27, 2026

Learning Compositional Latent Structure with Vector Networks

Niclas Pokel, Benjamin F. Grewe

The paper introduces the Vector Network (VN), a novel recurrent architecture that replaces fixed weight matrices with reusable weight atoms, enabling superior compositional generalization by making st…

math.ST · stat.ME · stat.ML · Theoretical · Recent · Jun 9, 2026

Conformal Prediction for Dyadic Regression Under Complex Missingness

Robert Lunde, Minjie Yang, Elizaveta Levina, Ji Zhu

This paper develops a framework for conformal prediction in dyadic regression problems under complex missingness mechanisms.

cs.CR · cs.LG · Recent · Jun 2, 2026

Bayesian Membership Privacy for Graph Neural Networks

Sinan Yıldırım, Megha Khosla

The paper introduces Bayesian Membership Privacy (BMP), a sampling-aware framework that accurately quantifies node-level membership privacy in Graph Neural Networks by treating graph sampling probabil…

cs.CR · cs.AI · cs.LG · Recent · May 27, 2026

Mind the Gap: Mixtures of Gaussians in Approximate Differential Privacy

Huikang Liu, Aras Selvi, Wolfram Wiesemann

The paper introduces 'mixture mechanisms,' a novel class of additive noise mechanisms that achieve approximate differential privacy by mixing multiple Gaussian distributions, resulting in lower noise…

cs.CR · cs.AI · cs.LG · Recent · May 27, 2026

Mind the Gap: Mixtures of Gaussians in Approximate Differential Privacy

Huikang Liu, Aras Selvi, Wolfram Wiesemann

The paper introduces 'mixture mechanisms,' a novel class of additive noise mechanisms that achieve differential privacy for real-valued queries, significantly reducing noise compared to the standard G…

cs.AI · Recent · May 28, 2026

NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs

Shuaidi Wang, Zhan Zhuang, Ruping Huang, Yu Zhang

The paper introduces NaRA, a noise-aware LoRA technique that dynamically adapts fine-tuning parameters based on the noise level during diffusion, significantly improving the performance of Diffusion L…

cs.LG · cs.AI · Recent · Jun 1, 2026

ProbMoE: Differentiable Probabilistic Routing for Mixture-of-Experts

Heng Zhao, Zilei Shao, Guy Van den Broeck, Zhe Zeng

The paper introduces ProbMoE, a probabilistic routing framework that tackles the non-differentiability of top-$k$ routing in Mixture-of-Experts (MoE) models, achieving strong performance with improved…

cs.LG · stat.ML · Recent · Jun 3, 2026

Graph Cascades: Contagion-Based Mesoscopic Rewiring for Structure-Aware Graph Machine Learning

Meher Chaitanya, My Le, Luana Ruiz

The paper introduces Graph Cascades, a mesoscopic rewiring technique that enhances Graph Neural Networks by promoting node pairs with strong multi-hop connections to direct edges, improving performanc…

cs.CR · Recent · Apr 26, 2026

Rényi Pufferfish Privacy with Gaussian-based Priors: From Single Gaussian to Mixture Model

Wenjin Yang, Ni Ding, Zijian Zhang, Zhen Li +4 more

This paper develops improved Gaussian mechanisms for Rényi Pufferfish Privacy (RPP) by incorporating Gaussian and Gaussian-mixture priors, significantly reducing the required noise and improving the p…

cs.AI · cs.LG · Recent · Jun 1, 2026

Evidence-Gated LLM Priors for Multi-Objective Bayesian Optimization

Jiangyu Chen, Banyi

The paper proposes an objective-wise reputation-market mechanism to dynamically calibrate and gate LLM-generated expert priors in multi-objective Bayesian optimization, showing that dynamic calibratio…

cs.LG · cs.AI · Recent · May 28, 2026

The Little Book of Generative AI Foundations: An Intuitive Mathematical Primer

Tianhua Chen

This book provides a compact, derivation-oriented mathematical primer that connects major families of generative AI models, showing their underlying structural relationships.

cs.CR · cs.DS · Recent · Apr 30, 2026

Variational and Majorization Principles in Lattice Reduction

Javier Blanco-Romero, Florina Almenares Mendoza

The paper uses majorization theory to analyze lattice reduction, showing that local swaps smooth the Gram-Schmidt profile and deriving variational and telescoping identities for the worst-case profile…
