Papers similar to 2606.05140

~ similar to 2606.05140· 18 results

cs.NEmath.APmath.PRRecentJun 4, 2026

Quantifying Uncertainty In Wide Two-Layer Neural Networks: On The Law Of The Limiting Fluctuation Process

Arnaud Descours, Arnaud Guillin, Geoffrey Lacour, Manon Michel +2 more

This paper develops a novel, computationally efficient method to quantify the uncertainty in wide neural network predictions by characterizing the limiting random fluctuations using stochastic evoluti…

View →

cond-mat.dis-nnquant-phstat.MLRecentJun 4, 2026

Nonreversible Gauge Fields in Fokker--Planck Dynamics: Supersymmetric Hamiltonians and Learned Finite Forces

Masayuki Ohzeki

The paper reformulates nonreversible perturbations of Fokker--Planck dynamics as gauge fields, providing a unified operator viewpoint to analyze relaxation processes and develop methods for learning o…

View →

cs.LGcs.AImath.DSRecentMay 27, 2026

The Hamilton-Jacobi Theory of Deep Learning

Jose Marie Antonio Miñoza, Erika Fille T. Legara, Christopher P. Monterola

This paper establishes an exact mathematical correspondence between training and inference in deep learning and the solution of Hamilton-Jacobi partial differential equations, unifying multiple theore…

View →

stat.MLcs.CRcs.LGRecentMay 22, 2026

On the Stability of Spherical Hellinger-Kantorovich Flows and Their Implications for Differential Privacy

Aratrika Mustafi, Soumya Mukherjee

This paper develops a perturbation theory for spherical Hellinger-Kantorovich (SHK) gradient flows, providing explicit, time-dependent bounds on divergence metrics to guarantee differential privacy fo…

View →

math.STcs.CCcs.DSRecentMay 28, 2026

Low-degree estimation thresholds in planted hypergraphs and tensor PCA

Daniel Fu, Youngtak Sohn

The paper analyzes low-degree estimation thresholds for recovering hidden signals in planted hypergraphs and tensor PCA, establishing sharp phase transitions and providing polynomial-time recovery alg…

View →

cs.CRcs.DSRecentApr 30, 2026

Variational and Majorization Principles in Lattice Reduction

Javier Blanco-Romero, Florina Almenares Mendoza

The paper uses majorization theory to analyze lattice reduction, showing that local swaps smooth the Gram-Schmidt profile and deriving variational and telescoping identities for the worst-case profile…

View →

math.NAcs.LGRecentJun 1, 2026

Spectral Audit of In-Context Operator Networks

Zhiwei Gao, Liu Yang, George Em Karniadakis

The paper introduces a Jacobian-based spectral audit to evaluate neural operators, demonstrating that standard prediction error metrics fail to capture crucial local dynamical structures and operator…

View →

cs.CRcs.CCRecentJun 2, 2026

Collision Resistance of Single-Layer Neural Nets

Marco Benedetti, Andrej Bogdanov, Enrico M. Malatesta, Marc Mézard +4 more

The paper analyzes the algorithmic complexity of finding collisions in single-layer binary neural networks, establishing that the collision resistance depends critically on the activation function's t…

View →

cs.CRRecentMay 7, 2026

$α$-Wasserstein Mechanism for Rényi Pufferfish Privacy

Ni Ding, Wenjin Yang, Zijian Zhang

The paper introduces the $\alpha$-Wasserstein mechanism to achieve Rényi Pufferfish Privacy using Laplace and Gaussian noise, demonstrating that it generalizes existing privacy frameworks and reduces…

View →

cs.DScs.CRmath.NTRecentMay 17, 2026

Module Lattice Security (Part III): Structured CVP Distance on the Log-Unit Lattice

Ming-Xing Luo

The paper analyzes the structured CVP distance on the log-unit lattice of cyclotomic fields, significantly reducing the conjectured CDPR factor for the ML-KEM cryptosystem from exponential to sub-poly…

View →

cs.LGcs.CLcs.CVRecentJun 2, 2026

Neuron Populations Exhibit Divergent Selectivity with Scale

Amil Dravid, Yasaman Bahri, Alexei A. Efros, Yossi Gandelsman

The study finds that specific, interpretable neuron populations (Rosetta Neurons) exhibit predictable, scale-dependent changes in selectivity and specialization as neural models grow larger.

View →

cs.LGcs.AImath.OCRecentMay 28, 2026

Singularity-aware Optimization via Randomized Geometric Probing: Towards Stable Non-smooth Optimization

Ruoran Xu, Borong She, Xiaobo Jin, Qiufeng Wang

The paper introduces Singularity-aware Adam (S-Adam), a novel optimizer that stabilizes deep learning training in non-smooth loss landscapes by dynamically damping updates based on local geometric ins…

View →

cs.CRRecentApr 19, 2026

Breaking Euston: Recovering Private Inputs from Secure Inference by Exploiting Subspace Leakage

Jiaqi Zhao, Fengwei Wang

This paper demonstrates that the Euston secure inference framework, which uses SVD-based matrix transmission to save bandwidth, leaks private input data by exploiting subspace leakage of random masks.

View →

cs.DScs.CCmath.CORecentMay 29, 2026

High-Dimensional Expanders, the Sparsest Cut Problem, and Steurer's Conjecture

Farzam Ebrahimnejad, Shayan Oveis Gharan

The paper refutes Steurer's conjecture regarding the existence of large constant-separated sets within families of unit-norm vectors with low average correlation, using high-dimensional expanders to s…

View →

cs.LGRecentJun 1, 2026

Why Are DMD Students Lazy? Understanding the Copying Behavior in Few-Step Distillation

Shucheng Li, Iolo Jones, Alexander Tong, Michael M. Bronstein

This paper investigates the phenomenon of 'copying' in Distribution Matching Distillation (DMD), finding that high-dimensional distillation causes student models to spontaneously reproduce the teacher…

View →

math.ATcs.CGmath-phRecentMay 27, 2026

Gauge Geometry of Hodge Zero-Mode Transport in Parameter-Dependent Topological Data Analysis

Satoshi Kanno, Rei Nishimura, Hiroshi Yamauchi, Yoshi-aki Shimada

The paper introduces a computational framework using Hodge zero-modes to track the geometry of topological features in parameter-dependent data, providing metrics like curvature and holonomy to quanti…

View →

stat.MLcs.AIcs.LGRecentMay 29, 2026

Interpreting FCDNNs via RG on Exponential Family

Fuzhou Gong, Zigeng Xia

The paper establishes that the training process of fully connected deep neural networks (DNNs) on exponential family data is mathematically equivalent to performing a Renormalization Group (RG) calcul…

View →

cs.LGcs.AIcs.CCRecentMay 28, 2026

Revisiting Padded Transformer Expressivity: Which Architectural Choices Matter and Which Don't

Anej Svete, William Merrill, Ryan Cotterell, Ashish Sabharwal

The paper analyzes the expressivity of padded transformers, proving that their computational power is primarily determined by model depth and numeric precision, rather than attention type or width.

View →