"Deep learning" | ArxivCSExplorer

20 results for “Deep learning”

CS papers only

Hybrid search: keyword + semantic, ranked by combined score.

cs.LG, stat.ML · Theoretical · Recent · Jun 9, 2026

Limitations of Learning Tanh Neural Networks with Finite Precision

This paper investigates limitations of learning tanh neural networks under finite-precision computations and Lp accuracy guarantees.

cs.LG, cs.AI, cs.DS · Empirical · Recent · Jun 19, 2026

Breaking chains with trees: Deep learning with $\mathcal{O}(\log N)$ parallel time complexity

Neeraj Mohan Sushma, Aditya Nagarsekar, Cabrel Teguemne Fokam, Robin Schiewer +3 more

This paper proposes Hierarchical Block-Local Learning (HBLL), a framework for training deep neural networks without full end-to-end backpropagation, achieving $\mathcal{O}(\log N)$ parallel time compl…

cs.NE · Empirical · Recent · Jun 28, 2026

Supervised Hebbian learning in Deep Counterstream Associative Networks

Andreas Knoblauch

A new error backpropagation method called supervised counterstream learning is proposed for deep associative networks, which only requires recognition of errors during training and backpropagates corr…

cs.LG, math.ST, stat.ML · Theoretical · Recent · Jul 26, 2026

A Statistical Difference between Single-Layer Learning and Hierarchical Learning in Wide Neural Networks

Sumio Watanabe

This paper compares two theoretical frameworks for hierarchical neural networks with a finite but large number of hidden units and shows that training input-to-hidden weights reduces generalization er…

cs.DC, cs.LG · Empirical · Recent · Jun 29, 2026

GPU Parallelization Strategies for Forward and Backward Propagation in Shallow Neural Networks: A CUDA-Based Comparative Study

Rania Zitouni, Nadine Bousdjira, Sarah Hasnaoui, Amel Sadoun +1 more

This paper compares and optimizes CUDA strategies for a shallow neural network, achieving a 1.41x speedup on a large dataset.

cs.CV, cs.AI, cs.CR · Recent · Mar 30, 2026

Detection of Adversarial Attacks in Robotic Perception

Ziad Sharawy, Mohammad Nakshbandi, Sorin Mihai Grigorescu

This paper addresses the vulnerability of DNNs used in robotic semantic segmentation to adversarial attacks by proposing specialized detection strategies to enhance safety in robotic perception system…

cs.LG, cs.AI · Recent · May 28, 2026

LLMs Without Deep Neural Networks: New Architecture, Benefits and Case Study

Vincent Granville

The paper introduces a novel, non-deep neural network architecture that achieves the performance of LLMs by finding the global optimum of the loss function in a single, closed-form iteration, eliminat…

cs.LG, cs.AI · Recent · May 28, 2026

Automatically Differentiable Nonlinear Tensor Networks (ADNTNs) for Exponential Compression of Deep Neural Networks

Andrzej Cichocki, Michal Wietczak

The paper introduces Automatically Differentiable Nonlinear Tensor Networks (ADNTNs) to achieve massive, structured compression of deep neural networks, demonstrating compression ratios up to 77,000x…

cs.LG, stat.ML · Theoretical · Recent · Jul 28, 2026

Algorithmic Separation between Constant-Depth and Logarithmic-Depth Neural Networks

Yunwei Ren, Zihao Wang, Jason D. Lee

This paper provides the first algorithmic separation between constant-depth and logarithmic-depth networks, identifying a class of Boolean functions that logarithmic-depth networks can learn efficient…

cs.CR, cs.LG · Recent · Jun 2, 2026

Long-Term and Short-Term Transistor Aging in Deep Neural Networks: Impact and Mitigation

Alireza Sarmadi, Virinchi Roy Surabhi, Prashanth Krishnamurthy, Hussam Amrouch +2 more

This paper analyzes the impact of long-term and short-term transistor aging on Deep Neural Network (DNN) inference accuracy and proposes an aging-aware retraining methodology to maintain performance e…

cs.CC, cs.LG, cs.LO · Recent · May 28, 2026

The Complexity of Verifying Feedforward Neural Networks in Quantised Settings

Eric Alsmann, Martin Lange, Marco Sälzer

This paper analyzes the computational complexity of verifying feedforward neural networks when their weights are restricted to finite-width arithmetic, finding that verification remains NP-complete fo…

cs.LG, cs.AI · Recent · May 27, 2026

Learning Compositional Latent Structure with Vector Networks

Niclas Pokel, Benjamin F. Grewe

The paper introduces the Vector Network (VN), a novel recurrent architecture that replaces fixed weight matrices with reusable weight atoms, enabling superior compositional generalization by making st…

cs.AR, cs.LG, cs.NE · Empirical · Recent · Jul 16, 2026

Toward Energy-Efficient and Low-Power Arrhythmia Detection for Wearable Devices

Floriaan Bulten, Yawar Rasheed, Arlene John, Vincenzo Stoico +1 more

This paper proposes approximations to reduce power and energy consumption of deep learning architectures for arrhythmia detection in wearable devices, achieving 64.9% power reduction and 61.5% energy…

cs.LG · Recent · Jun 3, 2026

BBOmix: A Tabular Benchmark for Hyperparameter Optimization of Unsupervised Biological Representation Learning

Luca Thale-Bombien, Jan Ewald, Ralf König, Aaron Klein

This paper introduces BBOmix, an open-source benchmark for unsupervised representation learning on real-world biological data.

cs.CR, cs.LG · Recent · Apr 18, 2026

Towards Deep Encrypted Training: Low-Latency, Memory-Efficient, and High-Throughput Inference for Privacy-Preserving Neural Networks

Nges Brian Njungle, Eric Jahns, Michel A. Kinsy

This paper develops optimized algorithms and a pipeline architecture for high-throughput, memory-efficient batch processing of encrypted neural network inference, significantly improving performance o…

cs.LG, math.NA, math.OC · Theoretical · Recent · Jun 29, 2026

Convergence of Continual Learning in Homogeneous Deep Networks

Matan Schliserman, Gon Buzaglo, Itay Evron, Daniel Soudry

This paper characterizes continual classification in homogeneous models as sequential projections and identifies regularity properties for local linear convergence.

cs.LG, cs.AI · Recent · May 30, 2026

Memory-Efficient LLM Training with Dynamic Sparsity: From Stability to Practical Scaling

Qiao Xiao, Boqian Wu, Patrik Okanovic, Tomasz Sternal +5 more

The paper introduces Sparse Memory-Efficient Training (SMET), a method that stabilizes and optimizes Dynamic Sparse Training (DST) for large language models, enabling stable and memory-efficient spars…
