Limitations of Learning Tanh Neural Networks with Finite Precision
This paper investigates the limitations of learning tanh neural networks from point evaluations under finite-precision computations and $L^p$ accuracy guarantees.
It extends previous results for ReLU networks to the tanh setting and introduces a novel construction of sharply localized bump functions via iterated tanh activations.
Before reading this…
To understand this paper, make sure you know these concepts first:
- Neural networks
- Finite-precision computations
- $L^p$ accuracy guarantees
Abstract
We investigate limitations of learning $\tanh$ neural networks from point evaluations under finite-precision computations and $L^p$ accuracy guarantees, building on Berner, Grohs, and Voigtländer (2023). Our approach is based on a novel construction of sharply localized bump functions via iterated $\tanh$ activations. Using this mechanism, we show that, in a finite-precision setting, no adaptive randomized algorithm based on $m$ samples can achieve a convergence rate higher than the Monte Carlo rate $O(m^{-1/p})$ in the $L^p$ norm, unless the sampling budget grows exponentially with the size of the network parameters and architecture. The results reveal fundamental limitations imposed by finite precision on the learnability of classes containing localized bump functions, extending previous results for ReLU networks to the $\tanh$ setting.
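To make the idea of a localized bump built from $\tanh$ units more concrete, here is a minimal Python sketch. It is an illustration only and not the construction used in the paper: the function names `tanh_bump` and `iterated_tanh`, and the parameters `center`, `width`, `sharpness`, `scale`, and `depth`, are hypothetical choices made for this example. The sketch shows two things: a difference of two shifted $\tanh$ units is close to 1 on a small interval and close to 0 elsewhere, and repeatedly applying $\tanh$ with a gain larger than 1 steepens the transition.

```python
import math


def tanh_bump(x, center=0.0, width=0.1, sharpness=50.0):
    """Difference of two shifted tanh units (illustrative, not the paper's construction).

    The value is close to 1 for x inside [center - width, center + width]
    and close to 0 far outside that interval.
    """
    left = math.tanh(sharpness * (x - (center - width)))
    right = math.tanh(sharpness * (x - (center + width)))
    return 0.5 * (left - right)


def iterated_tanh(x, scale=4.0, depth=3):
    """Apply t -> tanh(scale * t) `depth` times.

    For scale > 1, nonzero inputs are pushed toward the nonzero fixed
    points of t = tanh(scale * t), so the step around 0 gets steeper
    as depth grows.
    """
    for _ in range(depth):
        x = math.tanh(scale * x)
    return x


if __name__ == "__main__":
    # Inside the bump the value is near 1; far away it is exactly 0.0 in
    # double precision, because both tanh terms round to 1.0.
    print(tanh_bump(0.0), tanh_bump(1.0))
    # More iterations give a steeper step at the same input.
    print(iterated_tanh(0.05, depth=1), iterated_tanh(0.05, depth=3))
```

In double-precision arithmetic, `tanh_bump(1.0)` evaluates to exactly `0.0`, since both $\tanh$ terms round to `1.0`. This hints at why finite precision matters here: a point evaluation taken away from a sharply localized bump carries no information about where the bump is.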