🤖 AI Summary
Implicit regularization in deep learning—such as the bias introduced by early stopping or Dropout—is notoriously difficult to characterize analytically, particularly under complex training strategies where general estimation methods are lacking. This work proposes an empirical framework based on gradient matching that quantifies the discrepancy between weight updates and loss gradients, enabling estimation of implicit regularization in arbitrary deep networks without requiring analytical derivations. The approach provides the first scalable and general-purpose analysis of biases induced by sophisticated training mechanisms. It not only recovers the classical result that early stopping is equivalent to ℓ² regularization but also reveals that Dropout similarly introduces a significant implicit ℓ² regularization effect in deep networks.
📝 Abstract
Deep learning systems are known to exhibit implicit regularization (alt. implicit bias), favoring simple solutions instead of merely minimizing the loss function. In some cases, we can analytically derive the implicit regularization -- connecting it to an equivalent penalty that augments the learning objective. However, modern deep learning systems are complex, carrying modifications to the training procedure and architecture (e.g. early stopping, minibatching, dropout) whose effects are not always directly interpretable. Although estimating the resulting implicit regularization could aid theorists in algorithm design and practitioners in interpreting their hyperparameter choices, this problem has received little direct attention. It is also tractable: regularization makes weight updates deviate from loss gradients, promising a signal for identifying implicit bias. Here we provide gradient matching methods that can be used to empirically estimate the implicit regularization. Our method works on networks with known regularization, recovering popular explicit penalties like $\ell_1$ and $\ell_2$. It also replicates known implicit effects, like the quadratic weight penalty induced by early stopping in gradient descent, demonstrating that it can be used to test theories of implicit regularization. Crucially, because our method is empirical, it can handle implicit regularization in arbitrary networks. We demonstrate this use by characterizing the effects of dropout in deep networks, showing implicit $\ell_2$ effects in this popular method. Our work shows that practitioners can use gradient matching to understand regularization in networks with implicit biases that are too complicated to derive analytically.