🤖 AI Summary
In ReLU networks, weight-rescaling invariance makes conventional PAC-Bayes generalization bounds inconsistent and potentially vacuous: distinct parameterizations of the same function can yield arbitrarily different complexity estimates.
Method: This work lifts the PAC-Bayes framework to function space, defining a rescaling-invariant complexity measure on a lifted representation in which functionally equivalent parameterizations coincide. Leveraging KL divergence bounds and the data processing inequality, it constructs data-dependent generalization guarantees that are non-vacuous and tighter.
Contribution/Results: The approach removes the influence of parameter redundancy on the bound, substantially reducing complexity-estimation bias on standard architectures. By operating directly on functional equivalence classes rather than on parameterizations, it yields a more discriminative theoretical tool for analyzing generalization in deep learning: a rescaling-invariant, non-vacuous PAC-Bayes bound grounded in function-space geometry.
📝 Abstract
A central challenge in understanding generalization is to obtain non-vacuous guarantees that go beyond worst-case complexity over data or weight space. Among existing approaches, PAC-Bayes bounds stand out as they can provide tight, data-dependent guarantees even for large networks. However, in ReLU networks, rescaling invariances mean that different weight distributions can represent the same function while leading to arbitrarily different PAC-Bayes complexities. We propose to study PAC-Bayes bounds in an invariant, lifted representation that resolves this discrepancy. This paper explores both the guarantees provided by this approach (invariance, tighter bounds via data processing) and the algorithmic aspects of KL-based rescaling-invariant PAC-Bayes bounds.
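The rescaling invariance at the heart of the abstract is easy to verify numerically. The sketch below (illustrative, not from the paper) builds a one-hidden-layer ReLU network, scales the incoming weights of the hidden layer by a factor alpha and the outgoing weights by 1/alpha, and checks that the function is unchanged while the weight norm, the quantity that parameter-space PAC-Bayes complexities typically depend on, changes drastically:

```python
# Minimal sketch: ReLU positive homogeneity means relu(a*z) = a*relu(z)
# for a > 0, so rescaling (W1 -> a*W1, W2 -> W2/a) preserves the network
# function while changing parameter-space weight norms arbitrarily.
import numpy as np

rng = np.random.default_rng(0)

# One-hidden-layer ReLU network: f(x) = W2 @ relu(W1 @ x)
W1 = rng.normal(size=(5, 3))
W2 = rng.normal(size=(1, 5))

def relu(z):
    return np.maximum(z, 0.0)

def f(x, W1, W2):
    return W2 @ relu(W1 @ x)

# Rescale the two layers by alpha and 1/alpha
alpha = 100.0
W1s, W2s = alpha * W1, W2 / alpha

x = rng.normal(size=(3,))
out, out_s = f(x, W1, W2), f(x, W1s, W2s)

# Same function on this input (and on every input)...
assert np.allclose(out, out_s)

# ...but very different squared Frobenius norms, hence very different
# norm-based "complexity" for the same function
norm = np.linalg.norm(W1) ** 2 + np.linalg.norm(W2) ** 2
norm_s = np.linalg.norm(W1s) ** 2 + np.linalg.norm(W2s) ** 2
print(norm, norm_s)
```

Since alpha can be taken arbitrarily large, the norm gap is unbounded, which is why a bound defined on functional equivalence classes, rather than on a particular parameterization, is needed for consistency.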