Hinge Regression Trees and HRT-Boost: Newton-Optimized Oblique Learning for Compact Tabular Models

📅 2026-05-22
📈 Citations: 0
Influential: 0
📄 PDF

career value

201K/year
🤖 AI Summary
This work addresses the optimization challenges in oblique decision trees arising from discrete and non-convex splitting functions by proposing a novel formulation that recasts each oblique split as a nonlinear least-squares problem over two linear predictors. The resulting max/min envelope exhibits ReLU-like representational capacity and enables efficient node-wise optimization via a damped Newton method. Building on this, the authors develop HRT, a universal approximation framework with an explicit \(O(\delta^2)\) approximation rate, and introduce HRT-Boost—an ensemble algorithm that synergistically integrates functional gradient descent. Experimental results demonstrate that a single HRT tree achieves performance on par with state-of-the-art baselines, while HRT-Boost maintains competitive accuracy with substantially reduced model size, offering both theoretical guarantees and compactness.
📝 Abstract
Learning high-quality oblique decision trees remains a significant challenge due to the discrete and non-convex nature of split optimization. We present the Hinge Regression Tree (HRT) framework, which reframes each oblique split as a nonlinear least-squares problem over two linear predictors whose max/min envelope induces ReLU-like representation capacity. We show that the resulting node-level optimization can be interpreted as a damped Newton method, and we establish the monotonic decrease of the node objective for its backtracking line-search variant. We establish, theoretically, that HRT is a universal approximator with an explicit $O(δ^2)$ approximation rate. Building upon this base learner, we propose HRT-Boost, a mathematically synergistic ensemble extension that couples node-level Newton updates with stage-wise functional gradient descent. We show that this ensemble construction admits a stage-wise empirical risk reduction guarantee under the squared loss. Empirical evaluations on synthetic and real-world benchmarks show that HRT is highly competitive with established single-tree baselines, and HRT-Boost compares favorably with strong ensemble baselines and often yields substantially more compact models. The code is publicly available at https://github.com/Hongyi-Li-sz/HRT-Boost.
Problem

Research questions and friction points this paper is trying to address.

oblique decision trees
split optimization
non-convex optimization
compact tabular models
universal approximation
Innovation

Methods, ideas, or system contributions that make the work stand out.

oblique decision trees
Newton optimization
universal approximation
functional gradient boosting
compact tabular models