🤖 AI Summary
Uniform computational resource allocation across all mesh nodes in graph neural PDE solvers leads to inefficiency, as it ignores spatially heterogeneous physical complexity. Method: We propose a spatially adaptive quantization framework that employs a lightweight auxiliary model to dynamically identify high-error regions and accordingly assign differentiated bitwidths to node, edge, and cluster features. The method integrates state-of-the-art graph neural architectures—including MP-PDE and GraphViT—to enable PDE solving on unstructured meshes. Contribution/Results: Our approach achieves Pareto-optimal accuracy–cost trade-offs across diverse physical field tasks. Under identical computational budgets, it improves solution accuracy by up to 50% over uniform quantization baselines, while significantly enhancing resource utilization efficiency and generalization capability.
📝 Abstract
Physical systems commonly exhibit spatially varying complexity, presenting a significant challenge for neural PDE solvers. While Graph Neural Networks can handle the irregular meshes required for complex geometries and boundary conditions, they still apply uniform computational effort across all nodes regardless of the underlying physics complexity. This leads to inefficient resource allocation where computationally simple regions receive the same treatment as complex phenomena. We address this challenge by introducing Adaptive Mesh Quantization: spatially adaptive quantization across mesh node, edge, and cluster features, dynamically adjusting the bit-width used by a quantized model. We propose an adaptive bit-width allocation strategy driven by a lightweight auxiliary model that identifies high-loss regions in the input mesh. This enables dynamic resource distribution in the main model, where regions of higher difficulty are allocated increased bit-width, optimizing computational resource utilization. We demonstrate our framework's effectiveness by integrating it with two state-of-the-art models, MP-PDE and GraphViT, to evaluate performance across multiple tasks: 2D Darcy flow, large-scale unsteady fluid dynamics in 2D, steady-state Navier-Stokes simulations in 3D, and a 2D hyper-elasticity problem. Our framework demonstrates consistent Pareto improvements over uniformly quantized baselines, yielding up to 50% improvements in performance at the same cost.