π€ AI Summary
Existing machine learning methods for solving multi-input, multi-scale partial differential equations (PDEs) suffer from poor generalization and high computational cost. To address these limitations, this paper proposes a novel Transformer-based architecture. Our method introduces four key innovations: (1) a dual-sequence structure that decouples grid points from query points; (2) a gated conditional embedding (GCE) layer enabling flexible incorporation of variable-dimensional physical inputs; (3) Hilbert-curve-based spatial reordering combined with block-wise embedding to compress sequence length; and (4) a multi-scale attention mechanism to enhance cross-scale feature modeling. Evaluated on multi-physics benchmarks, our approach achieves superior accuracy and inference efficiency over state-of-the-art methods. It significantly reduces computational overhead for large-scale geometric models and enables real-time, high-resolution PDE solving.
π Abstract
Partial Differential Equations (PDEs) are fundamental for modeling physical systems, yet solving them in a generic and efficient manner using machine learning-based approaches remains challenging due to limited multi-input and multi-scale generalization capabilities, as well as high computational costs. This paper proposes the Multi-input and Multi-scale Efficient Transformer (MMET), a novel framework designed to address the above challenges. MMET decouples mesh and query points as two sequences and feeds them into the encoder and decoder, respectively, and uses a Gated Condition Embedding (GCE) layer to embed input variables or functions with varying dimensions, enabling effective solutions for multi-scale and multi-input problems. Additionally, a Hilbert curve-based reserialization and patch embedding mechanism decrease the input length. This significantly reduces the computational cost when dealing with large-scale geometric models. These innovations enable efficient representations and support multi-scale resolution queries for large-scale and multi-input PDE problems. Experimental evaluations on diverse benchmarks spanning different physical fields demonstrate that MMET outperforms SOTA methods in both accuracy and computational efficiency. This work highlights the potential of MMET as a robust and scalable solution for real-time PDE solving in engineering and physics-based applications, paving the way for future explorations into pre-trained large-scale models in specific domains. This work is open-sourced at https://github.com/YichenLuo-0/MMET.