🤖 AI Summary
This work investigates the universal approximation capabilities of Transformers and neural integral operators in Banach spaces. It addresses three core problems: (1) whether Transformers can universally approximate integral operators between Hölder spaces; (2) whether neural integral operators exist that universally approximate arbitrary continuous linear or nonlinear operators between Banach spaces; and (3) how to overcome regularity constraints for broader approximation. We first establish the universal approximation property of Transformers for integral operators acting between Hölder spaces. Second, we propose a generalized neural integral operator based on the Gavurin integral and prove a universal approximation theorem for it over continuous operators between arbitrary Banach spaces. Third, we incorporate Leray–Schauder mappings into the Transformer architecture to eliminate dependence on smoothness assumptions on the input and output spaces. These results provide a rigorous functional-analytic foundation for operator learning and extend the theoretical scope of deep learning in infinite-dimensional tasks such as PDE solving and physics-informed modeling.
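For orientation, the integral operators in (1) are, in the classical linear setting, maps of the form

$$(Tu)(x) = \int_{\Omega} \kappa(x, y)\, u(y)\, \mathrm{d}y,$$

and self-attention can be read as a data-dependent discretization of such an operator, with the attention scores playing the role of the kernel $\kappa$. This reading is offered only as intuition, as is common in the operator-learning literature; the paper's precise operator class and its Gavurin-integral construction may differ.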
📝 Abstract
We study the universal approximation properties of transformers and neural integral operators for operators in Banach spaces. In particular, we show that the transformer architecture is a universal approximator of integral operators between Hölder spaces. Moreover, we show that a generalized version of neural integral operators, based on the Gavurin integral, is a universal approximator of arbitrary continuous operators between Banach spaces. Lastly, we show that a modified version of the transformer, which uses Leray–Schauder mappings, is a universal approximator of operators between arbitrary Banach spaces.
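To make the object of study concrete, here is a minimal numpy sketch of a neural integral operator: a kernel kappa_theta parameterized by a small MLP and applied through a simple quadrature rule. The names (`mlp_kernel`, `neural_integral_operator`), the architecture, and the quadrature are illustrative assumptions, not the Gavurin-integral construction proved universal in the paper.

```python
import numpy as np

def mlp_kernel(xy, W1, b1, W2, b2):
    """Tiny one-hidden-layer MLP kappa_theta(x, y): R^2 -> R (illustrative)."""
    h = np.tanh(xy @ W1 + b1)
    return h @ W2 + b2

def neural_integral_operator(u, xs, W1, b1, W2, b2):
    """Approximate (Ku)(x) = integral of kappa_theta(x, y) * u(y) dy over [0, 1]
    on a uniform grid xs by the quadrature (1/n) * sum_j kappa_theta(x, y_j) u(y_j)."""
    n = len(xs)
    out = np.empty(n)
    for i, x in enumerate(xs):
        pairs = np.stack([np.full(n, x), xs], axis=1)  # rows (x, y_j), shape (n, 2)
        k = mlp_kernel(pairs, W1, b1, W2, b2).ravel()  # kernel values kappa_theta(x, y_j)
        out[i] = np.mean(k * u)                        # quadrature sum over y_j
    return out

# Apply a randomly initialized operator to u(x) = sin(2*pi*x) on [0, 1].
rng = np.random.default_rng(0)
hidden = 16
W1, b1 = rng.normal(size=(2, hidden)), np.zeros(hidden)
W2, b2 = rng.normal(size=(hidden, 1)), np.zeros(1)
xs = np.linspace(0.0, 1.0, 64)
u = np.sin(2 * np.pi * xs)
Ku = neural_integral_operator(u, xs, W1, b1, W2, b2)
print(Ku.shape)  # (64,)
```

In an operator-learning pipeline the parameters of the kernel MLP would be trained so that the discretized operator matches observed input-output function pairs; the universality results summarized above concern what such parameterized families can approximate in the infinite-dimensional limit.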