🤖 AI Summary
To address the challenge of real-time, seamless deployment of deep learning models within digital audio workstations (DAWs), this paper introduces an open-source, PyTorch-native framework for efficient integration of neural audio models into professional audio plugins. The framework provides a unified, model-agnostic interface with built-in variable-length buffer management, sample-rate adaptation, precise latency compensation, and parameter control—all implemented entirely in Python to significantly lower plugin development barriers. Leveraging a cross-platform SDK, it enables one-click export of standard VST3 and Audio Unit (AU) binaries compatible with major DAWs. Experimental evaluation demonstrates real-time performance and robustness across diverse tasks, including audio effect modeling, timbre transfer, and sample generation. The framework has been widely adopted by researchers, industry practitioners, and creative artists, accelerating the transition of neural audio technologies from academic research to practical music production and artistic creation.
📝 Abstract
Neural audio processing has unlocked novel methods of sound transformation and synthesis, yet integrating deep learning models into digital audio workstations (DAWs) remains challenging due to real-time / neural network inference constraints and the complexities of plugin development. In this paper, we introduce the Neutone SDK: an open source framework that streamlines the deployment of PyTorch-based neural audio models for both real-time and offline applications. By encapsulating common challenges such as variable buffer sizes, sample rate conversion, delay compensation, and control parameter handling within a unified, model-agnostic interface, our framework enables seamless interoperability between neural models and host plugins while allowing users to work entirely in Python. We provide a technical overview of the interfaces needed to accomplish this, as well as the corresponding SDK implementations. We also demonstrate the SDK's versatility across applications such as audio effect emulation, timbre transfer, and sample generation, as well as its adoption by researchers, educators, companies, and artists alike. The Neutone SDK is available at https://github.com/Neutone/neutone_sdk