HARP 2.0: Expanding Hosted, Asynchronous, Remote Processing for Deep Learning in the DAW

πŸ“… 2025-03-04
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
Music creators face significant challenges in seamlessly integrating deep learning models into digital audio workstations (DAWs). Method: We propose a hosted, asynchronous remote processing framework enabling transparent integration. It comprises a VST/AU-compliant DAW plugin that communicates with Gradio endpoints via a lightweight pyharp Python API, supporting real-time audio/MIDI streaming, MIDI event serialization, and inference of joint audio-MIDI annotation modelsβ€”all with native in-plugin rendering of interactive UIs and results. Contribution/Results: This work introduces the first DAW-native support for general-purpose MIDI generation and multimodal joint audio-MIDI annotation models. It enables zero-context-switch creative AI workflows and achieves bounded end-to-end latency (<200 ms under typical conditions), while significantly improving plugin stability and cross-platform compatibility (macOS, Windows, Linux). The framework lowers the barrier to adopting AI tools in music production and fosters efficient collaboration between music technology developers and creators.

Technology Category

Application Category

πŸ“ Abstract
HARP 2.0 brings deep learning models to digital audio workstation (DAW) software through hosted, asynchronous, remote processing, allowing users to route audio from a plug-in interface through any compatible Gradio endpoint to perform arbitrary transformations. HARP renders endpoint-defined controls and processed audio in-plugin, meaning users can explore a variety of cutting-edge deep learning models without ever leaving the DAW. In the 2.0 release we introduce support for MIDI-based models and audio/MIDI labeling models, provide a streamlined pyharp Python API for model developers, and implement numerous interface and stability improvements. Through this work, we hope to bridge the gap between model developers and creatives, improving access to deep learning models by seamlessly integrating them into DAW workflows.
Problem

Research questions and friction points this paper is trying to address.

Integrates deep learning models into DAW software.
Enables audio and MIDI transformations via remote processing.
Bridges gap between model developers and creative professionals.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hosted asynchronous remote processing for DAWs
Integration of deep learning models in DAWs
Support for MIDI and audio/MIDI labeling models
πŸ”Ž Similar Papers
No similar papers found.