Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

๐Ÿ“… 2025-12-09
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Current Web-based solutions for 3D Gaussian Splatting (3DGS) and neural rendering are fragmented and deployment-heavy, lacking support for dynamic content and generative modeling. This paper introduces the first lightweight, WebGPU-native platform enabling real-time neural rendering and dynamic generative 3D modeling directly in browsers. Our approach unifies reconstruction and generation within a single framework. Key contributions include: (1) a standardized Gaussian generator protocol supporting per-frame ONNX inference and plugin-based Gaussian distribution updates; (2) the first in-browser integration of 3DGS reconstruction with generative paradigmsโ€”fully compatible with MLP-based 3DGS, 4DGS, neural avatars, and stylized post-processing; and (3) a TypeScript-wrapped three.js plugin library enabling low-barrier deployment, reproducibility, and cross-method evaluation. Experiments demonstrate significantly higher rendering efficiency than existing Web viewers under identical asset conditions, accelerating the practical adoption of the 3DGS family of methods.

Technology Category

Application Category

๐Ÿ“ Abstract
Neural rendering, particularly 3D Gaussian Splatting (3DGS), has evolved rapidly and become a key component for building world models. However, existing viewer solutions remain fragmented, heavy, or constrained by legacy pipelines, resulting in high deployment friction and limited support for dynamic content and generative models. In this work, we present Visionary, an open, web-native platform for real-time various Gaussian Splatting and meshes rendering. Built on an efficient WebGPU renderer with per-frame ONNX inference, Visionary enables dynamic neural processing while maintaining a lightweight, "click-to-run" browser experience. It introduces a standardized Gaussian Generator contract, which not only supports standard 3DGS rendering but also allows plug-and-play algorithms to generate or update Gaussians each frame. Such inference also enables us to apply feedforward generative post-processing. The platform further offers a plug in three.js library with a concise TypeScript API for seamless integration into existing web applications. Experiments show that, under identical 3DGS assets, Visionary achieves superior rendering efficiency compared to current Web viewers due to GPU-based primitive sorting. It already supports multiple variants, including MLP-based 3DGS, 4DGS, neural avatars, and style transformation or enhancement networks. By unifying inference and rendering directly in the browser, Visionary significantly lowers the barrier to reproduction, comparison, and deployment of 3DGS-family methods, serving as a unified World Model Carrier for both reconstructive and generative paradigms.
Problem

Research questions and friction points this paper is trying to address.

Fragmented viewer solutions hinder 3D Gaussian Splatting deployment and dynamic content support
Existing pipelines are heavy and constrained, limiting real-time neural rendering in browsers
Lack of a unified, open web platform for integrating and deploying 3DGS-family methods
Innovation

Methods, ideas, or system contributions that make the work stand out.

WebGPU-based real-time Gaussian Splatting rendering platform
Per-frame ONNX inference for dynamic neural content processing
Standardized plug-and-play Gaussian Generator contract system
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yuning Gong
Shanghai AI Laboratory
Y
Yifei Liu
Shanghai AI Laboratory
Yifan Zhan
Yifan Zhan
The University of Tokyo
3D Vision
Muyao Niu
Muyao Niu
The University of Tokyo
AIGCComputational Imaging3D Vision
Xueying Li
Xueying Li
Helmholtz-Centre for Environmental Research - UFZ
climate changehydrologyremote sensing
Y
Yuanjun Liao
Shanghai AI Laboratory
J
Jiaming Chen
Sichuan University
Y
Yuanyuan Gao
Shanghai AI Laboratory
J
Jiaqi Chen
Shanghai AI Laboratory
M
Minming Chen
Shanghai AI Laboratory
L
Li Zhou
Shanghai AI Laboratory
Y
Yuning Zhang
Shanghai AI Laboratory
W
Wei Wang
Shanghai AI Laboratory
X
Xiaoqing Hou
Shanghai AI Laboratory
H
Huaxi Huang
Shanghai AI Laboratory
S
Shixiang Tang
Shanghai AI Laboratory
L
Le Ma
Shanghai AI Laboratory
D
Dingwen Zhang
Northwestern Polytechnical University
X
Xue Yang
Shanghai Jiao Tong University
Junchi Yan
Junchi Yan
FIAPR & ICML Board Member, SJTU (2018-), SII (2024-), AWS (2019-2022), IBM (2011-2018)
Computational IntelligenceAI4ScienceMachine LearningAutonomous Driving
Y
Yanchi Zhang
Sichuan University
Yinqiang Zheng
Yinqiang Zheng
The University of Tokyo
Optical ImagingCamera SystemComputer Vision
X
Xiao Sun
Shanghai AI Laboratory
Zhihang Zhong
Zhihang Zhong
Researcher, Shanghai AI Laboratory
Computer visionDeep learning