Seamless Optical Cloud Computing across Edge-Metro Network for Generative AI

📅 2024-12-04
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Generative AI deployment in cloud computing faces critical bottlenecks in energy efficiency and computational security. Method: This project proposes a photonic cloud computing architecture tailored for generative AI, establishing an optical computing center enabling seamless integration across edge and metropolitan-area networks. It innovatively implements optical-domain input encoding, optical neural network (ONN) model modulation, and parallel matrix multiplication, achieving the first native deployment of photonic computing at the edge–metropolitan network layer. Leveraging ONN modulation and wavelength-division-multiplexed (WDM) optical interconnects, it overcomes fundamental energy-efficiency and data-security limitations of conventional electronic cloud architectures. Contribution/Results: Experimental evaluation demonstrates an energy efficiency of 118.6 mW/TOPs—two orders of magnitude higher than state-of-the-art electronic solutions. End-to-end execution of complex generative models—including Stable Diffusion—is successfully realized, empirically validating the feasibility and practicality of photonic acceleration for generative AI workloads.

Technology Category

Application Category

📝 Abstract
The rapid advancement of generative artificial intelligence (AI) in recent years has profoundly reshaped modern lifestyles, necessitating a revolutionary architecture to support the growing demands for computational power. Cloud computing has become the driving force behind this transformation. However, it consumes significant power and faces computation security risks due to the reliance on extensive data centers and servers in the cloud. Reducing power consumption while enhancing computational scale remains persistent challenges in cloud computing. Here, we propose and experimentally demonstrate an optical cloud computing system that can be seamlessly deployed across edge-metro network. By modulating inputs and models into light, a wide range of edge nodes can directly access the optical computing center via the edge-metro network. The experimental validations show an energy efficiency of 118.6 mW/TOPs (tera operations per second), reducing energy consumption by two orders of magnitude compared to traditional electronic-based cloud computing solutions. Furthermore, it is experimentally validated that this architecture can perform various complex generative AI models through parallel computing to achieve image generation tasks.
Problem

Research questions and friction points this paper is trying to address.

Reducing power consumption in cloud computing for generative AI
Enhancing computational scale and security in optical networks
Deploying seamless optical computing across edge-metro networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Optical cloud computing across edge-metro network
Modulating inputs and models into light
High energy efficiency with 118.6 mW/TOPs
🔎 Similar Papers
No similar papers found.
Sizhe Xing
Sizhe Xing
Fudan university
Optical Communication
Aolong Sun
Aolong Sun
Fudan University
Silicon PhotonicsOptical CommunicationPhotonic ComputingMultimode Optics
C
Chengxi Wang
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Y
Yizhi Wang
Centre for Photonic Systems, Electrical Engineering Division, Department of Engineering, University of Cambridge, Cambridge CB3 0FA, UK.
Boyu Dong
Boyu Dong
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Junhui Hu
Junhui Hu
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
X
Xuyu Deng
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
A
An Yan
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Y
Yingjun Liu
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Fangchen Hu
Fangchen Hu
Zhangjiang Laboratory, Shanghai, China
Z
Zhongya Li
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Ouhan Huang
Ouhan Huang
Fudan Univertisy
Visible Light CommunicationHuman Pose EstimationUWB
J
Junhao Zhao
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Y
Yingjun Zhou
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Z
Ziwei Li
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
J
Jianyang Shi
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Xi Xiao
Xi Xiao
Oak Ridge National Laboratory | University of Alabama at Birmingham
LLM / MLLM EfficiencyImage / Video GenerationImage / Video Understanding
R
R. Penty
Centre for Photonic Systems, Electrical Engineering Division, Department of Engineering, University of Cambridge, Cambridge CB3 0FA, UK.
Qixiang Cheng
Qixiang Cheng
Centre for Photonic Systems, Electrical Engineering Division, Department of Engineering, University of Cambridge, Cambridge CB3 0FA, UK.
Nan Chi
Nan Chi
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China
Junwen Zhang
Junwen Zhang
School of Information Science and Technology, Fudan University, Shanghai, China; Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, China