EvoGP: A GPU-accelerated Framework for Tree-Based Genetic Programming

📅 2025-01-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the scalability bottleneck of tree-based genetic programming (TGP) in symbolic regression, feature engineering, and scientific modeling—caused by its compute-intensive nature—this paper introduces the first full-stack GPU-accelerated framework for TGP. Our method introduces three key innovations: (1) a novel tensorized tree encoding that maps heterogeneous tree structures onto fixed-shape tensors; (2) a parallel genetic operations framework built upon shared primitives, enabling dual-granularity parallelism—population-level and data-level—in fitness evaluation; and (3) custom CUDA kernels and a unified operator library, complemented by a multi-task benchmark suite. Experiments demonstrate that our framework achieves up to 140.89× speedup over the state-of-the-art GPU-accelerated TGP methods, while maintaining or improving accuracy. Its effectiveness and generalizability are validated across symbolic regression, classification, and robot control tasks.

Technology Category

Application Category

📝 Abstract
Tree-based Genetic Programming (TGP) is a key evolutionary algorithm widely used in symbolic regression, feature engineering, and scientific modeling. Its high computational demands make GPU acceleration essential for scalable and high-performance evolutionary computation. However, GPU acceleration of TGP faces three key challenges: inefficient tree encoding, highly heterogeneous genetic operations, and limited parallelism in fitness evaluation. To address these challenges, we introduce EvoGP, a comprehensive GPU-accelerated TGP framework. First, we design a tensorized encoding scheme to represent tree with different structures as tensors with the same shape, optimizing memory access and enabling efficient parallel execution. Second, we propose a unified parallel framework for genetic operations by leveraging shared computational primitives and implementing dedicated CUDA kernels for scalable performance. Third, we present a fully parallel fitness evaluation strategy for symbolic regression, exploiting both population-level and data-level parallelism to maximize GPU utilization. Moreover, we implement a comprehensive library to provide rich algorithm operators and benchmark problems. EvoGP is extensively tested on various tasks, including symbolic regression, classification, and robotics control, demonstrating its versatility and effectiveness across diverse application scenarios. Experimental results show that EvoGP achieves up to a 140.89x speedup over the state-of-the-art GPU-based TGP implementation, while maintaining or exceeding the accuracy of baseline methods. EvoGP is open-source and accessible at: https://github.com/EMI-Group/evogp.
Problem

Research questions and friction points this paper is trying to address.

Tree-based Genetic Programming
Efficiency Bottleneck
Parallel Computing
Innovation

Methods, ideas, or system contributions that make the work stand out.

EvoGP
GPU-accelerated Tree-based Genetic Programming
Parallel Fitness Evaluation
🔎 Similar Papers
No similar papers found.
L
Lishuang Wang
Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
Z
Zhihong Wu
Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
Kebin Sun
Kebin Sun
Southern University of Science and Technology
artificial intelligence
Zhuozhao Li
Zhuozhao Li
Southern University of Science and Technology
Distributed SystemsHigh-performance ComputingCloud Computing
R
Ran Cheng
Department of Data Science and Artificial Intelligence, and Department of Computing, The Hong Kong Polytechnic University, Hong Kong SAR, China