Using MLIR Transform to Design Sliced Convolution Algorithm

📅 2025-11-22

📈 Citations: 0

✨ Influential: 0

career value

178K/year

🤖 AI Summary

To address the performance optimization challenge of 2D convolutions in MLIR, this paper proposes SConvTransform—a declarative optimization framework built upon the Transform dialect. The method introduces *convolutional slicing analysis*, which integrates static shape inference with parametric affine modeling to automatically derive optimal tiling strategies and data layouts—including packing—enabling end-to-end, structured lowering of Linalg convolutions to hardware-friendly low-level operations. Designed as a modular, reusable, and edge-case-aware framework, SConvTransform significantly improves maintainability and extensibility. Evaluation on ARM Scalable Matrix Extension (SME) and Intel AVX-512 achieves 60% and 67% of peak theoretical performance, respectively. These results validate the efficacy of static-analysis-driven, structured tiling and demonstrate strong cross-architecture portability.

Technology Category

Application Category

📝 Abstract

This paper proposes SConvTransform, a Transform dialect extension that provides operations for optimizing 2D convolutions in MLIR. Its main operation, SConvOp, lowers Linalg convolutions into tiled and packed generic operations through a fully declarative transformation pipeline. The process is guided by a Convolution Slicing Analysis that determines tile sizes and data layout strategies based on input and filter shapes, as well as target architecture parameters. SConvOp handles edge cases by splitting irregular regions and adjusting affine maps where needed. All packing and tiling operations are derived from a parametric set of affine equations, enabling reusable and analyzable transformations. Although functional correctness was the primary goal of this work, the experimental evaluation demonstrates the effectiveness of SConvTransform, achieving good enough performance across different target architectures. Future work will focus on optimizing performance and porting to other target devices. When applied to standard convolution configurations, the generated code achieves up to 60% of peak performance on ARM SME and 67% on Intel AVX512. These results validate the benefit of combining static shape analysis with structured tiling and packing strategies within the MLIR Transform dialect. Furthermore, the modular design of SConvTransform facilitates integration with future extensions, enabling continued optimization of convolution workloads through MLIR's extensible compilation infrastructure.

Problem

Research questions and friction points this paper is trying to address.

Optimizing 2D convolutions through MLIR Transform dialect extensions

Lowering convolutions into tiled operations using declarative transformations

Determining optimal tile sizes through convolution slicing analysis

Innovation

Methods, ideas, or system contributions that make the work stand out.

Extends MLIR Transform dialect for convolution optimization

Lowers convolutions into tiled operations via declarative pipeline

Uses parametric affine equations for packing and tiling

🔎 Similar Papers

No similar papers found.

💼 Related Jobs

AI Model Optimization Architect

Qualcomm

$158,400.00 - $237,600.00

San Diego, California, United States of America

Software Engineer, Systems ML - Compilers