Yueming Yuan, Ahan Gupta, et al. X-MoE: Enabling Scalable Training for Emerging Mixture-of-Experts Architectures on HPC Platforms. SC 2025.
Ahan Gupta, Yueming Yuan, et al. SPLAT: A framework for optimised GPU code-generation for SParse reguLar ATtention. OOPSLA 2025.
Hoa La*, Ahan Gupta*, et al. MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models. (Equal contribution, under submission)
Muyan Hu, Ahan Gupta, et al. VTC: DNN Compilation with Virtual Tensors for Data Movement Elimination. (Under submission)
Ahan Gupta, Hao Guo, et al. FLuRKA: Fast fused Low-Rank & Kernel Attention. (Under submission)