🤖 AI Summary
The environmental impact of programming language choice on AI energy consumption remains underexplored, particularly across training and inference phases. Method: This study conducts the first systematic, empirical comparison of energy consumption between Python and R for machine learning, using standardized hardware and direct power measurement. We evaluate five regression and five classification algorithms under reproducible conditions, concurrently logging execution time and real-time power draw. Contribution/Results: Language selection induces energy differences of up to 99.16%; 95% of test cases exhibit statistically significant energy disparities (p < 0.05), with the more efficient language varying by task type. These findings establish programming language as a critical, previously overlooked software-level factor influencing AI’s carbon footprint. The work provides empirically grounded, actionable guidance for sustainable language selection in green machine learning practice.
📝 Abstract
The utilization of Machine Learning (ML) in contemporary software systems is extensive and continually expanding. However, its usage is energy-intensive, contributing to increased carbon emissions and demanding significant resources. While numerous studies examine the performance and accuracy of ML, only a limited few focus on its environmental aspects, particularly energy consumption. In addition, despite emerging efforts to compare energy consumption across various programming languages for specific algorithms and tasks, there remains a gap specifically in comparing these languages for ML-based tasks. This paper aims to raise awareness of the energy costs associated with employing different programming languages for ML model training and inference. Through this empirical study, we measure and compare the energy consumption along with run-time performance of five regression and five classification tasks implemented in Python and R, the two most popular programming languages in this context. Our study results reveal a statistically significant difference in costs between the two languages in 95% of the cases examined. Furthermore, our analysis demonstrates that the choice of programming language can influence energy efficiency significantly, up to 99.16% during model training and up to 99.8% during inferences, for a given ML task.