🤖 AI Summary
Sparse in-situ geothermal gradient measurements and low accuracy of conventional empirical models hinder reliable geothermal resource assessment in Colombia.
Method: This study develops, for the first time, a high-resolution (1 km) machine learning spatial prediction framework integrating heterogeneous geospatial features—including geological, geophysical, and topographic variables—optimized via advanced feature engineering to enhance generalization. We employ Random Forest, XGBoost, and Geographically Weighted Regression, rigorously validated through k-fold cross-validation and interpreted using SHAP (Shapley Additive Explanations).
Contribution/Results: The best-performing model achieves an R² of 0.89—substantially outperforming classical empirical formulas. We generate a national-scale geothermal gradient map and identify five underexplored high-potential target zones. These results provide a robust, data-driven foundation for quantitative resource evaluation and strategic exploration site selection in Colombia.