TabPFN Extensions for Interpretable Geotechnical Modelling

📅 2026-03-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of parameter inference from sparse and heterogeneous borehole data in geotechnical engineering by introducing TabPFN—a Transformer-based tabular foundation model—into geotechnical modeling for the first time. Leveraging in-context learning, the approach enables Bayesian iterative imputation of missing mechanical parameters and soil type classification without retraining, supporting interpretable inference. Through cosine similarity embedding visualization, posterior distribution estimation, and SHAP analysis, the method reveals unsupervised separation of soil classes and uncovers physically meaningful inter-parameter relationships. It successfully distinguishes clay from sand and significantly improves imputation accuracy for four key mechanical parameters, with posterior uncertainties aligning with established geotechnical principles and reproducing classical empirical relationships such as the Skempton compression index.

Technology Category

Application Category

📝 Abstract
Geotechnical site characterisation relies on sparse, heterogeneous borehole data where uncertainty quantification and model interpretability are as critical as predictive accuracy for reliable engineering decisions. This paper presents an exploratory investigation into the use of TabPFN, a transformer-based tabular foundation model using in-context learning, and its extension library tabpfn-extensions for two geotechnical inference tasks: (1) soil-type classification using N-value and shear-wave velocity data from a synthetic geotechnical dataset, and (2) iterative imputation of five missing mechanical parameters ($s_\mathrm{u}$, $E_{\mathrm{u}}$, ${σ'}_\mathrm{p}$, $C_\mathrm{c}$, $C_\mathrm{v}$) in benchmark problem BM/AirportSoilProperties/2/2025. We apply cosine-similarity analysis to TabPFN-derived embeddings, visualise full posterior distributions from an iterative inference procedure, and compute SHAP-based feature importance, all without model retraining. Learned embeddings clearly separate Clay and Sand samples without explicit soil-type supervision; iterative imputation improves predictions for four of five target parameters, with posterior widths that reflect physically reasonable parameter-specific uncertainty; and SHAP analysis reveals the inter-parameter dependency structure, recovering established geotechnical relationships including the Skempton compression index correlation and the inverse dependence of preconsolidation pressure on water content. These results suggest the potential of foundation-model-based tools to support interpretable, uncertainty-aware parameter inference in data-scarce geotechnical practice.
Problem

Research questions and friction points this paper is trying to address.

geotechnical modelling
uncertainty quantification
model interpretability
sparse borehole data
parameter inference
Innovation

Methods, ideas, or system contributions that make the work stand out.

TabPFN
in-context learning
interpretable modeling
uncertainty quantification
geotechnical parameter imputation
🔎 Similar Papers
No similar papers found.
T
Taiga Saito
Department of Civil and Environmental Engineering, Tohoku University, 6-6-06 Aramaki Aoba, Aoba-ku, Sendai, Miyagi 980-8579, Japan
Yu Otake
Yu Otake
Tohoku university
reliability analysis
D
Daijiro Mizutani
Department of Civil and Environmental Engineering, Tohoku University, 6-6-06 Aramaki Aoba, Aoba-ku, Sendai, Miyagi 980-8579, Japan
Stephen Wu
Stephen Wu
Hamilton College
EconomicsWell-BeingHigher EducationHealth EconomicsBehavioral Economics