🤖 AI Summary
This study investigates how patient-specific covariates—such as age and disease duration—influence treatment-switching behavior among multiple sclerosis (MS) patients across disease-modifying therapies (DMTs). To address this, we propose a covariate-driven sparse Markov chain model. Our method introduces an adaptive sparsification strategy grounded in empirical transition frequencies, which automatically zeroes out clinically unobserved (zero-frequency) transitions, thereby ensuring both statistical identifiability and clinical interpretability. We further develop a parallelized global optimization algorithm to efficiently maximize the multimodal likelihood function. Validated on a real-world MS cohort, the model significantly improves the stability and clinical plausibility of estimated transition matrices; achieves 100% accuracy in identifying zero-frequency transitions; and, for the first time, quantifies the directional effects of key clinical covariates on DMT switching pathways.
📝 Abstract
A Markov model is a widely used tool for modeling sequences of events from a finite state-space and hence can be employed to identify the transition probabilities across treatments based on treatment sequence data. To understand how patient-level covariates impact these treatment transitions, the transition probabilities are modeled as a function of patient covariates. This approach enables the visualization of the effect of patient-level covariates on the treatment transitions across patient visits. The proposed method automatically estimates the entries of the transition matrix with smaller numbers of empirical transitions as constant; the user can set desired cutoff of the number of empirical transition counts required for a particular transition probability to be estimated as a function of covariates. Firstly, this strategy automatically enforces the final estimated transition matrix to contain zeros at the locations corresponding to zero empirical transition counts, avoiding further complicated model constructs to handle sparsity, in an efficient manner. Secondly, it restricts estimation of transition probabilities as a function of covariates, when the number of empirical transitions is particularly small, thus avoiding the identifiability issue which might arise due to the p>n scenario when estimating each transition probability as a function of patient covariates. To optimize the multi-modal likelihood, a parallelized scalable global optimization routine is also developed. The proposed method is applied to understand how the transitions across disease modifying therapies (DMTs) in Multiple Sclerosis (MS) patients are influenced by patient-level demographic and clinical phenotypes.