Predicting Future States with Spatial Point Processes in Single Molecule Resolution Spatial Transcriptomics

📅 2024-01-04
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
🤖 AI Summary
This study addresses the challenge of predicting spatiotemporal patterns of *Sog-D* gene activity in *Drosophila* embryonic development along the anterior–posterior (AP) and dorsal–ventral (DV) axes at future time points. It proposes an integrative method that combines spatial point process statistics with machine learning, specifically by using Ripley's K-function, a measure of spatial clustering, as a feature in an XGBoost model. This integration gives the model an RNA velocity–like capacity for inferring spatial gene expression dynamics from single-molecule-resolution spatial transcriptomic time-series data. The approach is presented as the first accurate, super-resolution, whole-embryo-scale modeling of developmental trajectories. It is reported to significantly outperform existing benchmarks across multiple embryonic stages, with substantial improvements in average prediction accuracy. Crucially, it enables inference of future spatial gene expression patterns from a single time-point measurement, filling a technical gap in high spatiotemporal-resolution predictive modeling of gene expression in developmental biology.

📝 Abstract
In this paper, we introduce an XGBoost-based pipeline to predict the future distribution of cells expressing the Sog-D gene (active cells) along both the anterior-posterior (AP) and dorsal-ventral (DV) axes of the Drosophila embryo during embryogenesis. This method provides insight into how cells and living organisms control gene expression, using super-resolution, whole-embryo spatial transcriptomics imaging at subcellular, single-molecule resolution. An XGBoost model predicts the active cell distribution at the next stage from the distribution at the previous stage. To this end, we leveraged temporally resolved spatial point processes, including Ripley's K-function together with each cell's state at each stage of embryogenesis, and assessed the average predictive accuracy of the predicted active cell distribution. The tool is analogous to RNA velocity for spatially resolved developmental biology: from a single data point, we can predict future spatially resolved gene expression using features derived from spatial point processes.
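As a rough illustration of the spatial statistic named in the abstract, the sketch below computes a naive estimate of Ripley's K-function for a set of 2D points. It is not the authors' implementation: the function name `ripley_k`, the omission of edge correction, and the toy coordinates are all assumptions made for illustration.

```python
import math


def ripley_k(points, radii, area):
    """Naive Ripley's K estimate for 2D points (no edge correction; illustrative only)."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    # Distances between distinct points, one entry per unordered pair.
    dists = [
        math.dist(points[i], points[j])
        for i in range(n)
        for j in range(i + 1, n)
    ]
    # K(r) = area / (n * (n - 1)) * (number of ordered pairs with distance <= r).
    # Each unordered pair contributes two ordered pairs, hence the factor 2.
    scale = area / (n * (n - 1))
    return [scale * 2 * sum(d <= r for d in dists) for r in radii]


# Hypothetical toy data: four points clustered in one corner of a unit square.
clustered = [(0.10, 0.10), (0.12, 0.10), (0.10, 0.12), (0.12, 0.12)]
radii = [0.05, 0.5]
k_values = ripley_k(clustered, radii, area=1.0)

# Under complete spatial randomness, K(r) is approximately pi * r^2.
# Estimates well above this reference suggest clustering at that scale.
csr_reference = [math.pi * r ** 2 for r in radii]
```

Comparing `k_values` against `csr_reference` at each radius gives a scale-dependent clustering signal. Values of this kind are what a pipeline could pass to a downstream model as features.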
Problem

Research questions and friction points this paper is trying to address.

Predict future cell distribution
Spatial transcriptomics in embryogenesis
XGBoost model for gene expression
Innovation

Methods, ideas, or system contributions that make the work stand out.

XGBoost for cell prediction
Ripley's K-function integration
Single molecule resolution transcriptomics
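One way to picture how spatial point-process features and cell state could feed a next-stage predictor is sketched below. It makes several assumptions that the listing does not confirm: the same cells can be matched between consecutive stages, a per-cell count of active neighbours within each radius stands in for a K-function-style feature, and the helper name `build_rows` and the toy data are hypothetical.

```python
import math


def build_rows(cells, active_now, active_next, radii):
    """Pair each cell's stage-t features with its stage-(t+1) label (illustrative only)."""
    features, labels = [], []
    for i, cell in enumerate(cells):
        # Count active neighbours (excluding the cell itself) within each radius.
        counts = [
            sum(
                1
                for j, other in enumerate(cells)
                if j != i and active_now[j] and math.dist(cell, other) <= r
            )
            for r in radii
        ]
        # Feature row: current state followed by the neighbour counts.
        features.append([int(active_now[i])] + counts)
        labels.append(int(active_next[i]))
    return features, labels


# Hypothetical toy data: three cells, two of them active at the current stage.
cells = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0)]
active_now = [True, True, False]
active_next = [True, True, True]
X, y = build_rows(cells, active_now, active_next, radii=[0.2, 2.0])
```

Rows like `X` and labels like `y` could then be given to a gradient-boosted classifier, for example `xgboost.XGBClassifier().fit(X, y)`. The actual feature set and training setup used in the paper are not described in this listing.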
Parisa Boodaghi Malidarreh
PhD student at UTA
machine learning, bioinformatics, artificial neural network
Biraaj Rout
Department of Computer Science and Engineering, University of Texas at Arlington; Multi-Interprofessional Center for Health Informatics, University of Texas at Arlington
M. Nasr
Department of Computer Science and Engineering, University of Texas at Arlington; Multi-Interprofessional Center for Health Informatics, University of Texas at Arlington
Priyanshi Borad
Department of Biology, University of Texas at Arlington; Multi-Interprofessional Center for Health Informatics, University of Texas at Arlington
Jillur Rahman Saurav
PhD Student and Graduate Research Assistant, Luber Lab at The University of Texas at Arlington
Medical Imaging, GenAI, NLP, Computer Vision, Data Science
Jai Prakash Veerla
Google Student Researcher, PhD Candidate in Computer Science, The University of Texas at Arlington
Machine Learning, Artificial Intelligence, Responsible AI, Cancer Research, Human Computer Interaction
Kelli D. Fenelon
Department of Biology, University of Texas at Arlington; Multi-Interprofessional Center for Health Informatics, University of Texas at Arlington
T. Koromila
Department of Biology, University of Texas at Arlington; Multi-Interprofessional Center for Health Informatics, University of Texas at Arlington
Jacob M. Luber
Department of Computer Science and Engineering, University of Texas at Arlington; Department of Bioengineering, University of Texas at Arlington; Multi-Interprofessional Center for Health Informatics, University of Texas at Arlington