Simulation-based inference of yeast centromeres

📅 2025-08-29
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Centromere localization in yeast has long been hindered by chromatin conformational complexity and the absence of reliable prior positional information. To address this, we propose a de novo stochastic simulation–statistical inference framework that integrates experimental Hi-C contact maps with physics-based chromosome conformation dynamics simulations, enabling genome-wide, high-confidence centromere prediction through spatial contact pattern matching. Crucially, our method circumvents the strong dependence on initial coordinate guesses inherent in conventional Hi-C analyses, thereby substantially improving robustness and accuracy—particularly in unannotated or low-quality yeast genome assemblies. Validation in *Saccharomyces cerevisiae* demonstrates precise localization of all 16 centromeres, with a mean error of less than 5 kb. This approach establishes a scalable, unbiased paradigm for centromere annotation in non-model yeast species, offering broad applicability to diverse fungal genomes lacking high-resolution structural or epigenetic priors.

Technology Category

Application Category

📝 Abstract
The chromatin folding and the spatial arrangement of chromosomes in the cell play a crucial role in DNA replication and genes expression. An improper chromatin folding could lead to malfunctions and, over time, diseases. For eukaryotes, centromeres are essential for proper chromosome segregation and folding. Despite extensive research using de novo sequencing of genomes and annotation analysis, centromere locations in yeasts remain difficult to infer and are still unknown in most species. Recently, genome-wide chromosome conformation capture coupled with next-generation sequencing (Hi-C) has become one of the leading methods to investigate chromosome structures. Some recent studies have used Hi-C data to give a point estimate of each centromere, but those approaches highly rely on a good pre-localization. Here, we present a novel approach that infers in a stochastic manner the locations of all centromeres in budding yeast based on both the experimental Hi-C map and simulated contact maps.
Problem

Research questions and friction points this paper is trying to address.

Inferring centromere locations in yeast species
Overcoming limitations of Hi-C data pre-localization reliance
Using stochastic simulation with experimental and simulated maps
Innovation

Methods, ideas, or system contributions that make the work stand out.

Stochastic inference of centromere locations
Combining experimental Hi-C maps
Simulated contact maps analysis
🔎 Similar Papers
No similar papers found.