🤖 AI Summary
RNA co-transcriptional splicing is inherently unpredictable and difficult to program. Method: We introduce the first computational framework for co-transcriptional splicing based on bounded/unbounded hairpin deletion language operations, formalizing DNA template design as a language operation problem. Integrating formal language theory, computational complexity analysis, and RNA secondary structure constraints, we systematically characterize the hierarchy of language families corresponding to four splicing operations. Contribution/Results: We prove, for the first time, that both linearly and logarithmically bounded hairpin deletion are Turing-complete. Our work establishes the closure properties of co-transcriptional splicing languages and determines the computational complexity of template constructibility. It provides a rigorous theoretical foundation and methodological toolkit for controllable, predictable RNA sequence generation and programmable RNA nanostructure design.
📝 Abstract
RNA co-transcriptionality is the process where RNA sequences are spliced while being transcribed from DNA templates. This process holds potential as a key tool for molecular programming. Co-transcriptional folding has been shown to be programmable for assembling nano-scale RNA structures, and recent advances have proven its Turing universality. While post-transcriptional splicing has been extensively studied, co-transcriptional splicing is gaining attention for its potential to save resources and space in molecular systems. However, its unpredictability has limited its practical applications. In this paper, we focus on engineering co-transcriptional splicing, moving beyond natural occurrences to program RNA sequences that produce specific target sequences through DNA templates. We introduce a formal model of co-transcriptional splicing, defined by constant-, linear-, and logarithmic-bounded hairpin deletion operations, as well as an unbounded hairpin deletion operation. We examine the complexity of the template constructability problem associated with these operations and study the closure properties of the languages they generate, providing insights for RNA template design in molecular programming systems.