🤖 AI Summary
De novo design of large or cross-domain protein targets faces prohibitive computational costs and sharply declining success rates with increasing target size. Method: We propose a “local-first” epitope-fragment design paradigm that models only discontinuous, functionally critical residues around the binding site—bypassing full-protein structural dependencies. Our approach integrates protein folding neural networks (PFNNs) to decode local interaction preferences, Monte Carlo evolutionary optimization, and position-specific biased inverse folding algorithms into an efficient closed-loop design pipeline. Contribution/Results: Experimental validation demonstrates an 80% improvement in design success rate and a 40-fold reduction in time per successful design. The method achieves high-affinity binding designs for challenging targets including ClpP and ALS3. By decoupling design from global structural constraints, this work establishes a scalable, structure-agnostic framework for de novo engineering of large-scale protein–protein interfaces.
📝 Abstract
Recent advances in structure-based protein design have accelerated de novo binder generation, yet interfaces on large domains or spanning multiple domains remain challenging due to high computational cost and declining success with increasing target size. We hypothesized that protein folding neural networks (PFNNs) operate in a ``local-first'' manner, prioritizing local interactions while displaying limited sensitivity to global foldability.Guided by this hypothesis, we propose an epitope-only strategy that retains only the discontinuous surface residues surrounding the binding site. Compared to intact-domain workflows, this approach improves in silico success rates by up to 80% and reduces the average time per successful design by up to forty-fold, enabling binder design against previously intractable targets such as ClpP and ALS3. Building on this foundation, we further developed a tailored pipeline that incorporates a Monte Carlo-based evolution step to overcome local minima and a position-specific biased inverse folding step to refine sequence patterns. Together, these advances not only establish a generalizable framework for efficient binder design against structurally large and otherwise inaccessible targets, but also support the broader ``local-first'' hypothesis as a guiding principle for PFNN-based design.