KANEL: Kolmogorov-Arnold Network Ensemble Learning Enables Early Hit Enrichment in High-Throughput Virtual Screening

📅 2026-03-25
📈 Citations: 0
Influential: 0
📄 PDF

career value

202K/year
🤖 AI Summary
This work addresses the limitation of traditional virtual screening, which relies on global metrics such as AUC and often fails to effectively prioritize high-value compounds in early enrichment. To overcome this, the authors propose KANEL, an ensemble learning framework that, for the first time, integrates the interpretable Kolmogorov–Arnold network into virtual screening. KANEL combines this novel architecture with XGBoost, Random Forest, and Multilayer Perceptron, while fusing diverse molecular representations—including LillyMol and RDKit descriptors alongside Morgan fingerprints. By leveraging multi-model and multi-representation synergy, the framework substantially improves early enrichment performance, particularly in terms of positive predictive value at top-N rankings (PPV@N), outperforming both conventional single-model approaches and methods optimized solely for global evaluation metrics.

Technology Category

Application Category

📝 Abstract
Machine learning models of chemical bioactivity are increasingly used for prioritizing a small number of compounds in virtual screening libraries for experimental follow-up. In these applications, assessing model accuracy by early hit enrichment such as Positive Predicted Value (PPV) calculated for top N hits (PPV@N) is more appropriate and actionable than traditional global metrics such as AUC. We present KANEL, an ensemble workflow that combines interpretable Kolmogorov-Arnold Networks (KANs) with XGBoost, random forest, and multilayer perceptron models trained on complementary molecular representations (LillyMol descriptors, RDKit-derived descriptors, and Morgan fingerprints).
Problem

Research questions and friction points this paper is trying to address.

early hit enrichment
virtual screening
PPV@N
chemical bioactivity
machine learning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Kolmogorov-Arnold Network
ensemble learning
early hit enrichment
virtual screening
PPV@N
💼 Related Jobs
Postdoctoral Fellow – AI-Driven Multi-Omics Integration for Predictive Toxicology
Pfizer
The annual base salary for this position ranges from $64,600.00 to $107,600.00. In addition, this position is eligible for participation in Pfizer’s Global Performance Plan with a bonus target of 7.5% of the base salary. We offer comprehensive and generous benefits and programs to help our colleagues lead healthy lives and to support each of life’s moments. Benefits offered include a 401(k) plan with Pfizer Matching Contributions and an additional Pfizer Retirement Savings Contribution, paid vacation, holiday and personal days, paid caregiver/parental and medical leave, and health benefits to include medical, prescription drug, dental and vision coverage. Learn more at Pfizer Candidate Site – U.S. Benefits | (uscandidates.mypfizerbenefits.com). Pfizer compensation structures and benefit packages are aligned based on the location of hire. The United States salary range provided does not apply to Tampa, FL or any location outside of the United States. Relocation assistance may be available based on business needs and/or eligibility.
Hybrid
P
Pavel Koptev
AI1 Technologies
N
Nikita Krainov
AI1 Technologies
K
Konstantin Malkov
AI1 Technologies, KAN Technologies, LLC
Alexander Tropsha
Alexander Tropsha
Professor at UNC-Chapel Hill
Cheminformaticschemoinformaticscomputational toxicologydrug discoverymaterials informatics