Molecular Odor Prediction with Harmonic Modulated Feature Mapping and Chemically-Informed Loss

📅 2025-02-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing key challenges in molecular odor prediction—including non-smooth target modeling, strong coupling of mixed-dimensional features, and severe label imbalance—this work proposes a harmonic modulation feature mapping mechanism and a cheminformatics-guided dynamic weighting loss function. The former employs frequency-adaptive mapping to decouple heterogeneous multi-source features, enhancing structural representation independence; the latter dynamically reweights samples based on label co-occurrence priors to mitigate long-tail distribution bias. Additionally, we integrate feature importance learning, molecular ensemble optimization, and interpretability constraints to improve model robustness and mapping transparency. Extensive experiments across multiple benchmark datasets demonstrate that our method significantly outperforms existing state-of-the-art models, achieving up to a 12.6% improvement in F1-score for minority odor classes. Moreover, the framework enables mechanistically interpretable inference from molecular structure to odor descriptors.

Technology Category

Application Category

📝 Abstract
Molecular odor prediction has great potential across diverse fields such as chemistry, pharmaceuticals, and environmental science, enabling the rapid design of new materials and enhancing environmental monitoring. However, current methods face two main challenges: First, existing models struggle with non-smooth objective functions and the complexity of mixed feature dimensions; Second, datasets suffer from severe label imbalance, which hampers model training, particularly in learning minority class labels. To address these issues, we introduce a novel feature mapping method and a molecular ensemble optimization loss function. By incorporating feature importance learning and frequency modulation, our model adaptively adjusts the contribution of each feature, efficiently capturing the intricate relationship between molecular structures and odor descriptors. Our feature mapping preserves feature independence while enhancing the model's efficiency in utilizing molecular features through frequency modulation. Furthermore, the proposed loss function dynamically adjusts label weights, improves structural consistency, and strengthens label correlations, effectively addressing data imbalance and label co-occurrence challenges. Experimental results show that our method significantly can improves the accuracy of molecular odor prediction across various deep learning models, demonstrating its promising potential in molecular structure representation and chemoinformatics.
Problem

Research questions and friction points this paper is trying to address.

Molecular Odor Prediction
Complex Data Handling
Imbalanced Data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Feature Mapping
Imbalanced Data Handling
Molecular Odor Prediction
🔎 Similar Papers
No similar papers found.
H
HongXin Xie
Shandong Normal University, Jinan, China
J
JianDe Sun
Shandong Normal University, Jinan, China
Yi Shao
Yi Shao
Assistant Professor, McGill University
UHPCRobotic ConstructionStructural Optimization
S
Shuai Li
Shandong University, Jinan, China
S
Sujuan Hou
Shandong Normal University, Jinan, China
Y
YuLong Sun
Shandong Normal University, Jinan, China
Y
Yuxiang Liu
Shandong Normal University, Jinan, China