Abstract:
Inhibition of human cytochrome P450 (CYP) enzymes can lead to drug-drug interactions, resulting in serious adverse reactions. It is therefore crucial to accurately predict the inhibitory potency of a given compound against a particular CYP isoform. This study compared 11 machine learning methods and 2 deep learning models based on different molecular representations. The experimental results showed that the CatBoost machine learning model based on the RDKit_2d+Morgan representation outperformed the other models in terms of accuracy and Matthews correlation coefficient (MCC), and also outperformed previously published models. Moreover, the CatBoost model not only performed better but also consumed fewer computational resources. Finally, this study combined the top 3 performing models into an ensemble, co_model, which slightly outperformed the CatBoost model alone.
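To make the two evaluation ideas above concrete, the following is a minimal sketch of how the Matthews correlation coefficient is computed for a binary inhibitor/non-inhibitor task, and of one plausible way to combine three models. The abstract does not state how co_model merges its members, so averaging predicted probabilities (soft voting) is an assumption here, and the names `matthews_corrcoef` and `soft_vote`, the 0.5 threshold, and the toy numbers are all illustrative rather than taken from the study.

```python
import math


def matthews_corrcoef(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (1 = inhibitor, 0 = non-inhibitor)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def soft_vote(prob_lists, threshold=0.5):
    """Average per-model inhibitor probabilities, then threshold.

    ASSUMPTION: soft voting is only one possible combination rule;
    the source does not say how co_model combines its three models.
    """
    n_models = len(prob_lists)
    averaged = [sum(column) / n_models for column in zip(*prob_lists)]
    return [1 if p >= threshold else 0 for p in averaged]


# Toy probabilities from three hypothetical models for four compounds.
model_probs = [
    [0.9, 0.2, 0.7, 0.4],
    [0.6, 0.4, 0.8, 0.3],
    [0.3, 0.1, 0.6, 0.2],
]
labels = [1, 0, 1, 0]
ensemble_pred = soft_vote(model_probs)
ensemble_mcc = matthews_corrcoef(labels, ensemble_pred)
```

MCC ranges from -1 (total disagreement) through 0 (no better than chance) to +1 (perfect prediction), which makes it more informative than accuracy alone when inhibitors and non-inhibitors are imbalanced.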