ECPICK is a Biologically interpretable deep learning enhances trustworthy enzyme commission number prediction and discovers potential motif sites.

The rapid growth of uncharacterized enzymes and their functional diversity urge accurate and trustworthy computational functional annotation tools. However, current approaches lack trustworthiness for the predictions and model interpretation, limiting model reliability on the multi-label classification problem with thousands of classes. Here, we demonstrate that our novel biologically interpretable deep learning model (ECPICK) provides a robust solution for trustworthy predictions of enzyme commission (EC) numbers with significantly enhanced predictive power and the capability to discover potential motif sites. ECPICK learns complex sequential patterns of amino acids and their hierarchical structures from twenty million proteins to create the EC number predictions. Furthermore, ECPICK identifies significant amino acids that contribute to the prediction in a given protein sequence without multiple sequence alignment, which may match to known motif sites for trustworthy prediction or potential motif sites. Our intensive assessment showed not only outstanding enhancement of predictive performance on the largest databases of Uniprot, PDB, and KEGG, but also a capability to discover new motif sites in microorganisms. ECPICK will be a reliable EC number prediction tool to identify protein functions of an increasing number of uncharacterized enzymes.


For more information please visit our Frequently Asked Questions.

Any feedback to help improve this service is greatly appreciated. You can submit feedback and bug reports using the form on our contact page.

Recent alerts and bug fixes

  • Datasets For January 2025 Has Updated
0
Jobs queued
0
Jobs active
64
Jobs finished

Last updated January 01 2025