TY - JOUR
T1 - Protease target prediction via matrix factorization
AU - Marini, Simone
AU - Vitali, Francesca
AU - Rampazzi, Sara
AU - Demartini, Andrea
AU - Akutsu, Tatsuya
N1 - Funding Information:
While developing part of this work, Simone Marini was an International Research Fellow of the Japan Society for the Promotion of Science.
Publisher Copyright:
© 2018 The Author(s). Published by Oxford University Press. All rights reserved.
PY - 2019/3/15
Y1 - 2019/3/15
N2 - Motivation Protein cleavage is an important cellular event, involved in a myriad of processes, from apoptosis to immune response. Bioinformatics provides in silico tools, such as machine learning-based models, to guide the discovery of targets for the proteases responsible for protein cleavage. State-of-the-art models have a scope limited to specific protease families (such as Caspases), and do not explicitly include biological or medical knowledge (such as the hierarchical protein domain similarity or gene-gene interactions). To fill this gap, we present a novel approach for protease target prediction based on data integration. Results By representing protease-protein target information in the form of relational matrices, we design a model (i) that is general and not limited to a single protease family, and (b) leverages on the available knowledge, managing extremely sparse data from heterogeneous data sources, including primary sequence, pathways, domains and interactions. When compared with other algorithms on test data, our approach provides a better performance even for models specifically focusing on a single protease family.
AB - Motivation Protein cleavage is an important cellular event, involved in a myriad of processes, from apoptosis to immune response. Bioinformatics provides in silico tools, such as machine learning-based models, to guide the discovery of targets for the proteases responsible for protein cleavage. State-of-the-art models have a scope limited to specific protease families (such as Caspases), and do not explicitly include biological or medical knowledge (such as the hierarchical protein domain similarity or gene-gene interactions). To fill this gap, we present a novel approach for protease target prediction based on data integration. Results By representing protease-protein target information in the form of relational matrices, we design a model (i) that is general and not limited to a single protease family, and (b) leverages on the available knowledge, managing extremely sparse data from heterogeneous data sources, including primary sequence, pathways, domains and interactions. When compared with other algorithms on test data, our approach provides a better performance even for models specifically focusing on a single protease family.
UR - http://www.scopus.com/inward/record.url?scp=85059757963&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85059757963&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bty746
DO - 10.1093/bioinformatics/bty746
M3 - Article
C2 - 30169576
AN - SCOPUS:85059757963
VL - 35
SP - 923
EP - 929
JO - Bioinformatics
JF - Bioinformatics
SN - 1367-4803
IS - 6
ER -