TY - GEN
T1 - Fast approximate shortest hyperpaths for inferring pathways in cell signaling hypergraphs
AU - Krieger, Spencer
AU - Kececioglu, John
N1 - Funding Information:
Funding This research was supported by the US National Science Foundation through Grants CCF-1617192 and IIS-2041613 to John Kececioglu.
Publisher Copyright:
© Spencer Krieger and John Kececioglu; licensed under Creative Commons License CC-BY 4.0 21st International Workshop on Algorithms in Bioinformatics (WABI 2021).
PY - 2021/7/1
Y1 - 2021/7/1
N2 - Cell signaling pathways, which are a series of reactions that start at receptors and end at transcription factors, are basic to systems biology. Properly modeling the reactions in such pathways requires directed hypergraphs, where an edge is now directed between two sets of vertices. Inferring a pathway by the most parsimonious series of reactions then corresponds to finding a shortest hyperpath in a directed hypergraph, which is NP-complete. The state of the art for shortest hyperpaths in cell-signaling hypergraphs solves a mixed-integer linear program to find an optimal hyperpath that is restricted to be acyclic, and offers no efficiency guarantees. We present for the first time a heuristic for general shortest hyperpaths that properly handles cycles, and is guaranteed to be efficient. Its accuracy is demonstrated through exhaustive experiments on all instances from the standard NCI-PID and Reactome pathway databases, which show the heuristic finds a hyperpath that matches the state-of-the-art mixed-integer linear program on over 99% of all instances that are acyclic. On instances where only cyclic hyperpaths exist, the heuristic surpasses the state-of-the-art, which finds no solution; on every such cyclic instance, enumerating all possible hyperpaths shows that the solution found by the heuristic is in fact optimal.
AB - Cell signaling pathways, which are a series of reactions that start at receptors and end at transcription factors, are basic to systems biology. Properly modeling the reactions in such pathways requires directed hypergraphs, where an edge is now directed between two sets of vertices. Inferring a pathway by the most parsimonious series of reactions then corresponds to finding a shortest hyperpath in a directed hypergraph, which is NP-complete. The state of the art for shortest hyperpaths in cell-signaling hypergraphs solves a mixed-integer linear program to find an optimal hyperpath that is restricted to be acyclic, and offers no efficiency guarantees. We present for the first time a heuristic for general shortest hyperpaths that properly handles cycles, and is guaranteed to be efficient. Its accuracy is demonstrated through exhaustive experiments on all instances from the standard NCI-PID and Reactome pathway databases, which show the heuristic finds a hyperpath that matches the state-of-the-art mixed-integer linear program on over 99% of all instances that are acyclic. On instances where only cyclic hyperpaths exist, the heuristic surpasses the state-of-the-art, which finds no solution; on every such cyclic instance, enumerating all possible hyperpaths shows that the solution found by the heuristic is in fact optimal.
KW - Cell signaling networks
KW - Directed hypergraphs
KW - Efficient heuristics
KW - Hyperpath enumeration
KW - Reaction pathways
KW - Shortest hyperpaths
KW - Systems biology
UR - http://www.scopus.com/inward/record.url?scp=85115292895&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85115292895&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.WABI.2021.20
DO - 10.4230/LIPIcs.WABI.2021.20
M3 - Conference contribution
AN - SCOPUS:85115292895
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 21st International Workshop on Algorithms in Bioinformatics, WABI 2021
A2 - Carbone, Alessandra
A2 - El-Kebir, Mohammed
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 21st International Workshop on Algorithms in Bioinformatics, WABI 2021
Y2 - 2 August 2021 through 4 August 2021
ER -