TY - GEN
T1 - Data fusion approach for learning transcriptional Bayesian networks
AU - Sauta, Elisabetta
AU - Demartini, Andrea
AU - Vitali, Francesca
AU - Riva, Alberto
AU - Bellazzi, Riccardo
N1 - Publisher Copyright:
© Springer International Publishing AG 2017.
PY - 2017
Y1 - 2017
N2 - The complexity of gene expression regulation relies on the synergic nature underlying the molecular interplay among its principal actors, transcription factors (TFs). Exerting a spatiotemporal control on their target genes, they define transcriptional programs across the genome, which are strongly perturbed in a disease context. In order to gain a more comprehensive picture of these complex dynamics, a data fusion approach, aimed at performing the integration of heterogeneous -omics data is fundamental. Bayesian Networks provide a natural framework for integrating different sources of data and knowledge through the priors’ use. In this work, we developed an hybrid structure-learning algorithm with the aim of exploiting TF ChIP-seq and gene expression (GE) data to investigate disease-specific transcriptional regulations in a genome-wide perspective. TF ChIP seq profiles were firstly used for structure learning and then integrated in the model as a prior probability. GE panels were employed to learn the model parameters, trying to find the best heuristic transcriptional network. We applied our approach to a specific pathological case, the chronic myeloid leukemia (CML), a myeloproliferative disorder, whose transcriptional mechanisms have not yet been deeply elucidated. The proposed data-driven method allows to investigate transcriptional signatures, highlighting in the obtained probabilistic network a three-layered hierarchy, as a different TFs influence on gene expression cellular programs.
AB - The complexity of gene expression regulation relies on the synergic nature underlying the molecular interplay among its principal actors, transcription factors (TFs). Exerting a spatiotemporal control on their target genes, they define transcriptional programs across the genome, which are strongly perturbed in a disease context. In order to gain a more comprehensive picture of these complex dynamics, a data fusion approach, aimed at performing the integration of heterogeneous -omics data is fundamental. Bayesian Networks provide a natural framework for integrating different sources of data and knowledge through the priors’ use. In this work, we developed an hybrid structure-learning algorithm with the aim of exploiting TF ChIP-seq and gene expression (GE) data to investigate disease-specific transcriptional regulations in a genome-wide perspective. TF ChIP seq profiles were firstly used for structure learning and then integrated in the model as a prior probability. GE panels were employed to learn the model parameters, trying to find the best heuristic transcriptional network. We applied our approach to a specific pathological case, the chronic myeloid leukemia (CML), a myeloproliferative disorder, whose transcriptional mechanisms have not yet been deeply elucidated. The proposed data-driven method allows to investigate transcriptional signatures, highlighting in the obtained probabilistic network a three-layered hierarchy, as a different TFs influence on gene expression cellular programs.
KW - -omics data integration
KW - Bayesian networks
KW - Transcriptional regulations
UR - http://www.scopus.com/inward/record.url?scp=85021690497&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85021690497&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-59758-4_8
DO - 10.1007/978-3-319-59758-4_8
M3 - Conference contribution
AN - SCOPUS:85021690497
SN - 9783319597577
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 76
EP - 80
BT - Artificial Intelligence in Medicine - 16th Conference on Artificial Intelligence in Medicine, AIME 2017, Proceedings
A2 - [surname]ten Teije, Annette
A2 - Popow, Christian
A2 - Sacchi, Lucia
A2 - Holmes, John H.
PB - Springer-Verlag
T2 - 16th Conference on Artificial Intelligence in Medicine, AIME 2017
Y2 - 21 June 2017 through 24 June 2017
ER -