Abstract
The last decade has seen a rise in the rapid accumulation of large-scale data from both genomic technologies and from increased use of electronic health records. These advances have been accompanied by opportunities for automatic hypothesis generation in translational research; however, integrating and mining these highly heterogeneous datasets remains challenging. This chapter addresses the major principles and methods that are associated with providing effective solutions to a broad range of these problems. Indeed, these principles include issues of representation, biological scales of measurements, feature selection, and statistical approaches to address the curse of dimensionality, and approaches of integration that we divide into corroborative versus fusion approaches.
Original language | English (US) |
---|---|
Title of host publication | Methods in Biomedical Informatics |
Subtitle of host publication | A Pragmatic Approach |
Publisher | Elsevier Inc. |
Pages | 81-98 |
Number of pages | 18 |
ISBN (Print) | 9780124016781 |
DOIs | |
State | Published - Oct 2013 |
Externally published | Yes |
Keywords
- Biomarkers
- Complex diseases
- Corroborative mining
- Data fusion
- Disease genes
- Disease modules
- Genome-wide association studies
- Heterogeneous data Sources
- Hypothesis generation
- Knowledge discovery
- SNP
ASJC Scopus subject areas
- General Biochemistry, Genetics and Molecular Biology