TY - JOUR
T1 - Mechanistic phenotypes
T2 - An aggregative phenotyping strategy to identify disease mechanisms using GWAS data
AU - Mosley, Jonathan D.
AU - Van Driest, Sara L.
AU - Larkin, Emma K.
AU - Weeke, Peter E.
AU - Witte, John S.
AU - Wells, Quinn S.
AU - Karnes, Jason H.
AU - Guo, Yan
AU - Bastarache, Lisa
AU - Olson, Lana M.
AU - McCarty, Catherine A.
AU - Pacheco, Jennifer A.
AU - Jarvik, Gail P.
AU - Carrell, David S.
AU - Larson, Eric B.
AU - Crosslin, David R.
AU - Kullo, Iftikhar J.
AU - Tromp, Gerard
AU - Kuivaniemi, Helena
AU - Carey, David J.
AU - Ritchie, Marylyn D.
AU - Denny, Josh C.
AU - Roden, Dan M.
PY - 2013/12/12
Y1 - 2013/12/12
N2 - A single mutation can alter cellular and global homeostatic mechanisms and give rise to multiple clinical diseases. We hypothesized that these disease mechanisms could be identified using low minor allele frequency (MAF<0.1) non-synonymous SNPs (nsSNPs) associated with "mechanistic phenotypes", comprised of collections of related diagnoses. We studied two mechanistic phenotypes: (1) thrombosis, evaluated in a population of 1,655 African Americans; and (2) four groupings of cancer diagnoses, evaluated in 3,009 white European Americans. We tested associations between nsSNPs represented on GWAS platforms and mechanistic phenotypes ascertained from electronic medical records (EMRs), and sought enrichment in functional ontologies across the top-ranked associations. We used a two-step analytic approach whereby nsSNPs were first sorted by the strength of their association with a phenotype. We tested associations using two reverse genetic models and standard additive and recessive models. In the second step, we employed a hypothesis-free ontological enrichment analysis using the sorted nsSNPs to identify functional mechanisms underlying the diagnoses comprising the mechanistic phenotypes. The thrombosis phenotype was solely associated with ontologies related to blood coagulation (Fisher's p = 0.0001, FDR p = 0.03), driven by the F5, P2RY12 and F2RL2 genes. For the cancer phenotypes, the reverse genetics models were enriched in DNA repair functions (p = 2x10-5, FDR p = 0.03) (POLG/FANCI, SLX4/FANCP, XRCC1, BRCA1, FANCA, CHD1L) while the additive model showed enrichment related to chromatid segregation (p = 4610-6, FDR p = 0.005) (KIF25, PINX1). We were able to replicate nsSNP associations for POLG/FANCI, BRCA1, FANCA and CHD1L in independent data sets. Mechanism-oriented phenotyping using collections of EMR-derived diagnoses can elucidate fundamental disease mechanisms.
AB - A single mutation can alter cellular and global homeostatic mechanisms and give rise to multiple clinical diseases. We hypothesized that these disease mechanisms could be identified using low minor allele frequency (MAF<0.1) non-synonymous SNPs (nsSNPs) associated with "mechanistic phenotypes", comprised of collections of related diagnoses. We studied two mechanistic phenotypes: (1) thrombosis, evaluated in a population of 1,655 African Americans; and (2) four groupings of cancer diagnoses, evaluated in 3,009 white European Americans. We tested associations between nsSNPs represented on GWAS platforms and mechanistic phenotypes ascertained from electronic medical records (EMRs), and sought enrichment in functional ontologies across the top-ranked associations. We used a two-step analytic approach whereby nsSNPs were first sorted by the strength of their association with a phenotype. We tested associations using two reverse genetic models and standard additive and recessive models. In the second step, we employed a hypothesis-free ontological enrichment analysis using the sorted nsSNPs to identify functional mechanisms underlying the diagnoses comprising the mechanistic phenotypes. The thrombosis phenotype was solely associated with ontologies related to blood coagulation (Fisher's p = 0.0001, FDR p = 0.03), driven by the F5, P2RY12 and F2RL2 genes. For the cancer phenotypes, the reverse genetics models were enriched in DNA repair functions (p = 2x10-5, FDR p = 0.03) (POLG/FANCI, SLX4/FANCP, XRCC1, BRCA1, FANCA, CHD1L) while the additive model showed enrichment related to chromatid segregation (p = 4610-6, FDR p = 0.005) (KIF25, PINX1). We were able to replicate nsSNP associations for POLG/FANCI, BRCA1, FANCA and CHD1L in independent data sets. Mechanism-oriented phenotyping using collections of EMR-derived diagnoses can elucidate fundamental disease mechanisms.
UR - http://www.scopus.com/inward/record.url?scp=84892565751&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84892565751&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0081503
DO - 10.1371/journal.pone.0081503
M3 - Article
C2 - 24349080
AN - SCOPUS:84892565751
SN - 1932-6203
VL - 8
JO - PloS one
JF - PloS one
IS - 12
M1 - e81503
ER -