TY - JOUR
T1 - Geographic population structure analysis of worldwide human populations infers their biogeographical origins
AU - the Genographic Consortium
AU - Elhaik, Eran
AU - Tatarinova, Tatiana
AU - Chebotarev, Dmitri
AU - Piras, Ignazio S.
AU - Calò, Carla Maria
AU - De Montis, Antonella
AU - Atzori, Manuela
AU - Marini, Monica
AU - Tofanelli, Sergio
AU - Francalacci, Paolo
AU - Pagani, Luca
AU - Tyler-Smith, Chris
AU - Xue, Yali
AU - Cucca, Francesco
AU - Schurr, Theodore G.
AU - Gaieski, Jill B.
AU - Melendez, Carlalynne
AU - Vilar, Miguel G.
AU - Owings, Amanda C.
AU - Gòmez, Rocío
AU - Fujita, Ricardo
AU - Santos, Fabrício R.
AU - Comas, David
AU - Balanovsky, Oleg
AU - Balanovska, Elena
AU - Zalloua, Pierre
AU - Soodyall, Himla
AU - Pitchappan, Ramasamy
AU - Ganeshprasad, Arunkumar
AU - Hammer, Michael
AU - Matisoo-Smith, Lisa
AU - Wells, R. Spencer
AU - Acosta, Oscar
AU - Adhikarla, Syama
AU - Adler, Christina J.
AU - Bertranpetit, Jaume
AU - Clarke, Andrew C.
AU - Cooper, Alan
AU - Der Sarkissian, Clio S.I.
AU - Haak, Wolfgang
AU - Haber, Marc
AU - Jin, Li
AU - Kaplan, Matthew E.
AU - Li, Hui
AU - Li, Shilin
AU - Martínez-Cruz, Begoña
AU - Merchant, Nirav C.
AU - Mitchell, John R.
AU - Parida, Laxmi
AU - Platt, Daniel E.
N1 - Funding Information:
E.E is supported in part by Genographic grant GP 01-12. L.P, C.T.S and Y.X were supported by The Wellcome Trust (098051). O.B. was supported in part by Presidium RAS (MCB programme) and RFBR (13-04-01711). T.T. was supported by grants from The National Institute for General Medical Studies (GM068968), and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (HD070996). S.T. is supported by a PRIN2009 grant. The Genographic Project is supported by the National Geographic Society IBM and the Waitt Foundation. We are grateful to all Genographic participants who contributed their DNA samples for this study.
PY - 2014/4/29
Y1 - 2014/4/29
N2 - The search for a method that utilizes biological information to predict humans' place of origin has occupied scientists for millennia. Over the past four decades, scientists have employed genetic data in an effort to achieve this goal but with limited success. While biogeographical algorithms using next-generation sequencing data have achieved an accuracy of 700km in Europe, they were inaccurate elsewhere. Here we describe the Geographic Population Structure (GPS) algorithm and demonstrate its accuracy with three data sets using 40,000-130,000 SNPs. GPS placed 83% of worldwide individuals in their country of origin. Applied to over 200 Sardinians villagers, GPS placed a quarter of them in their villages and most of the rest within 50km of their villages. GPS's accuracy and power to infer the biogeography of worldwide individuals down to their country or, in some cases, village, of origin, underscores the promise of admixture-based methods for biogeography and has ramifications for genetic ancestry testing.
AB - The search for a method that utilizes biological information to predict humans' place of origin has occupied scientists for millennia. Over the past four decades, scientists have employed genetic data in an effort to achieve this goal but with limited success. While biogeographical algorithms using next-generation sequencing data have achieved an accuracy of 700km in Europe, they were inaccurate elsewhere. Here we describe the Geographic Population Structure (GPS) algorithm and demonstrate its accuracy with three data sets using 40,000-130,000 SNPs. GPS placed 83% of worldwide individuals in their country of origin. Applied to over 200 Sardinians villagers, GPS placed a quarter of them in their villages and most of the rest within 50km of their villages. GPS's accuracy and power to infer the biogeography of worldwide individuals down to their country or, in some cases, village, of origin, underscores the promise of admixture-based methods for biogeography and has ramifications for genetic ancestry testing.
UR - https://www.scopus.com/pages/publications/84899845427
UR - https://www.scopus.com/pages/publications/84899845427#tab=citedBy
U2 - 10.1038/ncomms4513
DO - 10.1038/ncomms4513
M3 - Article
C2 - 24781250
AN - SCOPUS:84899845427
SN - 2041-1723
VL - 5
JO - Nature communications
JF - Nature communications
M1 - 3513
ER -