TY - JOUR
T1 - Localization of cancer susceptibility genes by genome-wide single-nucleotide polymorphism linkage-disequilibrium mapping
AU - Mitra, Nandita
AU - Ye, Tian Zhang
AU - Smith, Alex
AU - Chuai, Shaokun
AU - Kirchhoff, Tomas
AU - Peterlongo, Paolo
AU - Nafa, Khedoudja
AU - Phillips, Michael S.
AU - Offit, Kenneth
AU - Ellis, Nathan A.
PY - 2004/11/1
Y1 - 2004/11/1
N2 - With the large numbers of single nucleotide polymorphisms (SNPs) available and new technologies that permit high-throughput genotyping, we have investigated the possibility of the localization of disease genes with genome-wide panels of SNP markers and taking advantage of the linkage-disequilibrium (LD) between the disease gene and closely linked markers. For this purpose, we selected cases from the Ashkenazi Jewish population, in which the mutant alleles are expected to be identical by descent from a common founder and the regions of LD encompassing these mutant alleles are large. As a validation of this approach for localization, we performed two trials: one in autosomal recessive Bloom syndrome, in which a unique mutation of the BLM gene is present at elevated frequencies in cases, and the other in autosomal dominant hereditary nonpolyposis colorectal cancer (HNPCC), in which a unique mutation of MSH2 is present at elevated frequencies. In the Bloom syndrome trial, we genotyped 3,258 SNPs in 10 Jewish Bloom syndrome cases and 31 non-Bloom syndrome Jewish persons as a comparison group. In the HNPCC trial, we genotyped 8,549 SNPs in 13 Jewish HNPCC cases whose colon cancers exhibited microsatellite instability and in 63 healthy Jews as a comparison group. To identify significant associations, we performed (a) Fisher's exact test comparing genotypes at each locus in cases versus controls and (b) a haplotype analysis by estimating the frequency of haplotypes with the expectation-maximization algorithm and comparing haplotype frequencies in cases versus controls by logistic regression and a maximum likelihood ratio method. In the Bloom syndrome trial, by Fisher's exact test, statistically significant association was detected at a single locus, TSC0754862, which is a locus 1.7 million bp from BLM. Two-locus, three-locus, and four-locus haplotypes that included TSC0754862 and flanked BLM were also statistically more frequent in cases versus controls. In the HNPCC trial, although a significant P value was not obtained by the single SNP genotype analysis, significant associations were detected for several multilocus haplotypes in an 11-million-bp region that contained the MSH2 gene. This work demonstrates the power of the LD mapping approach in an isolated population and its general applicability to the identification of novel cancer-causing genes.
AB - With the large numbers of single nucleotide polymorphisms (SNPs) available and new technologies that permit high-throughput genotyping, we have investigated the possibility of the localization of disease genes with genome-wide panels of SNP markers and taking advantage of the linkage-disequilibrium (LD) between the disease gene and closely linked markers. For this purpose, we selected cases from the Ashkenazi Jewish population, in which the mutant alleles are expected to be identical by descent from a common founder and the regions of LD encompassing these mutant alleles are large. As a validation of this approach for localization, we performed two trials: one in autosomal recessive Bloom syndrome, in which a unique mutation of the BLM gene is present at elevated frequencies in cases, and the other in autosomal dominant hereditary nonpolyposis colorectal cancer (HNPCC), in which a unique mutation of MSH2 is present at elevated frequencies. In the Bloom syndrome trial, we genotyped 3,258 SNPs in 10 Jewish Bloom syndrome cases and 31 non-Bloom syndrome Jewish persons as a comparison group. In the HNPCC trial, we genotyped 8,549 SNPs in 13 Jewish HNPCC cases whose colon cancers exhibited microsatellite instability and in 63 healthy Jews as a comparison group. To identify significant associations, we performed (a) Fisher's exact test comparing genotypes at each locus in cases versus controls and (b) a haplotype analysis by estimating the frequency of haplotypes with the expectation-maximization algorithm and comparing haplotype frequencies in cases versus controls by logistic regression and a maximum likelihood ratio method. In the Bloom syndrome trial, by Fisher's exact test, statistically significant association was detected at a single locus, TSC0754862, which is a locus 1.7 million bp from BLM. Two-locus, three-locus, and four-locus haplotypes that included TSC0754862 and flanked BLM were also statistically more frequent in cases versus controls. In the HNPCC trial, although a significant P value was not obtained by the single SNP genotype analysis, significant associations were detected for several multilocus haplotypes in an 11-million-bp region that contained the MSH2 gene. This work demonstrates the power of the LD mapping approach in an isolated population and its general applicability to the identification of novel cancer-causing genes.
UR - http://www.scopus.com/inward/record.url?scp=7444268016&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=7444268016&partnerID=8YFLogxK
U2 - 10.1158/0008-5472.CAN-04-1411
DO - 10.1158/0008-5472.CAN-04-1411
M3 - Article
C2 - 15520224
AN - SCOPUS:7444268016
SN - 0008-5472
VL - 64
SP - 8116
EP - 8125
JO - Cancer Research
JF - Cancer Research
IS - 21
ER -