TY - JOUR
T1 - Single feature polymorphisms between two rice cultivars detected using a median polish method
AU - Xie, Weibo
AU - Chen, Ying
AU - Zhou, Gang
AU - Wang, Lei
AU - Zhang, Chengjun
AU - Zhang, Jianwei
AU - Xiao, Jinghua
AU - Zhu, Tong
AU - Zhang, Qifa
N1 - Funding Information:
We thank Dr. James Ronald and Dr. Rachel B. Brem for help and suggestions in yeast data. This work was supported by grants from the National Special Key Project of China on Functional Genomics of Major Plants and Animals, and the National Natural Science Foundation of China.
PY - 2009/6
Y1 - 2009/6
N2 - Expression levels measured in microarrays of oligonucleotide probes have now been adapted as a high throughput approach for identifying DNA sequence variation between genotypes, referred to as single feature polymorphisms (SFPs). Although there have been increasing interests in this approach, there is still need for improving the algorithm in order to achieve high sensitivity and specificity especially with complex genome and large datasets, while maintaining optimal computational performance. We obtained microarray datasets for expression profiles of two rice cultivars and adapted a median polish method to detect SFPs. The analysis identified 6,655 SFPs between two the rice varieties representing 3,131 rice unique genes. We showed that the median polish method has the advantage of avoiding fitting complex linear models thus can be used to analyze complex transcriptome datasets like the ones in this study. The method is also superior in sensitivity, accuracy and computing time requirement compared with two previously used methods. A comparison with data from a resequencing project indicated that 75.6% of the SFPs had SNP supports in the probe regions. Further comparison revealed that SNPs in sequences immediately flanking the probes also had contributions to the detection of SFPs in cases where the probes and the targets had perfectly matched sequences. It was shown that differences in minimum free energies caused by flanking SNPs, which may change the stability of RNA secondary structure, may partly explain the SFPs as detected. These SFPs may facilitate gene discovery in future studies.
AB - Expression levels measured in microarrays of oligonucleotide probes have now been adapted as a high throughput approach for identifying DNA sequence variation between genotypes, referred to as single feature polymorphisms (SFPs). Although there have been increasing interests in this approach, there is still need for improving the algorithm in order to achieve high sensitivity and specificity especially with complex genome and large datasets, while maintaining optimal computational performance. We obtained microarray datasets for expression profiles of two rice cultivars and adapted a median polish method to detect SFPs. The analysis identified 6,655 SFPs between two the rice varieties representing 3,131 rice unique genes. We showed that the median polish method has the advantage of avoiding fitting complex linear models thus can be used to analyze complex transcriptome datasets like the ones in this study. The method is also superior in sensitivity, accuracy and computing time requirement compared with two previously used methods. A comparison with data from a resequencing project indicated that 75.6% of the SFPs had SNP supports in the probe regions. Further comparison revealed that SNPs in sequences immediately flanking the probes also had contributions to the detection of SFPs in cases where the probes and the targets had perfectly matched sequences. It was shown that differences in minimum free energies caused by flanking SNPs, which may change the stability of RNA secondary structure, may partly explain the SFPs as detected. These SFPs may facilitate gene discovery in future studies.
UR - http://www.scopus.com/inward/record.url?scp=67349095895&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67349095895&partnerID=8YFLogxK
U2 - 10.1007/s00122-009-1025-2
DO - 10.1007/s00122-009-1025-2
M3 - Article
C2 - 19370320
AN - SCOPUS:67349095895
SN - 0040-5752
VL - 119
SP - 151
EP - 164
JO - Theoretical and Applied Genetics
JF - Theoretical and Applied Genetics
IS - 1
ER -