TY - JOUR
T1 - Microarray-based analysis of gene expression in very large gene families
T2 - The cytochrome P450 gene superfamily of Arabidopsis thaliana
AU - Xu, Wenying
AU - Bak, Søren
AU - Decker, Adria
AU - Paquette, Suzanne M.
AU - Feyereisen, René
AU - Galbraith, David W.
N1 - Funding Information:
This work was supported by the Human Frontiers Science Program (grant RG0280/1999M) and the USDA NRICGP (grant 9701472). S.B. was supported by The Danish Agricultural and Veterinary Research Council (grant 970265). We thank Jiri Macas for assistance in the initial phase of the project, and Michael Deyholos, Betsy Pierson, Georgina Lambert, Frans Tax and Kenneth Feldmann for many helpful discussions.
PY - 2001/7/11
Y1 - 2001/7/11
N2 - Cytochrome P450 (P450s) are heme-thiolate protein products of a very large gene superfamily, present in all kingdoms and involved in a variety of metabolic reactions. P450s are classified according to the degree of amino acid sequence identity, with P450s of the same family defined as having >40% identity, and P450s of the same subfamily having >55% identity. Currently, 273 P450 genes distributed over 45 families have been identified in Arabidopsis, and its genome is estimated to contain as many as 286. Genome-wide DNA microarrays make it possible to broadly correlate P450 gene activity with alterations in physiological or developmental states. A potential problem with microarray research is that sequence similarity between and within these families of closely related genes may lead to cross-hybridization. We designed experiments to systematically evaluate the specificity of P450 microarrays, and showed that conditions could be optimized to provide a very high degree of hybridization specificity. Under these conditions, and employing a 20% intensity value of maximum hybridization intensity as a cut-off, labeled P450 genes exhibited essentially no cross-hybridization between families and within subfamilies. We also compared the gene transcription levels of microarray probes derived from EST clones and from genomic DNA sequences for which ESTs were not available, using cDNA produced from RNA from various Arabidopsis tissue as the target. Many of the P450 genes displayed tissue-specific expression, leading to hypotheses as to the function of individual genes and their regulation. We also observed that several of the genomic sequences reported high levels of expression, highlighting the limitations of expression analysis based on ESTs alone.
AB - Cytochrome P450 (P450s) are heme-thiolate protein products of a very large gene superfamily, present in all kingdoms and involved in a variety of metabolic reactions. P450s are classified according to the degree of amino acid sequence identity, with P450s of the same family defined as having >40% identity, and P450s of the same subfamily having >55% identity. Currently, 273 P450 genes distributed over 45 families have been identified in Arabidopsis, and its genome is estimated to contain as many as 286. Genome-wide DNA microarrays make it possible to broadly correlate P450 gene activity with alterations in physiological or developmental states. A potential problem with microarray research is that sequence similarity between and within these families of closely related genes may lead to cross-hybridization. We designed experiments to systematically evaluate the specificity of P450 microarrays, and showed that conditions could be optimized to provide a very high degree of hybridization specificity. Under these conditions, and employing a 20% intensity value of maximum hybridization intensity as a cut-off, labeled P450 genes exhibited essentially no cross-hybridization between families and within subfamilies. We also compared the gene transcription levels of microarray probes derived from EST clones and from genomic DNA sequences for which ESTs were not available, using cDNA produced from RNA from various Arabidopsis tissue as the target. Many of the P450 genes displayed tissue-specific expression, leading to hypotheses as to the function of individual genes and their regulation. We also observed that several of the genomic sequences reported high levels of expression, highlighting the limitations of expression analysis based on ESTs alone.
KW - Cross-hybridization
KW - Gene families
KW - Microarrays
KW - Sequence identity
UR - http://www.scopus.com/inward/record.url?scp=0035844995&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0035844995&partnerID=8YFLogxK
U2 - 10.1016/S0378-1119(01)00516-9
DO - 10.1016/S0378-1119(01)00516-9
M3 - Article
C2 - 11470511
AN - SCOPUS:0035844995
SN - 0378-1119
VL - 272
SP - 61
EP - 74
JO - Gene
JF - Gene
IS - 1-2
ER -