Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana: The Rice Annotation Project

Takeshi Itoh, Tsuyoshi Tanaka, Roberto A. Barrero, Chisato Yamasaki, Yasuyuki Fujii, Phillip B. Hilton, Baltazar A. Antonio, Hideo Aono, Rolf Apweiler, Richard Bruskiewich, Thomas Bureau, Frances Burr, Antonio Costa De Oliveira, Galina Fuks, Takuya Habara, Georg Haberer, Bin Han, Erimi Harada, Aiko T. Hiraki, Hirohiko HirochikaDouglas Hoen, Hiroki Hokari, Satomi Hosokawa, Yue Ie Hsing, Hiroshi Ikawa, Kazuho Ikeo, Tadashi Imanishi, Yukiyo Ito, Pankaj Jaiswal, Masako Kanno, Yoshihiro Kawahara, Toshiyuki Kawamura, Hiroaki Kawashima, Jitendra P. Khurana, Shoshi Kikuchi, Setsuko Komatsu, Kanako O. Koyanagi, Hiromi Kubooka, Damien Lieberherr, Yao Cheng Lin, David Lonsdale, Takashi Matsumoto, Akihiro Matsuya, W. Richard McCombie, Joachim Messing, Akio Miyao, Nicola Mulder, Yoshiaki Nagamura, Jongmin Nam, Nobukazu Namiki, Hisataka Numa, Shin Nurimoto, Claire O'Donovan, Hajime Ohyanagi, Toshihisa Okido, Satoshi OOta, Naoki Osato, Lance E. Palmer, Francis Quetier, Saurabh Raghuvanshi, Naomi Saichi, Hiroaki Sakai, Yasumichi Sakai, Katsumi Sakata, Tetsuya Sakurai, Fumihiko Sato, Yoshiharu Sato, Heiko Schoof, Motoaki Seki, Michie Shibata, Yuji Shimizu, Kazuo Shinozaki, Yuji Shinso, Nagendra K. Singh, Brian Smith-White, Jun Ichi Takeda, Motohiko Tanino, Tatiana Tatusova, Supat Thongjuea, Fusano Todokoro, Mika Tsugane, Akhilesh K. Tyagi, Apichart Vanavichit, Aihui Wang, Rod A. Wing, Kaori Yamaguchi, Mayu Yamamoto, Naoyuki Yamamoto, Yeisoo Yu, Hao Zhang, Qiang Zhao, Kenichi Higo, Benjamin Burr, Takashi Gojobori, Takuji Sasaki

Research output: Contribution to journalArticlepeer-review

202 Scopus citations

Abstract

We present here the annotation of the complete genome of rice Oryza sativa L. ssp. japonica cultivar Nipponbare. All functional annotations for proteins and non-protein-coding RNA (npRNA) candidates were manually curated. Functions were identified or inferred in 19,969 (70%) of the proteins, and 131 possible npRNAs (including 58 antisense transcripts) were found. Almost 5000 annotated protein-coding genes were found to be disrupted in insertional mutant lines, which will accelerate future experimental validation of the annotations. The rice loci were determined by using cDNA sequences obtained from rice and other representative cereals. Our conservative estimate based on these loci and an extrapolation suggested that the gene number of rice is ∼32,000, which is smaller than previous estimates. We conducted comparative analyses between rice and Arabidopsis thaliana and found that both genomes possessed several lineage-specific genes, which might account for the observed differences between these species, while they had similar sets of predicted functional domains among the protein sequences. A system to control translational efficiency seems to be conserved across large evolutionary distances. Moreover, the evolutionary process of protein-coding genes was examined. Our results suggest that natural selection may have played a role for duplicated genes in both species, so that duplication was suppressed or favored in a manner that depended on the function of a gene.

Original languageEnglish (US)
Pages (from-to)175-183
Number of pages9
JournalGenome Research
Volume17
Issue number2
DOIs
StatePublished - Feb 2007

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Fingerprint

Dive into the research topics of 'Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana: The Rice Annotation Project'. Together they form a unique fingerprint.

Cite this