RGAP


39558187	The rice genome annotation project: an updated database for mining the rice genome. [PMID: 39558187] Hamilton JP, Li C, Buell CR. Abstract Rice (Oryza sativa L.) is a major cereal crop that provides calories across the world. With a small genome, rice has been used extensively as a model for genetic and genomic studies in the Poaceae. Since the release of the first rice genome sequence in 2002, an improved reference genome assembly, multiple whole genome assemblies, extensive gene expression profiles, and resequencing data from over 3000 rice accessions have been generated. To facilitate access to the rice genome for plant biologists, we updated the Rice Genome Annotation Project database (RGAP; https://rice.uga.edu) with new datasets including 16 whole genome rice assemblies and sequence variants generated from multiple rice pan-genome projects including the 3000 Rice Genomes Project. We updated gene expression abundance data with 80 RNA-sequencing datasets and to facilitate gene function discovery, performed gene coexpression resulting in 39 coexpression modules that capture highly connected sets of co-regulated genes. To facilitate comparative genome analyses, 32 335 syntelogs were identified between the Nipponbare reference genome and other rice genomes and 19 371 syntelogs were identified between Nipponbare and four other Poaceae genomes. Infrastructure improvements to the RGAP database include an upgraded genome browser and data access portals, enhanced website security and increased performance of the website. Nucleic Acids Res. 2025:53(D1) \| 27 Citations (from Europe PMC, 2026-05-23)
24280374	Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. [PMID: 24280374] Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, McCombie WR, Ouyang S, Schwartz DC, Tanaka T, Wu J, Zhou S, Childs KL, Davidson RM, Lin H, Quesada-Ocampo L, Vaillancourt B, Sakai H, Lee SS, Kim J, Numa H, Itoh T, Buell CR, Matsumoto T. Abstract Background Rice research has been enabled by access to the high quality reference genome sequence generated in 2005 by the International Rice Genome Sequencing Project (IRGSP). To further facilitate genomic-enabled research, we have updated and validated the genome assembly and sequence for the Nipponbare cultivar of Oryza sativa (japonica group). Results The Nipponbare genome assembly was updated by revising and validating the minimal tiling path of clones with the optical map for rice. Sequencing errors in the revised genome assembly were identified by re-sequencing the genome of two different Nipponbare individuals using the Illumina Genome Analyzer II/IIx platform. A total of 4,886 sequencing errors were identified in 321 Mb of the assembled genome indicating an error rate in the original IRGSP assembly of only 0.15 per 10,000 nucleotides. A small number (five) of insertions/deletions were identified using longer reads generated using the Roche 454 pyrosequencing platform. As the re-sequencing data were generated from two different individuals, we were able to identify a number of allelic differences between the original individual used in the IRGSP effort and the two individuals used in the re-sequencing effort. The revised assembly, termed Os-Nipponbare-Reference-IRGSP-1.0, is now being used in updated releases of the Rice Annotation Project and the Michigan State University Rice Genome Annotation Project, thereby providing a unified set of pseudomolecules for the rice community. Conclusions A revised, error-corrected, and validated assembly of the Nipponbare cultivar of rice was generated using optical map data, re-sequencing data, and manual curation that will facilitate on-going and future research in rice. Detection of polymorphisms between three different Nipponbare individuals highlights that allelic differences between individuals should be considered in diversity studies. Rice (N Y). 2013:6(1) \| 1513 Citations (from Europe PMC, 2026-05-23)
17145706	The TIGR Rice Genome Annotation Resource: improvements and new features. [PMID: 17145706] Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, Childs K, Thibaud-Nissen F, Malek RL, Lee Y, Zheng L, Orvis J, Haas B, Wortman J, Buell CR. Abstract In The Institute for Genomic Research Rice Genome Annotation project (http://rice.tigr.org), we have continued to update the rice genome sequence with new data and improve the quality of the annotation. In our current release of annotation (Release 4.0; January 12, 2006), we have identified 42,653 non-transposable element-related genes encoding 49,472 gene models as a result of the detection of alternative splicing. We have refined our identification methods for transposable element-related genes resulting in 13,237 genes that are related to transposable elements. Through incorporation of multiple transcript and proteomic expression data sets, we have been able to annotate 24 799 genes (31,739 gene models), representing approximately 50% of the total gene models, as expressed in the rice genome. All structural and functional annotation is viewable through our Rice Genome Browser which currently supports 59 tracks. Enhanced data access is available through web interfaces, FTP downloads and a Data Extractor tool developed in order to support discrete dataset downloads. Nucleic Acids Res. 2007:35(Database issue) \| 936 Citations (from Europe PMC, 2026-05-23)
15888674	The institute for genomic research Osa1 rice genome annotation database. [PMID: 15888674] Yuan Q, Ouyang S, Wang A, Zhu W, Maiti R, Lin H, Hamilton J, Haas B, Sultana R, Cheung F, Wortman J, Buell CR. Abstract We have developed a rice (Oryza sativa) genome annotation database (Osa1) that provides structural and functional annotation for this emerging model species. Using the sequence of O. sativa subsp. japonica cv Nipponbare from the International Rice Genome Sequencing Project, pseudomolecules, or virtual contigs, of the 12 rice chromosomes were constructed. Our most recent release, version 3, represents our third build of the pseudomolecules and is composed of 98% finished sequence. Genes were identified using a series of computational methods developed for Arabidopsis (Arabidopsis thaliana) that were modified for use with the rice genome. In release 3 of our annotation, we identified 57,915 genes, of which 14,196 are related to transposable elements. Of these 43,719 non-transposable element-related genes, 18,545 (42.4%) were annotated with a putative function, 5,777 (13.2%) were annotated as encoding an expressed protein with no known function, and the remaining 19,397 (44.4%) were annotated as encoding a hypothetical protein. Multiple splice forms (5,873) were detected for 2,538 genes, resulting in a total of 61,250 gene models in the rice genome. We incorporated experimental evidence into 18,252 gene models to improve the quality of the structural annotation. A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species. All structural and functional annotation data are available through interactive search and display windows as well as through download of flat files. To integrate the data with other genome projects, the annotation data are available through a Distributed Annotation System and a Genome Browser. All data can be obtained through the project Web pages at http://rice.tigr.org. Plant Physiol. 2005:138(1) \| 143 Citations (from Europe PMC, 2026-05-23)

The rice genome annotation project: an updated database for mining the rice genome. [PMID: 39558187]

Hamilton JP, Li C, Buell CR.

Rice (Oryza sativa L.) is a major cereal crop that provides calories across the world. With a small genome, rice has been used extensively as a model for genetic and genomic studies in the Poaceae. Since the release of the first rice genome sequence in 2002, an improved reference genome assembly, multiple whole genome assemblies, extensive gene expression profiles, and resequencing data from over 3000 rice accessions have been generated. To facilitate access to the rice genome for plant biologists, we updated the Rice Genome Annotation Project database (RGAP; https://rice.uga.edu) with new datasets including 16 whole genome rice assemblies and sequence variants generated from multiple rice pan-genome projects including the 3000 Rice Genomes Project. We updated gene expression abundance data with 80 RNA-sequencing datasets and to facilitate gene function discovery, performed gene coexpression resulting in 39 coexpression modules that capture highly connected sets of co-regulated genes. To facilitate comparative genome analyses, 32 335 syntelogs were identified between the Nipponbare reference genome and other rice genomes and 19 371 syntelogs were identified between Nipponbare and four other Poaceae genomes. Infrastructure improvements to the RGAP database include an upgraded genome browser and data access portals, enhanced website security and increased performance of the website.

Nucleic Acids Res. 2025:53(D1) | 27 Citations (from Europe PMC, 2026-05-23)

Improvement of the Oryza sativa Nipponbare reference genome using next generation sequence and optical map data. [PMID: 24280374]

Kawahara Y, de la Bastide M, Hamilton JP, Kanamori H, McCombie WR, Ouyang S, Schwartz DC, Tanaka T, Wu J, Zhou S, Childs KL, Davidson RM, Lin H, Quesada-Ocampo L, Vaillancourt B, Sakai H, Lee SS, Kim J, Numa H, Itoh T, Buell CR, Matsumoto T.

Abstract

Background

Rice research has been enabled by access to the high quality reference genome sequence generated in 2005 by the International Rice Genome Sequencing Project (IRGSP). To further facilitate genomic-enabled research, we have updated and validated the genome assembly and sequence for the Nipponbare cultivar of Oryza sativa (japonica group).

Results

The Nipponbare genome assembly was updated by revising and validating the minimal tiling path of clones with the optical map for rice. Sequencing errors in the revised genome assembly were identified by re-sequencing the genome of two different Nipponbare individuals using the Illumina Genome Analyzer II/IIx platform. A total of 4,886 sequencing errors were identified in 321 Mb of the assembled genome indicating an error rate in the original IRGSP assembly of only 0.15 per 10,000 nucleotides. A small number (five) of insertions/deletions were identified using longer reads generated using the Roche 454 pyrosequencing platform. As the re-sequencing data were generated from two different individuals, we were able to identify a number of allelic differences between the original individual used in the IRGSP effort and the two individuals used in the re-sequencing effort. The revised assembly, termed Os-Nipponbare-Reference-IRGSP-1.0, is now being used in updated releases of the Rice Annotation Project and the Michigan State University Rice Genome Annotation Project, thereby providing a unified set of pseudomolecules for the rice community.

Conclusions

A revised, error-corrected, and validated assembly of the Nipponbare cultivar of rice was generated using optical map data, re-sequencing data, and manual curation that will facilitate on-going and future research in rice. Detection of polymorphisms between three different Nipponbare individuals highlights that allelic differences between individuals should be considered in diversity studies.

Rice (N Y). 2013:6(1) | 1513 Citations (from Europe PMC, 2026-05-23)

The TIGR Rice Genome Annotation Resource: improvements and new features. [PMID: 17145706]

Ouyang S, Zhu W, Hamilton J, Lin H, Campbell M, Childs K, Thibaud-Nissen F, Malek RL, Lee Y, Zheng L, Orvis J, Haas B, Wortman J, Buell CR.

Abstract

In The Institute for Genomic Research Rice Genome Annotation project (http://rice.tigr.org), we have continued to update the rice genome sequence with new data and improve the quality of the annotation. In our current release of annotation (Release 4.0; January 12, 2006), we have identified 42,653 non-transposable element-related genes encoding 49,472 gene models as a result of the detection of alternative splicing. We have refined our identification methods for transposable element-related genes resulting in 13,237 genes that are related to transposable elements. Through incorporation of multiple transcript and proteomic expression data sets, we have been able to annotate 24 799 genes (31,739 gene models), representing approximately 50% of the total gene models, as expressed in the rice genome. All structural and functional annotation is viewable through our Rice Genome Browser which currently supports 59 tracks. Enhanced data access is available through web interfaces, FTP downloads and a Data Extractor tool developed in order to support discrete dataset downloads.

Nucleic Acids Res. 2007:35(Database issue) | 936 Citations (from Europe PMC, 2026-05-23)

The institute for genomic research Osa1 rice genome annotation database. [PMID: 15888674]

Yuan Q, Ouyang S, Wang A, Zhu W, Maiti R, Lin H, Hamilton J, Haas B, Sultana R, Cheung F, Wortman J, Buell CR.

Abstract

We have developed a rice (Oryza sativa) genome annotation database (Osa1) that provides structural and functional annotation for this emerging model species. Using the sequence of O. sativa subsp. japonica cv Nipponbare from the International Rice Genome Sequencing Project, pseudomolecules, or virtual contigs, of the 12 rice chromosomes were constructed. Our most recent release, version 3, represents our third build of the pseudomolecules and is composed of 98% finished sequence. Genes were identified using a series of computational methods developed for Arabidopsis (Arabidopsis thaliana) that were modified for use with the rice genome. In release 3 of our annotation, we identified 57,915 genes, of which 14,196 are related to transposable elements. Of these 43,719 non-transposable element-related genes, 18,545 (42.4%) were annotated with a putative function, 5,777 (13.2%) were annotated as encoding an expressed protein with no known function, and the remaining 19,397 (44.4%) were annotated as encoding a hypothetical protein. Multiple splice forms (5,873) were detected for 2,538 genes, resulting in a total of 61,250 gene models in the rice genome. We incorporated experimental evidence into 18,252 gene models to improve the quality of the structural annotation. A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species. All structural and functional annotation data are available through interactive search and display windows as well as through download of flat files. To integrate the data with other genome projects, the annotation data are available through a Distributed Annotation System and a Genome Browser. All data can be obtained through the project Web pages at http://rice.tigr.org.

Plant Physiol. 2005:138(1) | 143 Citations (from Europe PMC, 2026-05-23)

URL:	https://rice.uga.edu
Full name:	Rice Genome Annotation Project
Description:	RGAP includes 16 whole genome rice assemblies and sequence variants generated from multiple rice pan-genome projects including the 3000 Rice Genomes Project. We updated gene expression abundance data with 80 RNA-sequencing datasets and to facilitate gene function discovery, performed gene coexpression resulting in 39 coexpression modules that capture highly connected sets of co-regulated genes.
Year founded:	2005
Last update:	2024-11-19
Version:	Release 7
Accessibility:	Accessible
Country/Region:	United States

Data type:	DNA
Data object:	Plant
Database category:	Gene genome and annotation Metadata
Major species:	Oryza sativa
Keywords:	genome annotation rice genome

University/Institution:	University of Georgia
Address:
City:	Athens
Province/State:	Georgia
Country/Region:	United States
Contact name (PI/Team):	C Robin Buell
Contact email (PI/Helpdesk):	Robin.Buell@uga.edu

Database Commons
a catalog of worldwide biological databases

a catalog of worldwide biological databases

Database Profile

General information

Classification & Tag

Contact information

Publications

Background

Results

Conclusions

Ranking

Community reviews

Word cloud

Tags

Related Databases

Record metadata

Database Commons a catalog of worldwide biological databases

a catalog of worldwide biological databases

Database Profile

RGAP

General information

Classification & Tag

Contact information

Publications

Background

Results

Conclusions

Ranking

Community reviews

Word cloud

Tags

Related Databases

Record metadata

Database Commons
a catalog of worldwide biological databases