MaizeMine: A Data Mining Warehouse for the Maize Genetics and Genomics Database.

Md Shamimuzzaman, Jack M Gardiner, Amy T Walsh, Deborah A Triant, Justin J Le Tourneau, Aditi Tayal, Deepak R Unni, Hung N Nguyen, John L Portwood, Ethalinda K S Cannon, Carson M Andorf, Christine G Elsik
Author Information
  1. Md Shamimuzzaman: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.
  2. Jack M Gardiner: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.
  3. Amy T Walsh: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.
  4. Deborah A Triant: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.
  5. Justin J Le Tourneau: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.
  6. Aditi Tayal: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.
  7. Deepak R Unni: Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States.
  8. Hung N Nguyen: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.
  9. John L Portwood: USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA, United States.
  10. Ethalinda K S Cannon: USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA, United States.
  11. Carson M Andorf: USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA, United States.
  12. Christine G Elsik: Division of Animal Sciences, University of Missouri, Columbia, MO, United States.

Abstract

MaizeMine (http://maizemine.maizegdb.org) is the data mining resource of the Maize Genetics and Genomics Database (MaizeGDB). It enables researchers to create and export customized annotation datasets that can be merged with their own research data for use in downstream analyses. MaizeMine uses the InterMine data warehousing system to integrate genomic sequences and gene annotations from the B73 RefGen_v3 and B73 RefGen_v4 genome assemblies, Gene Ontology (GO) annotations, single nucleotide polymorphisms, protein annotations, homologs, pathways, and precomputed gene expression levels based on RNA-seq data from the B73 Gene Expression Atlas. MaizeMine also provides database cross references between genes of alternative gene sets from Gramene and NCBI RefSeq. MaizeMine offers several search tools: a keyword search, built-in template queries with intuitive search menus, and a QueryBuilder tool for creating custom queries. The Genomic Regions search tool executes queries based on lists of genome coordinates and supports both the B73 RefGen_v3 and B73 RefGen_v4 assemblies. The List tool allows users to upload identifiers to create custom lists, perform set operations such as unions and intersections, and execute template queries with lists. When used with gene identifiers, the List tool automatically provides gene set enrichment for GO terms and pathways, with a choice of statistical parameters and background gene sets. Because query outputs can be saved as lists and used as input to new queries, MaizeMine supports iterative, multi-step data integration and meta-analysis.
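
The list operations and enrichment described above can be illustrated with a minimal Python sketch. It is not MaizeMine's implementation: the gene identifiers, the annotated term set, and the helper `hypergeom_pvalue` are hypothetical. The one-sided hypergeometric test is assumed here only because it is a common choice for gene set enrichment; the abstract does not specify the statistic used.

```python
from math import comb  # requires Python 3.8+


def hypergeom_pvalue(k, n, K, N):
    """Return P(X >= k) when n genes are drawn from a background of N genes,
    K of which carry the annotation (one-sided hypergeometric test)."""
    upper = min(n, K)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return hits / comb(N, n)


# Hypothetical gene lists, standing in for two uploaded identifier lists.
list_a = {"gene1", "gene2", "gene3", "gene4"}
list_b = {"gene3", "gene4", "gene5"}

# Set operations comparable to those named for the List tool.
union = list_a | list_b
intersection = list_a & list_b

# Hypothetical background gene set (20 genes) and the genes that carry
# one hypothetical GO term or pathway annotation.
background = {f"gene{i}" for i in range(1, 21)}
annotated = {"gene3", "gene4", "gene5", "gene6", "gene7"}

# Enrichment of the annotation within the intersection list.
k = len(intersection & annotated)
p_value = hypergeom_pvalue(k, len(intersection), len(annotated & background), len(background))
print(sorted(union), sorted(intersection), p_value)
```

In practice the test would be repeated for every term and followed by a multiple-testing correction. The choice of background set changes `N` and `K`, and therefore the p-value.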
