Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

MFS

General information

URL: https://mfs.maizegdb.org
Full name: Maize Feature Store
Description: The Maize feature store is a centralized repository of updated, raw and transformed data for solving complex biological problems associated with the maize genome.
Year founded: 2023
Last update: 2023-11-06
Version: v1.0
Accessibility:
Accessible
Country/Region: United States

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
Keywords:

Contact information

University/Institution: Corn Insects and Crop Genetics Research Unit
Address:
City:
Province/State:
Country/Region: United States
Contact name (PI/Team): carson.andorf@usda.gov
Contact email (PI/Helpdesk): carson.andorf@usda.gov

Publications

37935586
Maize Feature Store: A centralized resource to manage and analyze curated maize multi-omics features for machine learning applications. [PMID: 37935586]
Shatabdi Sen, Margaret R Woodhouse, John L Portwood, Carson M Andorf

The big-data analysis of complex data associated with maize genomes accelerates genetic research and improves agronomic traits. As a result, efforts have increased to integrate diverse datasets and extract meaning from these measurements. Machine learning models are a powerful tool for gaining knowledge from large and complex datasets. However, these models must be trained on high-quality features to succeed. Currently, there are no solutions to host maize multi-omics datasets with end-to-end solutions for evaluating and linking features to target gene annotations. Our work presents the Maize Feature Store (MFS), a versatile application that combines features built on complex data to facilitate exploration, modeling and analysis. Feature stores allow researchers to rapidly deploy machine learning applications by managing and providing access to frequently used features. We populated the MFS for the maize reference genome with over 14 000 gene-based features based on published genomic, transcriptomic, epigenomic, variomic and proteomics datasets. Using the MFS, we created an accurate pan-genome classification model with an AUC-ROC score of 0.87. The MFS is publicly available through the maize genetics and genomics database. Database URL  https://mfs.maizegdb.org/.

Database (Oxford). 2023:2023() | 3 Citations (from Europe PMC, 2025-12-20)

Ranking

All databases:
4631/6895 (32.85%)
Gene genome and annotation:
1407/2021 (30.43%)
Genotype phenotype and variation:
668/1005 (33.632%)
Expression:
950/1347 (29.547%)
Modification:
274/337 (18.991%)
4631
Total Rank
3
Citations
1.5
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2024-07-16
Curated by:
Wenzhuo Cheng [2024-08-28]
Wenzhuo Cheng [2024-07-25]
zheng luo [2024-07-16]