MethBank is a comprehensive DNA methylation database. It integrates consensus reference methylomes (CRMs), whole genome single-base resolution methylomes (SRMs), DNA & RNA methylation Tools (MeTools) and knowledge of epigenome-wide association studies (EWAS), provides an interactive browser for visualization and develops multiple tools for analysis.
To support information search and exploration, MethBank provides friendly web interfaces to
diverse information for a specific gene, region or key word.
For CRMs user click button, input related information according to the prompt of menu, and can learn reference methylation levels of specific probe ID (s), or specific gene ID (s).
Step 1: Input gene symbol tp53, click “Search” button
Step 2: Click specific gene name in the table, MethBank-CRMs will show methylation profiles of this gene in different tissue.
For SRMs, user also can learn (i) average methylation levels of promoter and genebody for
gene; (ii) promoter methylation levels of genes associated with DMPs between different
stages or tissues; (iii) the catalog of genes related to methylated CpG islands.
DMP: Genes related with methylated CpGIs: Methylation profiles of genes:
For EWAS, user click button, input specific probe ID (s) or specific gene ID (s) to the prompt
menu, and can learn information of related genes, probes and traits.
MethBank-CRM provides a tool to predict methylation age of human, named Age Predictor. Based on large-scale human methylation datasets integrated in MethBank, the age-related CpG sites with linear DNA methylation changes during aging are identified by Spearman correlation (|r| > 0.6). As a result, 52 age-related CpG sites (shown below) are selected in terms of their correlation and further employed with three machine learning models (Random Forest, SVM, and Elastic Net) to predict human DNA methylation age. Technically, the random forest algorithm is implemented by the randomForest (version 4.6-12) R package, where the parameter settings are ntree = 500 and mtry = 17. The SVM algorithm is implemented by the e1071 (version 1.6-7) R package with a radial basis function kernel, where the parameter settings are gamma = 0.0192 and cost = 1. For the elastic net, the glmnet function is used in glmnet (version 2.0-10) R package, where the parameters are optimized by tenfold cross-validation using a grid search and the best performance is obtained when setting alpha = 0.5 and lambda = 0.08. Age Predictor has been integrated into MethBank as an online tool that features straightforward and user-friendly web interfaces and accepts various types of data (raw data, processed data, GEO sample ID) as input.The input page The output page
MethBank-SRM presents IDMP (Identification of Differentially Methylated Promoter), a tool developed for identifying differentially methylated promoters (DMP) between any two samples. The identification procedure is detailed below. First, a Fisher’s exact test is performed on the condition that the delta methylation levels of the promoters between two samples are greater than a specified threshold. For this test, a contingency table is constructed where the row indicates a particular sample and the column indicates the sum of number of reads that supports a methylated cytosine or an unmethylated cytosine over all the cytosines at this promoter in a given sample. Second, the Benjamini-Hochberg False Discovery Rate (FDR) correction for the p-values of Fisher’s exact test is used. Finally, the promoter methylation of gene associated with DMP is provided. Users can directly download IDMP from the home webpage of MethBank and identify DMPs by providing two genome methylation files (BED format) of interested samples and the gene annotation file (GFF3 format) and setting the parameters (which include cytosine sequence context (C, CG or CH), the relative start position of promoters to TSS, delta methylation level, and p-value, etc).
（WGBS data in SRM module is aligned by Bismark software after 2021. But it is solved by WBSA before 2021.）
With the explosive growth of epigenome-wide association studies (EWAS), a large amount of data and knowledge related to EWAS have been accumulated. Although these data hold great potential for clinical translation, a standardized platform for data archiving, retrieving and exploration is indispensable. For this reason, we updated the existing data resources, EWAS Atlas (Nucleic Acids Res 2019, https://ngdc.cncb.ac.cn/ewas/atlas) and EWAS Data Hub (Nucleic Acids Res 2020, https://ngdc.cncb.ac.cn/ewas/datahub), and proposed EWAS Toolkit (https://ngdc.cncb.ac.cn/ewas/toolkit), an online tool for downstream analysis of EWAS result. Focusing on EWAS research, these three resources provide knowledge, data and tools respectively. More importantly, the quality and functionality for each component are enhanced by cross-linking information from each other. To this end, we present EWAS Open Platform, an integrated one-stop analysis platform for EWAS research.
We created MethBank-MeTool to catalogue and curated analysis tools for DNA and RNA methylation. MethBank-MeTool collects a range of information on each tool and categorizes them according to the platforms, libraries, applications and functions. MethBank-MeTool supports keyword search and provides dynamic update for the citation of all tools.