Database Commons
Database Commons

a catalog of worldwide biological databases

Database Profile

EpoDB

General information

URL: http://www.cbil.upenn.edu/epodb
Full name: prototype database for the analysis of genes expressed during vertebrate erythropoiesis.
Description: poDB is a database of genes expressed in vertebrate red blood cells. It is also a prototype for the creation of cell and tissue-specific databases from multiple external sources.
Year founded: 1998
Last update:
Version:
Accessibility:
Unaccessible
Country/Region: United States

Classification & Tag

Data type:
DNA
Data object:
Database category:
Major species:
NA
Keywords:

Contact information

University/Institution: Children's Hospital of Philadelphia
Address: Division of Hematology, The Children's Hospital of Philadelphia, 316E Abramson Research Center, 34th and Civic Center Boulevard, Philadelphia, PA 19104, USA.
City:
Province/State:
Country/Region: United States
Contact name (PI/Team): Stoeckert CJ Jr
Contact email (PI/Helpdesk): stoeckert@email.chop.edu

Publications

9847180
EpoDB: a prototype database for the analysis of genes expressed during vertebrate erythropoiesis. [PMID: 9847180]
Stoeckert CJ, Salas F, Brunk B, Overton GC.

EpoDB is a database of genes expressed in vertebrate red blood cells. It is also a prototype for the creation of cell and tissue-specific databases from multiple external sources. The information in EpoDB obtained from GenBank, SWISS-PROT, Transfac, TRRD and GERD is curated to provide high quality data for sequence analysis aimed at understanding gene regulation during erythropoiesis. New protocols have been developed for data integration and updating entries. Using a BLAST-based algorithm, we have grouped GenBank entries representing the same gene together. This sequence similarity protocol was also used to identify new entries to be included in EpoDB. We have recently implemented our database in Sybase (relational tables) in addition to SICStus Prolog to provide us with greater flexibility in asking complex queries that utilize information from multiple sources. New additions to the public web site (http://www.cbil.upenn.edu/epodb) for accessing EpoDB are the ability to retrieve groups of entries representing different variants of the same gene and to retrieve gene expression data. The BLAST query has been enhanced by incorporating BLASTView, an interactive and graphical display of BLAST results. We have also enhanced the queries for retrieving sequence from specified genes by the addition of MEME, a motif discovery tool, to the integrated analysis tools which include CLUSTALW and TESS.

Nucleic Acids Res. 1999:27(1) | 15 Citations (from Europe PMC, 2026-04-04)
9399855
EpoDB: a database of genes expressed during vertebrate erythropoiesis. [PMID: 9399855]
Salas F, Haas J, Brunk B, Stoeckert CJ, Overton GC.

EpoDB is a database designed for the study of gene regulation during differentiation and development of vertebrate red blood cells. In building EpoDB, we have taken the in advance approach to the data integration problem: we have extracted data relevant to red blood cells from GenBank, SWISS-PROT, TRRD (transcriptional regulation data) and GERD (expression levels data) to create a single integrated, highly curated view. Tools have been developed to automate data extraction from online resources, cleanse data of errors, enter information manually from the primary literature, generate a uniform, canonical representation of information and maintain data currency. The database is organized around biological features, e.g., genes, rather than sequences, which are supported by a controlled and consistent vocabulary for gene names and gene family names. Beyond the standard database queries, the functionality of EpoDB includes the ability to extract features and subsequences, display sequences and features graphically using bioWidget viewers and integrated analysis tools. EpoDB may be accessed at: http://cbil.humgen.upenn.edu/epodb/

Nucleic Acids Res. 1998:26(1) | 5 Citations (from Europe PMC, 2026-03-28)

Ranking

All databases:
5648/6932 (18.537%)
Genotype phenotype and variation:
816/1012 (19.466%)
5648
Total Rank
19
Citations
0.679
z-index

Community reviews

Not Rated
Data quality & quantity:
Content organization & presentation
System accessibility & reliability:

Word cloud

Related Databases

Citing
Cited by

Record metadata

Created on: 2018-02-09
Curated by:
Lin Liu [2022-08-22]
huma shireen [2018-08-28]
Zhaohua Li [2018-02-23]
Pei Wang [2018-02-09]