Yiming Bao

Professor

Email: baoym (AT) big.ac.cn

Tel: 10-84097858


PROFESSIONAL EXPERIENCE

  • Director of National Genomics Data Center, Beijing Institute of Genomics (BIG), Chinese Academy of Sciences (CAS), China, 2019 - Present

  • Director of BIG Data Center, BIG, CAS, China, 2017 - 2019

  • Professor, BIG, CAS, China, 2017 - Present

  • Staff Scientist, National Center for Biotechnology Information (NCBI)/NLM/NIH, USA, 2005 - 2017

  • Viral Genome Scientist, Computercraft Corporation (as a government contractor working at NCBI), USA, 2001-2005

  • Postdoctoral Associate and Senior Research Associate, Noble Foundation, USA, 1994 - 2001

  • Teaching and Research Assistant, Peking University, China, 1987-1991

EDUCATION

  • PhD in Genetics, John Innes Centre (through University of East Anglia), UK, 1994

  • BS in Biochemistry, Peking University, China, 1987

RESEARCH INTERESTS

  • Bioinformatics

  • Viral Genomics

PROJECTS & RESOURCES

  • The National Natural Science Foundation of China Project, Study of a Novel Method for High-throughput Automated Virus Classification Based on Whole Genome Sequence Comparison, 2023-2026, leader

  • National Key Research and Development Project, Key Technologies and Systems for Biological Big Data Management and Analyses, 2021-2024, leader

  • The Professional Association of the Alliance of International Science Organizations, leader

  • The Strategic Priority Research Program of the Chinese Academy of Sciences, 2020-2024, co-leader

  • National Key Research and Development Project, Global Omics Data Sharing Initiative, 2017-2020, leader

  • IUBS, Open Biodiversity and Health Big Data Programme, 2019-2025, leader

  • The 13th Five-year Informatization Plan of Chinese Academy of Sciences, Big Data-Driven Innovation and Demonstration Platform in the Field of Bioinformatics, 2018-2019, leader

ACADEMIC ACTIVITIES

  • Journal ReviewerArchive of Virology; Bioinformatics; BMC Bioinformatics; BMC Microbiology; Computers in Biology; Current Genomics; Database; Genomics, Proteomics & Bioinformatics; Infection, Genetics and Evolution; Journal of Computational Biology; Journal of Genetics and Genomics; Journal of Virology; The Lancet Digital Health; Molecular Phylogenetics and Evolution; Nucleic Acids Research; Plant Molecular Biology; PLoS ONE; PNAS; Vaccine; Virus Evolution; Viruses

  • Member: Virus Data Subcommittee, International Committee on Taxonomy of Viruses (ICTV), 2011-2017

PUBLICATIONS

1.     Bao Y. as co-corresponding author in CNCB-NGDC Members and Partners. (2024). Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2024. Nucleic Acids Res. doi: 10.1093/nar/gkad1078, PMID: 38018256.

2.     Gao X, Chen K, Xiong J, Zou D, Yang F, Ma Y, Jiang C, Gao X, Wang G, Gu S, Zhang P, Luo S, Huang K, Bao Y, Zhang Z, Ma L, Miao W. (2024). The P10K database: a data portal for the protist 10 000 genomes project. Nucleic Acids Res. doi: 10.1093/nar/gkad992. PMID: 37930867.

3.     Yang S, Zong W, Shi L, Li R, Ma Z, Ma S, Si J, Wu Z, Zhai J, Ma Y, Fan Z, Chen S, Huang H, Zhang D, Bao Y, Li R, Xie J. (2024). PPGR: a comprehensive perennial plant genomes and regulation database. Nucleic Acids Res. doi: 10.1093/nar/gkad963. PMID: 37933857.

4.     Lin S, Wu S, Zhao W, Fang Z, Kang H, Liu X, Pan S, Yu F, Bao Y, Jia P. (2024). TargetGene: a comprehensive database of cell-type-specific target genes for genetic variants. Nucleic Acids Res. doi: 10.1093/nar/gkad901. PMID: 37870478.

5.     Li H, Wu S, Li J, Xiong Z, Yang K, Ye W, Ren J, Wang Q, Xiong M, Zheng Z, Zhang S, Han Z, Yang P, Jiang B, Ping J, Zuo Y, Lu X, Zhai Q, Yan H, Wang S, Ma S, Zhang B, Ye J, Qu J, Yang YG, Zhang F, Liu GH, Bao Y, Zhang W. (2024). HALL: a comprehensive database for human aging and longevity studies. Nucleic Acids Res. doi: 10.1093/nar/gkad880. PMID: 37870433.

6.     Wang G, Wu S, Xiong Z, Qu H, Fang X, Bao Y. (2024). CROST: a comprehensive repository of spatial transcriptomics. Nucleic Acids Res. doi: 10.1093/nar/gkad782. PMID: 37791883.

7.     Li C, Wu L, Shu C, Bao Y, Ma J, Song S. (2023). Data-driven public health security. Chinese Science Bulletin. doi: 10.1360/TB-2023-0708.

8.     Ma L, Zhao W, Huang T, Jin E, Wu G, Zhao W, Bao Y. (2023). On the collection and integration of SARS-CoV-2 genome data. Biosafety and Health 5, 204-210. doi: 10.1016/j.bsheal.2023.07.004.

9.     Li C, Ma L, Zou D, Zhang R, Bai X, Li L, Wu G, Huang T, Zhao W, Jin E, Bao Y, Song S. (2023). RCoV19: A One-stop Hub for SARS-CoV-2 Genome Data Integration, Variant Monitoring, and Risk Pre-warning. Genomics Proteomics Bioinformatics. doi: 10.1016/j.gpb.2023.10.004. PMID: 37898309.

10.  Bao Y, Xue Y. (2023). From BIG Data Center to China National Center for Bioinformation. Genomics Proteomics Bioinformatics. doi: 10.1016/j.gpb.2023.10.001. PMID: 37832784.

11.  Bu C, Zheng X, Mai J, Nie Z, Zeng J, Qian Q, Xu T, Sun Y, Bao Y, Xiao J. (2023). CCLHunter: An efficient toolkit for cancer cell line authentication. Comput Struct Biotechnol J. 21, 4675-4682. doi: 10.1016/j.csbj.2023.09.040. PMID: 37841327.

12.  Li L, Xu B, Tian D, Wang A, Zhu J, Li C, Li N, Zhao W, Shi L, Xue Y, Zhang Z, Bao Y, Zhao W, Song S. (2023). McAN: a novel computational algorithm and platform for constructing and visualizing haplotype networks. Brief Bioinform. 24, bbad174. doi: 10.1093/bib/bbad174. PMID: 37170752.

13.  Kang H, Zhu X, Cui Y, Xiong Z, Zong W, Bao Y, Jia P. (2023). A Comprehensive Benchmark of Transcriptomic Biomarkers for Immune Checkpoint Blockades. Cancers (Basel). 15, 4094. doi: 10.3390/cancers15164094. PMID: 37627121.

14.  Sun Y, Zheng X, Wang G, Wang Y, Chen X, Sun J, Xiong Z, Zhang S, Wang T, Fan Z, Bu C, Bao Y, Zhao W. (2023). MACdb: A curated knowledgebase for metabolic associations across human cancers. Mol Cancer Res. 21, 691-697. doi: 10.1158/1541-7786.MCR-22-0909. PMID: 37027007.

15.  Simmonds P, Adriaenssens EM, Zerbini FM, Abrescia NGA, Aiewsakun P, Alfenas-Zerbini P, Bao Y, Barylski J, Drosten C, Duffy S, Duprex WP, Dutilh BE, Elena SF, García ML, Junglen S, Katzourakis A, Koonin EV, Krupovic M, Kuhn JH, Lambert AJ, Lefkowitz EJ, Łobocka M, Lood C, Mahony J, Meier-Kolthoff JP, Mushegian AR, Oksanen HM, Poranen MM, Reyes-Muñoz A, Robertson DL, Roux S, Rubino L, Sabanadzovic S, Siddell S, Skern T, Smith DB, Sullivan MB, Suzuki N, Turner D, Van Doorslaer K, Vandamme AM, Varsani A, Vasilakis N. (2023). Four principles to establish a universal virus taxonomy. PLoS Biol. 21, e3001922. doi: 10.1371/journal.pbio.3001922. PMID: 36780432.

16.  Wu S, Huang Y, Zhang M, Gong Z, Wang G, Zheng X, Zong W, Zhao W, Xing P, Li R, Liu Z, Bao Y. (2023). ASCancer Atlas: a comprehensive knowledgebase of alternative splicing in human cancers. Nucleic Acids Res. 51, D1196-D1204. doi: 10.1093/nar/gkac955. PMID: 36318242.

17.  Duan G, Wu G, Chen X, Tian D, Li Z, Sun Y, Du Z, Hao L, Song S, Gao Y, Xiao J, Zhang Z, Bao Y, Tang B, Zhao W. (2023). HGD: an integrated homologous gene database across multiple species. Nucleic Acids Res. 51, D994-D1002. doi: 10.1093/nar/gkac970. PMID: 36318261.

18.  Jiang S, Qian Q, Zhu T, Zong W, Shang Y, Jin T, Zhang Y, Chen M, Wu Z, Chu Y, Zhang R, Luo S, Jing W, Zou D, Bao Y, Xiao J, Zhang Z. (2023). Cell Taxonomy: a curated repository of cell types with multifaceted characterization. Nucleic Acids Res. 51, D853-D860. doi: 10.1093/nar/gkac816. PMID: 36161321.

19.  Pan S, Kang H, Liu X, Lin S, Yuan N, Zhang Z, Bao Y, Jia P. (2023). Brain Catalog: a comprehensive resource for the genetic landscape of brain-related traits. Nucleic Acids Res. 51, D835-D844. doi: 10.1093/nar/gkac895. PMID: 36243988.

20.  Zhang M, Zong W, Zou D, Wang G, Zhao W, Yang F, Wu S, Zhang X, Guo X, Ma Y, Xiong Z, Zhang Z, Bao Y, Li R. (2023). MethBank 4.0: an updated database of DNA methylation across a variety of species. Nucleic Acids Res. 51, D208-D216. doi: 10.1093/nar/gkac969. PMID: 36318250.

21.  Bao Y. as co-corresponding author in CNCB-NGDC Members and Partners. (2023). Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2023. Nucleic Acids Res. 51, D18-D28. doi: 10.1093/nar/gkac1073. PMID: 36420893.

22.  Ma L, Zou D, Liu L, Shireen H, Abbasi AA, Bateman A, Xiao J, Zhao W, Bao Y, Zhang Z. (2022). Database Commons: A Catalog of Worldwide Biological Databases. Genomics Proteomics & Bioinformatics. doi: 10.1016/j.gpb.2022.12.004. PMID: 36572336.

23.  Zheng X, Zong W, Li Z, Ma Y, Sun Y, Xiong Z, Wu S, Yang F, Zhao W, Bu C, Du Z, Xiao J, Bao Y. (2022). CCAS: One-stop and comprehensive annotation system for individual cancer genome at multi-omics level. Front Genet. 13, 956781. doi: 10.3389/fgene.2022.956781. PMID: 36035123.

24.  Ma Y, Chen M, Bao Y, Song S, MPoxVR Team. (2022). MPoxVR – A comprehensive genomic resource for monkeypox virus variants surveillance. The Innovation 100296, doi: 10.1016/j.xinn.2022.100296.

25.  Wang G, Xiong Z, Yang F, Zheng X, Zong W, Li R, Bao Y. (2022). Identification of COVID-19-Associated DNA Methylation Variations by Integrating Methylation Array and scRNA-Seq Data at Cell-Type Resolution. Genes (Basel) 13, 1109. doi: 10.3390/genes13071109. PMID: 35885892.

26.  Hou Y, Zhao S, Liu Q, Zhang X, Sha T, Su Y, Zhao W, Bao Y, Xue Y, Chen H. (2022). Ongoing positive selection drives the evolution of SARS-CoV-2 genomes. Genomics Proteomics Bioinformatics doi: 10.1016/j.gpb.2022.05.009. PMID: 35760317.

27.  Ma W, Yang J, Fu H, Su C, Yu C, Wang Q, de Vasconcelos ATR, Bazykin GA, Bao Y, Li M. (2022). Genomic perspectives on the emerging SARS-CoV-2 omicron variant. Genomics Proteomics Bioinformatics doi: 10.1016/j.gpb.2022.01.001. PMID: 35033679.

28.  Xiong Z, Li M, Ma Y, Li R, Bao Y. (2022). GMQN: A Reference-Based Method for Correcting Batch Effects and Probe Bias in HumanMethylation BeadChip. Front Genet 12, 810985. doi: 10.3389/fgene.2021.810985. PMID: 35069703.

29.  Li R, Zhang X, Song S, Wang Y, Zou D, Xiao J, Zhao W, Zhang Z, Bao Y. (2022). Safety management and application of genomics data. Big Data Research 8, 37-45.

30.  Wang YY, Kang H, Xu T, Hao L, Bao Y, Jia P. (2022). CeDR Atlas: a knowledgebase of cellular drug response. Nucleic Acids Res. 50, D1164-D1171. doi: 10.1093/nar/gkab897. PMID: 34634794.

31.  Kang W, Jin T, Zhang T, Ma S, Yan H, Liu Z, Ji Z, Cai Y, Wang S, Song M, Ren J, Hu B, Zhou Q, Zhang W, Qu J, Bao Y, Liu GH. (2022). Regeneration Roadmap: database resources for regenerative biology. Nucleic Acids Res. 50, D1085-D1090. doi: 10.1093/nar/gkab870. PMID: 34591960.

32.  Zhang Y, Zou D, Zhu T, Xu T, Chen M, Niu G, Zong W, Pan R, Jing W, Sang J, Liu C, Xiong Y, Sun Y, Zhai S, Chen H, Zhao W, Xiao J, Bao Y, Hao L, Zhang Z. (2022). Gene Expression Nebulas (GEN): a comprehensive data portal integrating transcriptomic profiles across multiple species at both bulk and single-cell levels. Nucleic Acids Res. 50, D1016-D1024. doi: 10.1093/nar/gkab878. PMID: 34591957.

33.  Xiong Z, Yang F, Li M, Ma Y, Zhao W, Wang G, Li Z, Zheng X, Zou D, Zong W, Kang H, Jia Y, Li R, Zhang Z, Bao Y. (2022). EWAS Open Platform: integrated data, knowledge and toolkit for epigenome-wide association study. Nucleic Acids Res. 50, D1004-D1009. doi: 10.1093/nar/gkab972. PMID: 34718752.

34.  Zong W, Kang H, Xiong Z, Ma Y, Jin T, Gong Z, Yi L, Zhang M, Wu S, Wang G, Bao Y, Li R. (2022). scMethBank: a database for single-cell whole genome DNA methylation maps. Nucleic Acids Res. 50, D380-D386. PMID: 34570235.

35.  Bao Y. as co-corresponding author in CNCB-NGDC Members and Partners. (2022). Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022. Nucleic Acids Research 50, D27-D38. doi: 10.1093/nar/gkab951. PMID: 34718731.

36.  Zhang SS, Chen X, Chen TT, Zhu JW, Tang BX, Wang AK, Dong LL, Zhang ZW, Sun YL, Yu CX, Zhai S, Sun YB, Chen HX, Du ZL, Xiao JF, Zhang Z, Bao YM, Wang YQ, Zhao WM. (2021). GSA-Human: Genome Sequence Archive for Human. Yi Chuan 43, 988-993. doi: 10.16288/j.yczz.21-248. PMID: 34702711.

37.  Song S, Li C, Kang L, Tian D, Badar N, Ma W, Zhao S, Jiang X, Wang C, Sun Y, Li W, Lei M, Li S, Qi Q, Ikram A, Salman M, Umair M, Shireen H, Batool F, Zhang B, Chen H, Yang Y, Ali Abbasi A, Li M, Xue Y, Bao Y. (2021). Genomic Epidemiology of SARS-CoV-2 in Pakistan. Genomics Proteomics Bioinformatics 19, 727-740. doi: 10.1016/j.gpb.2021.08.007. PMID: 34695600

38.  Chen M, Ma Y, Wu S, Zheng X, Kang H, Sang J, Xu X, Hao L, Li Z, Gong Z, Xiao J, Zhang Z, Zhao W, Bao Y. (2021). Genome Warehouse: A Public Repository Housing Genome-scale Data. Genomics, Proteomics & Bioinformatics 19, 584-589. doi: 10.1016/j.gpb.2021.04.001. PMID: 34175476.

39.  Chen T, Chen X, Zhang S, et al. (2021). The Genome Sequence Archive Family: Toward Explosive Data Growth and Diverse Data Types. Genomics, Proteomics & Bioinformatics 19, 578-583. doi: 10.1016/j.gpb.2021.08.001. PMID: 34400360.

40.  Pervaiz, N., Kang, H., Bao, Y., Abbasi A.A. (2021).  Molecular evolutionary analysis of human primary microcephaly genes. BMC Ecol Evo 21, 76.

41.  Raza RZ, Ma L, Zhang Z, Bao Y, Abbasi, AA. (2021). Selection trends on nasal-associated SNP variants across human populations. Meta Gene 28, 100872. doi: 10.1016/j.mgene.2021.100872.

42.  Zhang, T., Gong, Z., Guo, T., Xiong, Z. and Bao, Y. (2021). Pairwise Sequence Comparison in Virology. In: Bamford, D.H. and Zuckerman, M. (eds.) Encyclopedia of Virology, 4th Edition, vol. 1, pp. 100–107. Oxford: Academic Press.

43.  Bao Y. as co-corresponding author in Aging Atlas Consortium. (2021). Aging Atlas: a multi-omics database for aging biology. Nucleic Acids Research 49, D825-D830.

44.  Bao Y. as co-corresponding author in CNCB-NGDC Members and Partners. (2021). Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2021. Nucleic Acids Research 49, D18-D28.

45.  Song S, Ma L, Zou D, Tian D, Li C, Zhu J, Chen M, Wang A, Ma Y, Li M, Teng X, Cui Y, Duan G, Zhang M, Jin T, Shi C, Du Z, Zhang Y, Liu C, Li R, Zeng J, Hao L, Jiang S, Chen H, Han D, Xiao J, Zhang Z, Zhao W, Xue Y, Bao Y. (2020). The Global Landscape of SARS-CoV-2 Genomes, Variants, and Haplotypes in 2019nCoVR. Genomics Proteomics Bioinformatics 18, 749-759.

46.  Liu Q., Zhao S., Shi C.M., Song S., Zhu S., Su Y., Zhao W., Li M., Bao Y., Xue Y., & Chen H. (2020). Population Genetics of SARS-CoV-2: Disentangling Effects of Sampling Bias and Infection Clusters. Genomics, Proteomics & Bioinformatics 18, 640-647.

47.  Duan Y.F., Wang S.Y., Chen Y.B., et al. (2020). Expert consensus on microbiome sequencing and analysis. Chinese Journal of Biotechnology 36, 2516-2524.

48.  Hussain I., Pervaiz N., Khan A., Saleem S., Shireen H., Wei D.Q., Labrie V., Bao Y., Abbasi A.A. (2020). Evolutionary and structural analysis of SARS-CoV-2 specific evasion of host immunity. Genes Immun. 21, 409-419. doi: 10.1038/s41435-020-00120-6.

49.  Gong Z., Zhu J.W., Li C.P., Jiang S., Ma L.N., Tang B.X., Zou D., Chen M.L., Sun Y.B., Song S.H., Zhang Z., Xiao J.F., Xue Y.B., Bao Y.M., Du Z.L., Zhao W.M. (2020). An online coronavirus analysis platform from the National Genomics Data Center. Zool. Res. 41, 705-708.

50.  Nawaz M.S., Asghar R., Pervaiz N., Ali S., Hussain I., Xing P., Bao Y., Abbasi A.A. (2020). Molecular evolutionary and structural analysis of human UCHL1 gene demonstrates the relevant role of intragenic epistasis in Parkinson's disease and other neurological disorders. BMC Evol Biol. 20, 130. doi: 10.1186/s12862-020-01684-7.

51.  Liu Y., Ding H., Chang S.T., Lu R., Zhong H., Zhao N., Lin T.H., Bao Y. et al. (2020). Exposure to air pollution and scarlet fever resurgence in China: a six-year surveillance study. Nature Communications 11, 4229. https://doi.org/10.1038/s41467-020-17987-8

52.  Pei S., Dong R., Bao Y., He R.L., Yau S.S. (2020). Classification of genomic components and prediction of genes of Begomovirus based on subsequence natural vector and support vector machine. PeerJ 8, e9625. https://doi.org/10.7717/peerj.9625

53.  Garnett S.T., Christidis L., Conix S., Costello M.J., Zachos F.E., Ba´nki O.S., Bao Y. et al. (2020). Principles for creating a single authoritative list of the world’s species. PLoS Biol 18, e3000736. https://doi.org/10.1371/journal.pbio.3000736

54.  Mehmood F., Abdullah, Ubaid Z., Bao Y., Poczai P., Bushra Mirza. (2020). Comparative Plastomics of Ashwagandha (Withania, Solanaceae) and Identification of Mutational Hotspots for Barcoding Medicinal Plants. Plants (Basel) 9, 752.

55.  Zhang Z., Song S., Yu J., Zhao W., Xiao J., Bao Y. (2020). The elements of data sharing. Genomics, Proteomics & Bioinformatics 18, 1-4.

56.  Chen M., Ma Y., Li R., Bao Y. (2020). Current Status and Prospects of Genomics Data Analysis Methods. Frontiers of Data and Computing 2, 1-19.

57.  Shah S., Malik A. H., Zhang B., Bao Y., Qazi J. (2020). Metagenomic analysis of relative abundance and diversity of bacterial microbiota in Bemisia tabaci infesting cotton crop in Pakistan. Infection, Genetics and Evolution 84, 104381.

58.  Zhao W.M., Song S.H., Chen M.L., Zou D., Ma L.N., Ma Y.K., Li R.J., Hao L.L., Li C.P., Tian D.M., Tang B.X., Wang Y.Q., Zhu J.W., Chen H.X., Zhang Z., Xue Y.B., Bao Y.M. (2020). The 2019 novel coronavirus resource. Hereditas (Beijing) 42, 212-221.

59.  Xiong Z., Li M., Yang F., Ma Y., Sang J., Li R., Li Z., Zhang Z, Bao Y. (2020). EWAS Data Hub: a resource of DNA methylation array data and metadata. Nucleic Acids Research 48, D890-D895.

60.  Bao Y. as co-corresponding author in National Genomics Data Center Members and Partners. (2020). Database Resources of the National Genomics Data Center in 2020. Nucleic Acids Research 48, D24-D33.

61.  Pervaiz N, Shakeel N, Qasim A, Zehra R, Anwar S, Rana N, Xue Y, Zhang Z, Bao Y., Abbasi AA. (2019). Evolutionary history of the human multigene families reveals widespread gene duplications throughout the history of animals. BMC Evol Biol. 19, 128. doi: 10.1186/s12862-019-1441-0. PMID: 31221090.

62.  Ma L.N., Cao J., Liu L., Li Z., Shireen H., Pervaiz N., Batool F., Raza R., Zou D., Bao Y., Abbasi A.A., Zhang Z. (2019). Community curation and expert curation of human long non-coding RNAs. Current Protocols in Bioinformatics 67, e82.

63.  Yu D., Dong L., Yan F. et al. (2019). eGPS 1.0: comprehensive software for multi-omic and evolutionary analyses. National Science Review 6, 867–869. https://doi.org/10.1093/nsr/nwz079

64.  Amarasinghe G.K., Ayllon M.A., Bao Y., Basler C.F., et al. (2019). Taxonomy of the order Mononegavirales: update 2019. Archives of Virology 164, 1967-1980.

65.  Wang G., Yin H., Li B., Yu C., Wang F., Xu X., Cao J., Bao Y., Wang L., Abbasi A.A., Bajic V.B., Ma L., Zhang Z. (2019). Characterization and identification of long non-coding RNAs based on feature relationship. Bioinformatics 35, 2949-2956.

66.  Seemab S., Pervaiz N., Zehra R., Anwar S., Bao Y., Abbasi A.A. (2019). Molecular evolutionary and structural analysis of familial exudative vitreoretinopathy associated FZD4 gene. BMC Evolutionary Biology 19, 72.

67.  Bao Y. as co-corresponding author among BIG Data Center Members. (2019). Database Resources of the BIG Data Center in 2019. Nucleic Acids Research 47, D8-D14.

68.  Li M., Zou D., Li Z., Gao R., Sang J., Zhang Y., Li R., Xia L., Zhang T., Niu G., Bao Y., Zhang Z. (2019). EWAS Atlas: a curated knowledgebase of epigenome-wide association studies. Nucleic Acids Research 47, D983-D988.

69.  Tang B., Zhou Q., Dong L., Li W., Zhang X., Lan L., Zhai S., Xiao J., Zhang Z., Bao Y., Zhang Y-P., Wang G-D., Zhao W. (2019). iDog: an integrated resource for domestic dogs and wild canids. Nucleic Acids Research 47, D793-D800.

70.  Zhao Y., Wang J., Liang F., Liu Y., Wang Q., Zhang H., Jiang M., Zhang Z.W., Zhao W., Bao Y., Zhang Z., Wu J., Asmann Y.W., Li R., Xiao J. (2019). NucMap: a database of genome-wide nucleosome positioning map across species. Nucleic Acids Research 47, D163-D169.

71.  Ma Y. & Bao Y. (2018). Prospects for national biological big data centers. Hereditas (Beijing) 40, 938-943.

72.  Pavesi A., Vianelli A., Chirico N., Bao Y., et al. (2018) Overlapping genes and the proteins they encode differ significantly in their sequence composition from non-overlapping genes. PLoS ONE 13, e0202513. https://doi.org/10.1371/journal.pone.0202513

73.  Bao Y. & Xue Y. (2018). Current Status and Prospect of Life and Health Big Data. Bulletin of the Chinese Academy of Sciences 33, 861-865.

74.  Maes P., Alkhovsky S.V., Bao Y., et al. (2018). Taxonomy of the family Arenaviridae and the order Bunyavirales: update 2018. Archives of Virology 163, 2295-2310.

75.  Song S.H., Tian D.M., ..., Bao Y., et al. (2018). Genome  Variation  Map:  a  data  repository  of  genome  variations  in  BIG  Data  Center. Nucleic Acids Research 46, D944-D949.

76.  Li R.J., Liang F., ..., Bao Y., et al. (2018). MethBank 3.0: a database of DNA methylomes across a variety of species. Nucleic Acids Research 46, D288-D295.

77.  Bao Y. as co-corresponding author among BIG Data Center Members. (2018). Database Resources of the BIG Data Center in 2018. Nucleic Acids Research 46, D14-D20.

78.  Sang J., Wang Z., …, Bao Y., et al. (2018). ICG: a wiki-driven knowledgebase of internal control genes for RT-qPCR normalization. Nucleic Acids Research 46, D121-D126.

79.  Bao Y., Kuhn J.H. (2018). Preliminary Classification of Novel Hemorrhagic Fever-Causing Viruses Using Sequence-Based PAirwise Sequence Comparison (PASC) Analysis. Methods Molecular Biology 1604, 43-53.

80.  Hatcher E., Bao Y., et al. (2017). NCBI will no longer make taxonomy identifiers for individual influenza strains on January 15, 2018. PeerJ Preprints 5:e3428v1 https://doi.org/10.7287/peerj.preprints.3428v1

81.  Bao Y., Amarasinghe G.K., Basler C.F., et al. (2017). Implementation of Objective PASC-Derived Taxon Demarcation Criteria for Official Classification of Filoviruses. Viruses 9. pii: E106. doi: 10.3390/v9050106.

82.  Wang F., Fang Q., Wang B., Yan Z., Hong J., Bao Y., et al. (2017). A novel negative-stranded RNA virus mediates sex ratio in its parasitoid host. PLoS Pathogens 13, e1006201.

83.  Amarasinghe G.K., Bao Y., Basler C.F., Bavari S., et al. (2017). Taxonomy of the order Mononegavirales: update 2017. Archives of Virology 162.

84.  Hatcher E.L., Zhdanov S.A., Bao Y., Blinkova O., Nawrocki E.P., Ostapchuck Y., Schäffer A.A., Brister J.R. (2017). Virus Variation Resource - improved response to emergent viral outbreaks. Nucleic Acids Research 45, D482-D490.

85.  Kuhn J.H., Wiley M.R., Rodriguez S.E., Bao Y., et al. (2016). Genomic Characterization of the Genus Nairovirus (Family Bunyaviridae). Viruses 8, 164.

86.  Afonso C.L., Amarasinghe G.K., Banyai K., Bao Y., et al. (2016). Taxonomy of the order Mononegavirales: update 2016. Archives of Virology 161, 2351-2360.

87.  Kuhn J.H., Lauck M., Bailey A.L., Shchetinin A.M., Vishnevskaya T.V., Bao Y., et al. (2016). Reorganization and expansion of the nidoviral family Arteriviridae. Archives of Virology 161, 755-768.

88.  O'Leary N.A., Wright M.W., Brister J.R., Ciufo S., Haddad D., McVeigh R., Rajput B., Robbertse B., Smith-White B., Ako-Adjei D., Astashyn A., Badretdin A., Bao Y., et al. (2016). Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Research 44, D733-D745.

89.  Lauck M., Alkhovsky S.V., Bào Y., et al. (2015). Historical Outbreaks of Simian Hemorrhagic Fever in Captive Macaques Were Caused by Distinct Arteriviruses. Journal of Virology 89, 8082-8087.

90.  Radoshitzky S.R., Bao Y., et al. (2015). Past, present, and future of arenavirus taxonomy. Archives of Virology 160, 1851-1874.

91.  Brister J.R., Ako-Adjei D., Bao Y. & Blinkova O. (2015). NCBI Viral Genomes Resource. Nucleic Acids Research 43, D571-D577.

92.  Kuhn J.H., Dürrwald R., Bao Y., et al. (2015). Taxonomic reorganization of the family Bornaviridae. Archives of Virology 160, 621-632.

93.  Kuhn J.H., Andersen K.G., Baize S., Bao Y., et al. (2014). Nomenclature- and Database-Compatible Names for the Two Ebola Virus Variants that Emerged in Guinea and the Democratic Republic of the Congo in 2014. Viruses 6, 4760-4799.

94.  Kuhn J.H., Andersen K.G., Bao Y., et al. (2014). Filovirus RefSeq Entries: Evaluation and Selection of Filovirus Type Variants, Type Sequences, and Names. Viruses 6, 3663-3682.

95.  Bao Y., Chetvernin V. & Tatusova T. (2014). Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification. Archives of Virology 159, 3293-3304.

96.  Du Z., Chen A., Chen W., Liao Q., Zhang H., Bao Y., Roossinck M.J. & Carr J.P. (2014). Nuclear-cytoplasmic partitioning of the Cucumber mosaic virus 2b protein determines the balance between its roles as a virulence determinant and RNA silencing suppressor. Journal of Virology 88, 5228-5241.

97.  Brister J.R., Bao Y., Zhdanov S.A., Ostapchuck Y., Chetvernin V., Kiryutin B., Zaslavsky L., Kimelman M. & Tatusova T.A. (2014). Virus Variation Resource--recent updates and future directions. Nucleic Acids Research 42, D660-D665.

98.  Kuhn J.H., Bao Y., et al. (2014). Virus nomenclature below the species level: a standardized nomenclature for filovirus strains and variants rescued from cDNA. Archives of Virology 159, 1229-1237.

99.  Brister J.R. & Bao Y. (2013). Virus Variation. 2013 Nov 14. In: The NCBI Handbook [Internet]. 2nd edition. Bethesda (MD): National Center for Biotechnology Information (US); 2013-. Available from: http://www.ncbi.nlm.nih.gov/books/NBK143949/

100.            Bao Y., Brister J.R., Blinkova O., et al. (2013). About Viral and Phage Genome Processing and Tools. 2013 Mar 30 [Updated 2013 May 10]. In: The NCBI Handbook [Internet]. 2nd edition. Bethesda (MD): National Center for Biotechnology Information (US); 2013-. Available from: http://www.ncbi.nlm.nih.gov/books/NBK153522/

101.            Kuhn J.H., Bao Y., et al. (2013). Virus nomenclature below the species level: a standardized nomenclature for laboratory animal-adapted strains and variants of viruses assigned to the family Filoviridae. Archives of Virology 158, 1425-1432.

102.            Kuhn J.H., Bao Y., et al. (2013). Virus nomenclature below the species level: a standardized nomenclature for natural variants of viruses assigned to the family Filoviridae. Archives of Virology 158, 301-311.

103.            Bao Y., Chetvernin V. & Tatusova T. (2012). PAirwise Sequence Comparison (PASC) and Its Application in the Classification of Filoviruses. Viruses 4, 1318-1327.

104.            Gilbert, J.A., Bao, Y., et al. (2012). Report of the 13th Genomic Standards Consortium Meeting, Shenzhen, China, March 4–7, 2012. Standards in Genomic Sciences 6, 276-286.

105.            Brister, J.R., Bao, Y., Kuiken, C., Lefkowitz, E.J., Mercier, P.L., Leplae, R., Madupu, R., Scheuermann, R.H., Schobel, S., Seto, D., Shrivastava, S., Sterk, P., Zeng, Q., Klimke, W. & Tatusova, T. (2010). Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop. Viruses 2, 2258-2268.

106.            Resch, W., Zaslavsky, L., Kiryutin B., Rozanov, M., Bao Y. & Tatusova T.A. (2009). Virus variation resource at the national center for biotechnology information: dengue virus. BMC Microbiology 9, 65.

107.            Bao, Y., Kapustin Y. & Tatusova T. (2008). Virus Classification by Pairwise Sequence Comparison (PASC).  Encyclopedia of Virology, 5 vols. (B.W.J. Mahy and M.H.V. Van Regenmortel, Editors). Oxford: Elsevier. Vol. 5, 342-348.

108.            Zaslavsky, L., Bao Y. & Tatusova T.A. (2008). Visualization of large influenza virus sequence datasets using adaptively aggregated trees with sampling-based subscale representation. BMC Bioinformatics  9, 237.

109.            Bao, Y., Bolotov P., Dernovoy D., Kiryutin B., Zaslavsky L., Tatusova T., Ostell J. & Lipman D. (2008). The influenza virus resource at the national center for biotechnology information. Journal of Virology 82, 596-601.

110.            Bao, Y., Bolotov P., Dernovoy D., Kiryutin B. & Tatusova T. (2007). FLAN: a web server for influenza virus genome annotation. Nucleic Acids Research 35, W280-W284.

111.            Zaslavsky, L., Bao Y. & Tatusova T.A. (2007). An adaptive resolution tree visualization of large influenza virus sequence datasets, p. 192-202. In I. Mandoiu and A. Zelikovsky (ed.), Bioinformatics Research and Applications. Proc. of ISBRA 2007, Lecture Notes in Bioinformatics 4463. Springer, Verlag.

112.            Ghedin E, Sengamalay NA, Shumway M, Zaborsky J, Feldblyum T, Subbu V, Spiro DJ, Sitz J, Koo H, Bolotov P, Dernovoy D, Tatusova T, Bao Y, St George K, Taylor J, Lipman DJ, Fraser CM, Taubenberger JK & Salzberg SL. (2005). Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution. Nature 437, 1162-1166.

113.            Holmes E. C., Ghedin E., Miller N., Taylor J., Bao Y., St. George K., Grenfell B.T., Salzberg S.L., Fraser C.M., Lipman D.J. & Taubenberger J.K. (2005). Whole-Genome Analysis of Human Influenza A Virus Reveals Multiple Persistent Lineages and Reassortment among Recent H3N2 Viruses. PLoS Biology 3: e300.

114.            Bao, Y., Federhen, S., Leipe, D., Pham, V., Resenchuk, S., Rozanov, M., Tatusov, R. & Tatusova, T.  (2004).  National Center for Biotechnology Information Viral Genomes ProjectJournal of Virology 78, 7291-7298.

115.            Ding, X. S., Liu, J., Cheng, N-H., Folimonov, A., Hou, Y-M., Bao, Y., Katagi, C., Carter, S. A. & Nelson, R. S. (2004).  The Tobacco mosaic virus 126-kDa Protein Associated with Virus Replication and Movement Suppresses RNA SilencingMolecular Plant-Microbe Interactions 17, 583-592.

116.            Li, Y., Bao, Y. M., Wei, C. H., Kang, Z. S., Zhong, Y. W., Mao, P., Wu, G., Chen, Z. L. Schiemann, J. & Nelson, R. S. (2004). Rice dwarf phytoreovirus segment s6-encoded nonstructural protein has a cell-to-cell movement function.  Journal of Virology 78, 5382-5389.

117.            Chen Y. J., Gao G., Bao Y. M., Lopez R., Wu J. M., Cai T., Ye Z. Q., Gu  X. C. & Luo J. C. (2003).  Initial Analysis of Complete Genome Sequences of SARS Coronavirus.  Acta Genetica Sinica 30, 493-500.

118.            Sivakumaran, K., Bao, Y., Roossinck, M. J. & Kao, C. C. (2000).  Recognition of the core RNA promoter for the minus-strand RNA synthesis by the replicase of Brome mosaic virus and Cucumber mosaic virus.  Journal of Virology 74, 10323-10331.

119.            Itaya, A., Woo, Y-M, Masuta, C., Bao, Y., Nelson, R. S. & Ding, B. (1998).  Developmental regulation of intercellular protein trafficking through plasmodesmata in tobacco leaf epidermis.  Plant Physiology 118, 373-385.

120.            Itaya, A., Hickman, H., Bao, Y., Nelson, R. S. & Ding, B. (1997).  Cell-to-cell trafficking of cucumber mosaic virus movement protein: green fluorescent protein fusion produced by biolistic gene bombardment in tobacco.  The Plant Journal 12, 1223-1230.

121.            Shintaku, M. H., Carter, S. A., Bao, Y. & Nelson, R. S. (1996).  Mapping nucleotides in the 126-kDa protein gene that control the differential symptoms induced by two strains of tobacco mosaic virus.  Virology 221, 218-225.

122.            Futterer, J., Potrykus, I., Bao, Y., Li, L., Burns, T. M., Hull, R. & Hohn, T. (1996).  Position-dependent ATT initiation during plant pararetrovirus rice tungro bacilliform virus translation.  Journal of Virology 70, 2999-3010.

123.            Bao, Y., Carter, S. A. & Nelson, R. S. (1996).  The 126- and 183-kilodalton proteins of tobacco mosaic virus and not their common nucleotide sequence, control mosaic symptom formation in tobacco. Journal of Virology 70, 6378-6383.

124.            Bao, Y. & Hull, R. (1994).  Replication intermediates of rice tungro bacilliform virus support a replication mechanism involving reverse transcription.  Virology 204, 626-633.

125.            Chu, R-Y, Leng, X-H, Bao, Y-M, Pu, Z-Q, Pan, N-S, & Chen, Z-L. (1993).  Amplification of soybean mosaic virus coat protein gene by polymerase chain reaction and its sequence analysis.  Acta Botanica Sinica 34, 523-528.

126.            Bao, Y-M, Chu, R-Y, Han, J-H, Zhang, H., Pan, N-S, Gu, X-C, & Chen, Z-L. (1993).  Cloning and sequencing of trichosanthin gene and its expression in Escherichia coli and tobacco plant.  Science in China (Series B) 36, 669-676.

127.            Bao, Y. & Hull, R. (1993).  Mapping the 5'-terminus of rice tungro bacilliform virus genomic RNA.  Virology 197, 445-448.

128.            Bao, Y. & Hull, R. (1993).  A strong-stop DNA in rice plants infected with rice tungro bacilliform virus.  Journal of General Virology 74, 1611-1616.

129.            Bao, Y. & Hull, R. (1992).  Characterization of the discontinuities in rice tungro bacilliform virus DNA.  Journal of General Virology 73, 1297-1301.