CancerInSilico: An R/Bioconductor package for combining mathematical and statistical modeling to simulate time course bulk and single cell gene expression data in cancer

Sherman, T. D.; Kagohara, L. T.; Cao, R.; Cheng, R.; Satriano, M.; Considine, M.; Krigsfeld, G.; Ranaweera, R.; Tang, Y.; Jablonski, S.; Stein-O'Brien, G.; Gaykalova, D.; Weiner, L. M.; Chung, C.; Fertig, E.

Abstract

Bioinformatics techniques to analyze time course bulk and single cell omics data are advancing. The absence of a known ground truth of the dynamics of molecular changes challenges benchmarking their performance on real data. Realistic simulated time-course datasets are essential to assess the performance of time course bioinformatics algorithms. We develop an R/Bioconductor package, CancerInSilico, to simulate bulk and single cell transcriptional data from a known ground truth obtained from mathematical models of cellular systems. This package contains a general R infrastructure for running cell-based models and simulating gene expression data based on the model states. We show how to use this package to simulate a gene expression data set and consequently benchmark analysis methods on this data set with a known ground truth. The package is freely available via Bioconductor: http://bioconductor.org/packages/CancerInSilico/

Word Cloud

Created with Highcharts 10.0.0datapackagetimecoursebulksinglecellknowngroundtruthsimulategeneexpressionperformanceR/BioconductormathematicalmodelssetBioinformaticstechniquesanalyzeomicsadvancingabsencedynamicsmolecularchangeschallengesbenchmarkingrealRealisticsimulatedtime-coursedatasetsessentialassessbioinformaticsalgorithmsdevelopCancerInSilicotranscriptionalobtainedcellularsystemscontainsgeneralRinfrastructurerunningcell-basedsimulatingbasedmodelstatesshowuseconsequentlybenchmarkanalysismethodsfreelyavailableviaBioconductor:http://bioconductororg/packages/CancerInSilico/CancerInSilico:combiningstatisticalmodelingcancernull

Similar Articles

Cited By