Basic Introduction to Statistics in Medicine, Part 1: Describing Data.

Wyatt P Bensken, Fredric M Pieracci, Vanessa P Ho
Author Information
  1. Wyatt P Bensken: Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA.
  2. Fredric M Pieracci: Department of Surgery, Denver Health Medical Center, Denver, Colorado, USA.
  3. Vanessa P Ho: Department of Population and Quantitative Health Sciences, Case Western Reserve University School of Medicine, Cleveland, Ohio, USA.

Abstract

Standardized and concise data presentation forms the base for subsequent analysis and interpretation. This article reviews types of data, data properties and distributions, and both numerical and graphical methods of data presentation. For the purposes of illustration, the National Inpatient Sample was queried to categorize patients as having either emergency general surgery or non-emergency general surgery admissions. Variables are categorized as either categorical or numerical. Within the former, there are ordinal and or nominal subtypes; within the latter, there are ratio and interval subtypes. Categorical data are typically displayed as number (%). Numerical data must be assessed for normality as normally distributed data behave in certain patterns that allow for specific statistical tests to be used. Several properties exist for numerical data, including measurements of central tendency (mean, median, and mode), as well as standard deviation, range, and interquartile range. The best initial assessment of the distribution of numerical data is graphical with both histograms and box plots. Knowledge of the types, distribution, and properties of data is essential to move forward with hypothesis testing.

Keywords

References

  1. Eur J Epidemiol. 2016 Apr;31(4):337-50 [PMID: 27209009]
  2. J Am Geriatr Soc. 2019 Nov;67(11):2289-2297 [PMID: 31301180]
  3. Am Stat. 2019;73(Suppl 1):82-90 [PMID: 31413381]
  4. Surg Infect (Larchmt). 2021 Aug;22(6):597-603 [PMID: 34270362]

Grants

  1. KL2 TR002547/NCATS NIH HHS

MeSH Term

Data Collection
Data Interpretation, Statistical
Humans

Word Cloud

Created with Highcharts 10.0.0datanumericalpropertiespresentationtypesgraphicaleithergeneralsurgerysubtypesrangedistributionStandardizedconciseformsbasesubsequentanalysisinterpretationarticlereviewsdistributionsmethodspurposesillustrationNationalInpatientSamplequeriedcategorizepatientsemergencynon-emergencyadmissionsVariablescategorizedcategoricalWithinformerordinalnominalwithinlatterratiointervalCategoricaltypicallydisplayednumber%NumericalmustassessednormalitynormallydistributedbehavecertainpatternsallowspecificstatisticaltestsusedSeveralexistincludingmeasurementscentraltendencymeanmedianmodewellstandarddeviationinterquartilebestinitialassessmenthistogramsboxplotsKnowledgeessentialmoveforwardhypothesistestingBasicIntroductionStatisticsMedicinePart1:DescribingDatadescriptionsciencestatistics

Similar Articles

Cited By