Comparing methods for clinical investigator site inspection selection: a comparison of site selection methods of investigators in clinical trials.

Nicholas Hein, Elena Rantou, Paul Schuette
Author Information
  1. Nicholas Hein: Department of Biostatistics, University of Nebraska Medical Center , Omaha , NE , USA.
  2. Elena Rantou: Office of Biostatistics/Office of Translational Sciences/Center for Drug Evaluation and Research, U.S. Food and Drug Administration , Silver Spring , MD , USA.
  3. Paul Schuette: Office of Biostatistics/Office of Translational Sciences/Center for Drug Evaluation and Research, U.S. Food and Drug Administration , Silver Spring , MD , USA.

Abstract

During the past two decades, the number and complexity of clinical trials have risen dramatically increasing the difficulty of choosing sites for inspection. FDA's resources are limited and so sites should be chosen with care. To determine if data mining techniques and/or unsupervised statistical monitoring can assist with the process of identifying potential clinical sites for inspection. Five summary-level clinical site datasets from four new drug applications (NDA) and one biologics license application (BLA), where the FDA had performed or had planned site inspections, were used. The number of sites inspected and the results of the inspections were blinded to the researchers. Five supervised learning models from the previous two years (2016-2017) of an on-going research project were used to predict site inspections results, i.e., No Action Indicated (NAI), Voluntary Action Indicated (VAI), or Official Action Indicated (OAI). Statistical Monitoring Applied to Research Trials (SMART) software for unsupervised statistical monitoring software developed by CluePoints (Mont-Saint-Guibert, Belgium) was utilized to identify atypical centers (via a -value approach) within a study.Finally, Clinical Investigator Site Selection Tool (CISST), developed by the Center for Drug Evaluation and Research (CDER), was used to calculate the total risk of each site thereby providing a framework for site selection. The agreement between the predictions of these methods was compared. The overall accuracy and sensitivity of the methods were graphically compared. Spearman's rank order correlation was used to examine the agreement between the SMART analysis (CluePoints' software) and the CISST analysis. The average aggregated correlation between the -values (SMART) and total risk scores (CISST) for all five studies was 0.21, and range from -0.41 to 0.50. The Random Forest models for 2016 and 2017 showed the highest aggregated mean agreement (65.1%) amongst outcomes (NAI, VAI, OAI) for the three available studies. While there does not appear to be a single most accurate approach, the performance of methods under certain circumstances is discussed later in this paper. Classifier models based on data mining techniques require historical data (i.e., training data) to develop the model. There is a possibility that sites in the five-summary level datasets were included in the training datasets for the models from the previous year's research which could result in spurious confirmation of predictive ability. Additionally, the CISST was utilized in three of the five site selection processes, possibly biasing the data. The agreement between methods was lower than expected and no single method emerged as the most accurate.

Keywords

MeSH Term

Biological Products
Data Mining
Databases, Factual
Drug Approval
Humans
Research Personnel
United States
United States Food and Drug Administration

Chemicals

Biological Products

Word Cloud

Created with Highcharts 10.0.0sitedatamethodsclinicalsitesinspectionusedmodelsCISSTagreementminingunsupervisedstatisticalmonitoringdatasetsinspectionsActionIndicatedSMARTsoftwareriskselectiontwonumbertrialstechniquesFiveresultspreviousresearchieNAIVAIOAIResearchdevelopedutilizedapproachSitetotalcomparedcorrelationanalysisaggregatedfivestudies0threesingleaccuratetrainingpastdecadescomplexityrisendramaticallyincreasingdifficultychoosingFDA'sresourceslimitedchosencaredetermineand/orcanassistprocessidentifyingpotentialsummary-levelfournewdrugapplicationsNDAonebiologicslicenseapplicationBLAFDAperformedplannedinspectedblindedresearcherssupervisedlearningyears2016-2017on-goingprojectpredictVoluntaryOfficialStatisticalMonitoringAppliedTrialsCluePointsMont-Saint-GuibertBelgiumidentifyatypicalcentersvia-valuewithinstudyFinallyClinicalInvestigatorSelectionToolCenterDrugEvaluationCDERcalculatetherebyprovidingframeworkpredictionsoverallaccuracysensitivitygraphicallySpearman'srankorderexamineCluePoints'average-valuesscores21range-04150RandomForest20162017showedhighestmean651%amongstoutcomesavailableappearperformancecertaincircumstancesdiscussedlaterpaperClassifierbasedrequirehistoricaldevelopmodelpossibilityfive-summarylevelincludedyear'sresultspuriousconfirmationpredictiveabilityAdditionallyprocessespossiblybiasinglowerexpectedmethodemergedComparinginvestigatorselection:comparisoninvestigatorsp-valuesassessment

Similar Articles

Cited By

No available data.