Optimal dose selection in phase I/II dose finding trial with contextual bandits: a case study and practical recommendations.

Jixian Wang, Ram Tiwari
Author Information
  1. Jixian Wang: GBDS, Bristol Myers Squibb, Boudry, Switzerland.
  2. Ram Tiwari: Global Stat Solutions, Reston, Virginia, USA.

Abstract

Dose selection is a key decision to make in the early phase of drug development. Classical phase I/II dose-finding trials randomly assign a few doses and select the best among them. Response-adaptive assignment designs are more efficient but are still far from optimal. Recently, some researchers used machine learning (ML) methods such as contextual bandits (CB) to find the "optimal" dose and to investigate the asymptotic properties of the methods. We present a case study for oncology phase I/II dose-finding trial designs using Thompson sampling and Bayesian bootstrap for CB with either modeling clinical utility directly or jointly modeling efficacy and safety. We focus on practical questions such as the number of interim analyses to conduct and whether we should model the utility directly, jointly model efficacy and safety which compose the utility, or use a model independent approach such as multi-armed bandits, but not for a specific compound or tumor type. We also consider how to use weak informative prior information. We conducted an extensive simulation study and compared different combinations of design settings and modeling methods, under several feasible scenarios of the dose-response relationship. Based on simulation results, we make practical recommendations for the use of the proposed ML approach for phase I/II dose-finding trial designs.

Keywords

Word Cloud

Created with Highcharts 10.0.0phaseI/IIdosedose-findingdesignsmethodscontextualbanditsstudytrialmodelingutilitypracticalmodeluseselectionmakemachinelearningMLCBcaseThompsonsamplingBayesianbootstrapdirectlyjointlyefficacysafetyapproachsimulationrecommendationsfindingDosekeydecisionearlydrugdevelopmentClassicaltrialsrandomlyassigndosesselectbestamongResponse-adaptiveassignmentefficientstillfaroptimalRecentlyresearchersusedfind"optimal"investigateasymptoticpropertiespresentoncologyusingeitherclinicalfocusquestionsnumberinterimanalysesconductwhethercomposeindependentmulti-armedspecificcompoundtumortypealsoconsiderweakinformativepriorinformationconductedextensivecompareddifferentcombinationsdesignsettingsseveralfeasiblescenariosdose-responserelationshipBasedresultsproposedOptimalbandits:

Similar Articles

Cited By

No available data.