Multi-robot task allocation in e-commerce RMFS based on deep reinforcement learning.

Ruiping Yuan, Jiangtao Dou, Juntao Li, Wei Wang, Yingfan Jiang
Author Information
  1. Ruiping Yuan: School of Information, Beijing Wuzi University, Beijing 101149, China.
  2. Jiangtao Dou: School of Information, Beijing Wuzi University, Beijing 101149, China.
  3. Juntao Li: School of Information, Beijing Wuzi University, Beijing 101149, China.
  4. Wei Wang: School of Information, Beijing Wuzi University, Beijing 101149, China.
  5. Yingfan Jiang: School of Information, Beijing Wuzi University, Beijing 101149, China.

Abstract

A Robotic Mobile Fulfillment System (RMFS) is a new type of parts-to-picker order fulfillment system in which multiple robots coordinate to complete a large number of order-picking tasks. The multi-robot task allocation (MRTA) problem in RMFS is complex and dynamic, and traditional MRTA methods do not solve it well. This paper proposes a task allocation method for multiple mobile robots based on multi-agent deep reinforcement learning. The method combines the strength of reinforcement learning in dynamic environments with the ability of deep learning to handle task allocation problems with large state spaces and high complexity. First, a multi-agent framework with a cooperative structure is proposed according to the characteristics of RMFS. Then, a multi-agent task allocation model is constructed based on a Markov Decision Process. To avoid inconsistent information among agents and to improve the convergence speed of the traditional Deep Q-Network (DQN), an improved DQN algorithm based on a shared utilitarian selection mechanism and prioritized sampling of experience samples is proposed to solve the task allocation model. Simulation results show that the task allocation algorithm based on deep reinforcement learning is more efficient than one based on a market mechanism, and that the improved DQN algorithm converges much faster than the original DQN algorithm.
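
The abstract does not say how the prioritized sampling is carried out. As an illustration only, the sketch below shows the commonly used proportional form of prioritized experience replay, in which a transition is drawn with probability proportional to its TD error raised to a power. The class name, the parameters `alpha`, `beta` and `eps`, and the transition fields are all assumptions made for this example and are not taken from the paper.

```python
import random
from collections import namedtuple

# Hypothetical transition record; the field names are illustrative only.
Transition = namedtuple(
    "Transition", ["state", "action", "reward", "next_state", "done"]
)


class PrioritizedReplayBuffer:
    """Proportional prioritized replay (a generic sketch, not the paper's code)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-3):
        self.capacity = capacity
        self.alpha = alpha      # 0 gives uniform sampling, 1 gives full prioritization
        self.eps = eps          # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0            # next slot to write (ring buffer)

    def add(self, transition):
        # New samples get the current maximum priority so they are seen at least once.
        max_p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(max_p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = max_p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        n = len(self.data)
        idx = rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights correct the bias of non-uniform sampling.
        weights = [(n * probs[i]) ** (-beta) for i in idx]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        return idx, [self.data[i] for i in idx], weights

    def update_priorities(self, indices, td_errors):
        # After a learning step, priorities are refreshed from the new TD errors.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a DQN training loop, each agent would store transitions with `add`, draw a batch with `sample`, scale its loss by the returned weights, and then call `update_priorities` with the batch's TD errors. How this interacts with the shared utilitarian selection mechanism is not described in the abstract and is left out of the sketch.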
