Kavli Affiliate: Feng Wang | Authors: Jiaqi Ye, Xiaodong Li, Pangjing Wu, Feng Wang | Summary: Most reinforcement learning algorithms are based on a key assumption that Markov decision processes (MDPs) are stationary. However, non-stationary MDPs with dynamic action space are omnipresent in real-world scenarios. Yet problems of dynamic action space reinforcement learning have been […]
Continue.. Action Pick-up in Dynamic Action Space Reinforcement Learning