The 20 Questions (Q20) game is a well known game which encourages deductive reasoning and creativity. In the game, the answerer first thinks of an object such as a famous person or a kind of animal. Then the questioner tries to guess the object by asking 20 questions. In a Q20 game system, the user is considered as the answerer while the system itself acts as the questioner which requires a good strategy of question selection to figure out the correct object and win the game. However, the optimal policy of question selection is hard to be derived due to the complexity and volatility of the game environment. In this paper, we propose a novel policy-based Reinforcement Learning (RL) method, which enables the questioner agent to learn the optimal policy of question selection through continuous interactions with users. To facilitate training, we also propose to use a reward network to estimate the more informative reward. Compared to previous methods, our RL method is robust to noisy answers and does not rely on the Knowledge Base of objects. Experimental results show that our RL method clearly outperforms an entropy-based engineering system and has competitive performance in a noisyfree simulation environment 1 .
Cyclist trajectory prediction is of great significance for both active collision avoidance and path planning of intelligent vehicles. This paper presents a trajectory prediction method for the motion intention of cyclists in real traffic scenarios. This method is based on dynamic Bayesian network (DBN) and long short-term memory (LSTM). The motion intention of cyclists is hard to predict owing to potential large uncertainties. The DBN is used to infer the distribution of cyclists' intentions at intersections to improve the prediction time. The LSTM with encoder-decoder is used to predict the cyclists' trajectories to improve the accuracy of prediction. Therefore, the DBN and LSTM are adopted to guarantee prediction accuracy and improve the prediction time. The experiment results are presented to show the effectiveness of the predict strategies.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.