Wei Wu scite author profile

The 20 Questions (Q20) game is a well-known game that encourages deductive reasoning and creativity. In the game, the answerer first thinks of an object, such as a famous person or a kind of animal. The questioner then tries to guess the object by asking 20 questions. In a Q20 game system, the user acts as the answerer while the system itself acts as the questioner, which requires a good question-selection strategy to identify the correct object and win the game. However, the optimal question-selection policy is hard to derive because of the complexity and volatility of the game environment. In this paper, we propose a novel policy-based Reinforcement Learning (RL) method, which enables the questioner agent to learn the optimal question-selection policy through continuous interactions with users. To facilitate training, we also propose using a reward network to estimate a more informative reward. Compared to previous methods, our RL method is robust to noisy answers and does not rely on a Knowledge Base of objects. Experimental results show that our RL method clearly outperforms an entropy-based engineering system and has competitive performance in a noise-free simulation environment.


Trajectory prediction of cyclist based on dynamic Bayesian network and long short-term memory model at unsignalized intersections

Gao, Cai, et al. 2021, Sci. China Inf. Sci.


Cyclist trajectory prediction is of great significance for both active collision avoidance and path planning of intelligent vehicles. This paper presents a trajectory prediction method for the motion intention of cyclists in real traffic scenarios. The method is based on a dynamic Bayesian network (DBN) and a long short-term memory (LSTM) model. The motion intention of cyclists is hard to predict owing to potentially large uncertainties. The DBN is used to infer the distribution of cyclists' intentions at intersections to improve the prediction time. The LSTM with an encoder-decoder is used to predict the cyclists' trajectories to improve prediction accuracy. Together, the DBN and LSTM are adopted to guarantee prediction accuracy and improve the prediction time. Experimental results are presented to show the effectiveness of the prediction strategies.


Less is More: Data-Efficient Complex Question Answering Over Knowledge Bases

Hua et al. 2021, SSRN Journal


DisenKGAT

Shi, Cao, et al. 2021



Wei Wu

Neural Feature Search: A Neural Architecture for Automated Feature Engineering

Playing 20 Question Game with Policy-Based Reinforcement Learning
