This paper presents a chatbot for a Dialogue-Based Computer-Assisted second Language Learning (DB-CALL) system. A DB-CALL system normally leads dialogues by asking questions according to given scenarios. User utterances outside the scenarios are normally considered as semantically improper and simply rejected. In this paper, we assume that raising the freedom of dialogue can stimulate the user's interest in learning. For this, a chatbot based on a search engine with a dialogue corpus has been developed to deal with conversations out of the scenarios. We evaluate the chatbot separately in two different cases: as an independent bot and as an auxiliary system. The results showed that, unlike the independent chatbot system, the chatbot as an auxiliary system showed a much lower turn success ratio.
Feature selection is very important for feature-based relation classification tasks. While most of the existing works on feature selection rely on linguistic information acquired using parsers, this letter proposes new features, including probabilistic and semantic relatedness features, to manifest the relatedness between patterns and certain relation types in an explicit way. The impact of each feature set is evaluated using both a chisquare estimator and a performance evaluation. The experiments show that the impact of relatedness features is superior to existing well-known linguistic features, and the contribution of relatedness features cannot be substituted using other normally used linguistic feature sets.
This paper proposes a hybrid machine-translation system that combines neural machine translation with well-developed rule-based machine translation to utilize the stability of the latter to compensate for the inadequacy of neural machine translation in rare-resource domains. A classifier is introduced to predict which translation from the two systems is more reliable. We explore a set of features that reflect the reliability of translation and its process, and training data is automatically expanded with a small, human-labeled dataset to solve the insufficient-data problem. A series of experiments shows that the hybrid system’s translation accuracy is improved, especially in out-of-domain translations, and classification accuracy is greatly improved when using the proposed features and the automatically constructed training set. A comparison between feature- and text-based classification is also performed, and the results show that the feature-based model achieves better classification accuracy, even when compared to neural network text classifiers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.