This paper proposes a new method for discovering rules from textual data. The method decomposes textual data into word sets by using lexical analysis, generates training examples from both key phrase relations extracted from the word sets by using key phrase patterns and text classes given by the user, and acquires key phrase relation rules from the examples by using a fuzzy inductive learning algorithm. The method is also able to deal with textual data that requires word segmentation, such as Japanese text. This paper reports on the application of the method to e-mail analysis tasks for a customer center. The e-mails are written in Japanese and have two analytical criteria: a product criterion and a contents criterion. We evaluate the acquired rules in each criterion.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.