In this paper the authors present a GAN-type model and the most important stages of its development for the task of emotion recognition in text. In particular, they propose an approach for generating a synthetic dataset of all possible emotion combinations based on manually labelled, incomplete data.
Dataset vectorization
Initially, it is necessary to form a dataset for training the model. The dataset should contain data about seven basic emotions for each piece of text, whether a paragraph or a sentence. The authors could not find a publicly available dataset with this or similar data. Therefore, it was