Personality is the most critical feature that tells us about an individual. It is the collection of the individual’s thoughts, opinions, emotions and more. Personality detection is an emerging field in research and Deep Learning models have only recently started being developed. There is a need for a larger dataset that is unbiased as the current dataset that is used is in the form of questionnaires that the individuals themselves answer, hence increasing the chance of unconscious bias. We have used the famous stream-of-consciousness essays collated by James Pennbaker and Laura King. We have used the Big Five Model often known as the five-factor model or OCEAN model. Document-level feature extraction has been performed using Google’s word2vec embeddings and Mairesse features. The processed data has been fed into a deep convolutional network and a binary classifier has been used to classify the presence or absence of the personality trait. Hold- out method has been used to evaluate the model, and the F1 score has been used as the performance metric.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.