Community question answering platforms need to automatically rank answers and questions with respect to a given question. In this paper, we present the approaches for the Answer Selection and Question Retrieval tasks of SemEval-2016 (task 3). We develop a bag-of-vectors approach with various vectorand text-based features, and different neural network approaches including CNNs and LSTMs to capture the semantic similarity between questions and answers for ranking purpose. Our evaluation demonstrates that our approaches significantly outperform the baselines.
Natural language processing tools are used to automatically detect disturbances in transcribed speech of schizophrenia inpatients who speak Hebrew. We measure topic mutation over time and show that controls maintain more cohesive speech than inpatients. We also examine differences in how inpatients and controls use adjectives and adverbs to describe content words and show that the ones used by controls are more common than the those of inpatients. We provide experimental results and show their potential for automatically detecting schizophrenia in patients by means only of their speech patterns.
Abstract-We show how an Arabic language religious-political document can be automatically classified according to the ideological stream and organizational affiliation that it represents. Tests show that our methods achieve near-perfect accuracy.
The Impact Tech Startup (ITS) is a new, rapidly developing type of organizational category. Based on an entrepreneurial approach and technological foundations, ITSs adopt innovative strategies to tackle a variety of social and environmental challenges within a for-profit framework and are usually backed by private investment. This new organizational category is thus far not discussed in the academic literature. The paper first provides a conceptual framework for studying this organizational category, as a combination of aspects of social enterprises and startup businesses. It then proposes a machine learning (ML)-based algorithm to identify ITSs within startup databases. The UN’s Sustainable Development Goals (SDGs) are used as a referential framework for characterizing ITSs, with indicators relating to those 17 goals that qualify a startup for inclusion in the impact category. The paper concludes by discussing future research directions in studying ITSs as a distinct organizational category through the usage of the ML methodology.
We describe our entry in the EMNLP 2014 code-switching shared task. Our system is based on a sequential classifier, trained on the shared training set using various character-and word-level features, some calculated using a large monolingual corpora. We participated in the Twitter-genre Spanish-English track, obtaining an accuracy of 0.868 when measured on the tweet level and 0.858 on the word level.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.