Could Social Media, and in particular, microblogs such as Twitter, play a part in helping to track criminal movement? The aim of this paper is to narrow the focus of this broader problem of using social media to crowdsource information to assist in the fight against crime, to the specific problem of identifying the description of vehicles in microblog text. As this problem has many aspects, especially in terms of data gathering and identification, an initial search is performed on preset keywords and the resulting database is tagged. The tags are then analysed to determine which features are the most common. Topic models are then run on the data to determine if any useful keyword can be found for further searches and initial statistics are recorded as a baseline for further processing. Our primary concern is establishing the common content of the relevant Tweets. The result could be used both for help with data collection as well as with feature selection when learning classification algorithms for data mining.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.