A Neural Model for User Geolocation and Lexical Dialectology

Rahimi, Afshin; Cohn, Trevor; Baldwin, Timothy

doi:10.18653/v1/p17-2033

Cited by 71 publications

(87 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…It not only improves prediction accuracy but also greatly reduces mean error distance. Compared with a strong neural model equipped with local dialects (Rahimi et al, 2017), it increases Acc@161 by an absolute value 4% and reduces mean error distance by about 400 kilometers on the challenging Twitter-World dataset, without using any external knowledge. Its mean error distance on Twitter-World is even comparable to some methods using network feature (Do et al, 2017).…”

Section: Baseline Comparisonsmentioning

confidence: 99%

See 1 more Smart Citation

A Hierarchical Location Prediction Neural Network for Twitter User Geolocation

Huang¹,

Carley²

2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen

View full text Add to dashboard Cite

Accurate estimation of user location is important for many online services. Previous neural network based methods largely ignore the hierarchical structure among locations. In this paper, we propose a hierarchical location prediction neural network for Twitter user geolocation. Our model first predicts the home country for a user, then uses the country result to guide the city-level prediction. In addition, we employ a character-aware word embedding layer to overcome the noisy information in tweets. With the feature fusion layer, our model can accommodate various feature combinations and achieves state-of-the-art results over three commonly used benchmarks under different feature settings. It not only improves the prediction accuracy but also greatly reduces the mean error distance.

show abstract

Section: Baseline Comparisonsmentioning

confidence: 99%

“…In recent years, neural network based prediction methods have shown great success on this Twitter user geolocation prediction task (Rahimi et al, 2017;Miura et al, 2017). However, these neural network based methods largely ignore the hierarchical structure among locations (eg.…”

Section: Introductionmentioning

confidence: 99%

A Hierarchical Location Prediction Neural Network for Twitter User Geolocation

Huang¹,

Carley²

2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen

View full text Add to dashboard Cite

show abstract

“…Based on within-domain performance for each of the Twitter data sets, we recognize that our inference modeling approach is below state of the art. For example, in the space of text-only models, Rahimi et al (2017) have achieved an Acc@100 of 0.34 on TWITTER-WORLD using a multilayer perceptron and k-d tree discretization over the label set.…”

Section: Trainmentioning

confidence: 99%

Geocoding Without Geotags: A Text-based Approach for reddit

Harrigian¹

2018

Proceedings of the 2018 EMNLP Workshop W-Nut: The 4th Workshop on Noisy User-Generated Text

View full text Add to dashboard Cite

In this paper, we introduce the first geolocation inference approach for reddit, a social media platform where user pseudonymity has thus far made supervised demographic inference difficult to implement and validate. In particular, we design a text-based heuristic schema to generate ground truth location labels for reddit users in the absence of explicitly geotagged data. After evaluating the accuracy of our labeling procedure, we train and test several geolocation inference models across our reddit data set and three benchmark Twitter geolocation data sets. Ultimately, we show that geolocation models trained and applied on the same domain substantially outperform models attempting to transfer training data across domains, even more so on reddit where platformspecific interest-group metadata can be used to improve inferences.

show abstract

“…The weakness of this method is that it can not propagate labels (locations) to users who are not connected to the graph. To address this problem, methods combining textual information and graph topology knowledge are proposed in [24], [8]. Furthermore, these works build densely undirected graphs based on mentioning of users, which helps improve significantly the results.…”

Section: Related Workmentioning

confidence: 99%

“…Our user graph is formed in a way similar as in [8], [24] but instead of predicting users' locations directly on the graph, we extract node2vec feature for later use in our model. First, a unique set of nodes, V , is created for all the users of interest.…”

Section: A Multiview Featuresmentioning

confidence: 99%

Twitter User Geolocation Using Deep Multiview Learning

Huu

Nguyen

Tsiligianni

et al. 2018

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

View full text Add to dashboard Cite

Predicting the geographical location of users on social networks like Twitter is an active research topic with plenty of methods proposed so far. Most of the existing work follows either a content-based or a network-based approach. The former is based on user-generated content while the latter exploits the structure of the network of users. In this paper, we propose a more generic approach, which incorporates not only both content-based and network-based features, but also other available information into a unified model. Our approach, named Multi-Entry Neural Network (MENET), leverages the latest advances in deep learning and multiview learning. A realization of MENET with textual, network and metadata features results in an effective method for Twitter user geolocation, achieving the state of the art on two well-known datasets.

show abstract

A Neural Model for User Geolocation and Lexical Dialectology

Cited by 71 publications

References 34 publications

A Hierarchical Location Prediction Neural Network for Twitter User Geolocation

A Hierarchical Location Prediction Neural Network for Twitter User Geolocation

Geocoding Without Geotags: A Text-based Approach for reddit

Twitter User Geolocation Using Deep Multiview Learning

Contact Info

Product

Resources

About