To take advantage of the full range of services that online social networks (OSNs) offer, people commonly open several accounts on diverse OSNs where they leave lots of different types of profile information. The integration of these pieces of information from various sources can be achieved by identifying individuals across social networks. In this article, we address the problem of user identification by treating it as a classification task. Relying on common public attributes available through the official application programming interface (API) of social networks, we propose different methods for building negative instances that go beyond usual random selection so as to investigate the effectiveness of each method in training the classifier. Two test sets with different levels of discrimination are set up to evaluate the robustness of our different classifiers. The effectiveness of the approach is measured in real conditions by matching profiles gathered from Google+, Facebook and Twitter.
The continual launches of new online social media that meet the most varied people’s needs are resulting in a simultaneous adoption of different social platforms. As a consequence people are pushed to handle their identity across multiple platforms. However, due the to specialization of the services, people’s identity and behavior are often partial, incomplete and scattered in different “places”. To overcome this identity fragmentation and to give an all-around picture of people’s online behavior, in this paper we perform a multidimensional analysis of users across multiple social media sites. Our study relies on a new rich dataset collecting information about how and when users post their favorite contents, about their centrality on different social media and about the choice of their username. Specifically we gathered the posting activities and social sites usage from Alternion, a social media aggregator.The analysis of social media usage shows that Alternion data reflect the novel trend of today’s users of branching out into different social platforms. However the novelty is the multidimensional and longitudinal nature of the dataset. Having at our disposal users’ degree in five different social networks, we performed a rank correlation analysis on users’ degree centrality and we find that the degrees of a given user are scarcely correlated. This is suggesting that the individuals’ importance changes from medium to medium. The longitudinal nature of the dataset has been exploited to investigate the posting activity. We find a slightly positive correlation on how often users publish on different social media and we confirm the burstiness of the posting activities extending it to multidimensional time-series. Finally we show that users tend to use similar usernames to keep their identifiability across social sites.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.