Mobile phone datasets allow for the analysis of human behavior on an unprecedented scale. The social network, temporal dynamics and mobile behavior of mobile phone users have often been analyzed independently from each other using mobile phone datasets. In this article, we explore the connections between various features of human behavior extracted from a large mobile phone dataset. Our observations are based on the analysis of communication data of 100000 anonymized and randomly chosen individuals in a dataset of communications in Portugal. We show that clustering and principal component analysis allow for a significant dimension reduction with limited loss of information. The most important features are related to geographical location. In particular, we observe that most people spend most of their time at only a few locations. With the help of clustering methods, we then robustly identify home and office locations and compare the results with official census data. Finally, we analyze the geographic spread of users' frequent locations and show that commuting distances can be reasonably well explained by a gravity model. Keywords:Human Mobility, Data Mining, Location Detection, Commuting Distance AbstractMobile phone datasets allow for the analysis of human behavior on an unprecedented scale. The social network, temporal dynamics and mobile behavior of mobile phone users have often been analyzed independently from each other using mobile phone datasets. In this article, we explore the connections between various features of human behavior extracted from a large mobile phone dataset. We show that clustering and principal component analysis allows for a significant dimension reduction with limited loss of information. The most important features are related to geographical location. In particular, we observe that most people spend most of their time at only a few locations. of users' frequent locations and show that commuting distances can be reasonably well explained by a gravity model.
We propose a new finite sample system identification method, called Sign-Perturbed Sums (SPS), to estimate the parameters of dynamical systems under mild statistical assumptions. The proposed method constructs non-asymptotic confidence regions that include the leastsquares (LS) estimate and are guaranteed to contain the true parameters with a user-chosen exact probability. Our method builds on ideas imported from the "Leave-out Sign-dominant Correlation Regions" (LSCR) approach, but, unlike LSCR, also guarantees the inclusion of the LS estimate and provides confidence regions for multiple parameters with exact probabilities. This paper presents the SPS method for FIR and ARX systems together with its main theoretical properties, as well as demonstrates the approach through simple examples and experiments.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.