Definition

Gaussian processes (GPs) are local approximation techniques that model spatial data by placing (and updating) priors on the covariance structures underlying the data. Originally developed for geospatial contexts, they are also applicable in general contexts that involve computing and modeling with multi-level spatial aggregates, e.g., modeling a configuration space for crystallographic design, casting folding energies as a function of a protein's contact map, and formulating vaccination policies that take into account the social dynamics of individuals. Typically, we assume a parametrized covariance structure underlying the data to be modeled, estimate the covariance parameters conditional on the locations at which data have been observed, and use the inferred structure to make predictions at new locations. GPs have a probabilistic basis that allows us to estimate variances at unsampled locations, aiding in the design of targeted sampling strategies.
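As a concrete illustration of this workflow, the sketch below conditions a GP on a few observed 2D locations and returns predictive means and variances at unsampled locations. It is a minimal sketch assuming a squared-exponential covariance with fixed (rather than estimated) hyperparameters; the function names and parameter values are illustrative and not part of the original text.

```python
# Minimal GP prediction sketch (NumPy), assuming a squared-exponential
# covariance with fixed hyperparameters; `length_scale`, `signal_var`,
# and `noise_var` are illustrative choices, not estimated from data.
import numpy as np

def sq_exp_cov(A, B, length_scale=1.0, signal_var=1.0):
    """Squared-exponential covariance between two sets of 2D locations."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return signal_var * np.exp(-0.5 * d2 / length_scale ** 2)

def gp_predict(X_obs, t_obs, X_new, noise_var=1e-2):
    """Posterior mean and variance at new locations, conditional on observations."""
    K = sq_exp_cov(X_obs, X_obs) + noise_var * np.eye(len(X_obs))
    K_star = sq_exp_cov(X_new, X_obs)
    K_starstar = sq_exp_cov(X_new, X_new)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, t_obs))  # K^{-1} t
    mean = K_star @ alpha
    v = np.linalg.solve(L, K_star.T)
    var = np.diag(K_starstar) - (v ** 2).sum(axis=0)
    return mean, var

# Observed responses at sampled 2D locations, predictions at unsampled ones.
X_obs = np.array([[0.0, 0.0], [1.0, 0.5], [2.0, 1.5]])
t_obs = np.array([0.3, 1.1, 0.2])
X_new = np.array([[0.5, 0.25], [3.0, 3.0]])
mean, var = gp_predict(X_obs, t_obs, X_new)
# Predictive variances grow far from the data, which is the property
# exploited when designing targeted sampling strategies.
```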
Historical Background

The underlying ideas behind GPs can be traced back to the geostatistics technique called kriging (Journel and Huijbregts 1992), named after the South African mining engineer Danie Krige. Kriging in this literature was used to model response variables (e.g., ozone concentrations) over 2D spatial fields as realizations of a stochastic process. Sacks et al. (1989) described the use of kriging to model (deterministic) computer experiments. It took more than a decade from this point for the larger computer science community to investigate GPs for pattern analysis purposes. Thus, in the recent past, GPs have witnessed a revival, primarily due to the work of MacKay (1997) and the graphical models literature (Jordan 1998). Neal established the connection between Gaussian processes and neural networks with an infinite number of hidden units (Neal 1996). Such relationships allow us to take traditional learning techniques and re-express them as imposing a particular covariance structure on the joint distribution of inputs. For instance, we can take a trained neural network and mine the covariance structure implied by its weights (given mild assumptions, such as a Gaussian prior over the weight space). Williams motivates the usefulness of such studies and describes common covariance functions (Williams 1998). Williams and Barber (1998) describe how the Gaussian process framework can be extended to classification, in which the modeled variable is categorical. Since these publications, interest in GPs has grown rapidly, with a steady stream of papers at conferences such as ICML and NIPS; see also the recently published book by Rasmussen and Williams (2006).
Scientific Fundamentals

A GP can be formally defined as a collection of random variables, any finite subset of which have a (multivariate) normal distribution. For simplicity, we assume 2D spatially distributed (scalar) response variables $t_i$, one for each location $\mathbf{x}_i = [x_{i1}, x_{i2}]$ at which we have collected a data sample. Observe that, in the limiting case, each random variable has a Gaussian distribution (but i...
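To make the definition concrete, the short sketch below builds the covariance matrix for a finite set of 2D locations and draws one sample from the implied multivariate normal distribution. The squared-exponential covariance function and its hyperparameters are assumptions made for illustration only.

```python
# Illustration of the definition: for any finite set of 2D locations, the
# responses have a joint multivariate normal distribution whose covariance
# is given by a kernel (here an assumed squared-exponential form).
import numpy as np

rng = np.random.default_rng(0)

# A small 2D grid of locations x_i = [x_i1, x_i2].
g = np.linspace(0.0, 2.0, 5)
X = np.array([[a, b] for a in g for b in g])      # 25 locations

# Covariance matrix K with K[i, j] = cov(t_i, t_j).
d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
K = np.exp(-0.5 * d2)                             # unit length scale and variance

# One draw from the GP prior: a single sample of the 25-dimensional Gaussian.
t = rng.multivariate_normal(mean=np.zeros(len(X)), cov=K)
```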