2012 IEEE 12th International Conference on Data Mining Workshops 2012
DOI: 10.1109/icdmw.2012.126
|View full text |Cite
|
Sign up to set email alerts
|

Sampling Online Social Networks Using Coupling from the Past

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
5
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 15 publications
0
5
0
Order By: Relevance
“…We used a population dataset from Advanced Symbolics Inc. 3 (ASI), a market research company in Canada. ASI is continuously collecting tweets posted by Twitter users using Conditional Independence Coupler (CIC) sampling algorithm that is based on Coupling from the Past (CFTP) [24]. The stopping condition is enhanced by measuring the distance between the new node and the seed node, then adjusting the weights of sampling using post-stratification to compensate for the underrepresented groups of the population.…”
Section: B Population-level Twitter Datasetmentioning
confidence: 99%
“…We used a population dataset from Advanced Symbolics Inc. 3 (ASI), a market research company in Canada. ASI is continuously collecting tweets posted by Twitter users using Conditional Independence Coupler (CIC) sampling algorithm that is based on Coupling from the Past (CFTP) [24]. The stopping condition is enhanced by measuring the distance between the new node and the seed node, then adjusting the weights of sampling using post-stratification to compensate for the underrepresented groups of the population.…”
Section: B Population-level Twitter Datasetmentioning
confidence: 99%
“…In constructing our labeled Twitter dataset we initially randomly collected a sample of 282, 201 Twitter users from Canada by using the Conditional Independence Coupling (CIC) method [17]. CIC matches the prior distribution of the population, in this case the Canadian general population, ensuring that the sample is balanced for gender, race and age.…”
Section: Development Of Labeled Twitter Covid-19 Datasetmentioning
confidence: 99%
“…Many sampling techniques were studied ranging from topical [11,19] to user-based approaches [12]. The first set of techniques is topic-based sampling, where specific keywords or hashtags are applied to collect tweets through Twitter API [6,20].…”
Section: Related Workmentioning
confidence: 99%
“…However, the major problem with the mentioned techniques is that, these techniques are biased toward high degree nodes similar to expert sampling. A solution to this problem is the traditional Monte Carlo Markov Chain (MCMC), which was proposed by White et al [12]. They applied a technique based on MCMC and Coupling From The Past (CFTP) to have better convergence in sampling.…”
mentioning
confidence: 99%
See 1 more Smart Citation