Little Ball of Fur

Rózemberczki, Benedek; Kiss, Oliver; Sarkar, Rik

doi:10.1145/3340531.3412758

Cited by 27 publications

(10 citation statements)

References 40 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For a budget of n nodes in a sample, RN selects n random nodes from 𝑉, while RE selects enough edges from 𝐸 until the number of unique endpoints of edges (nodes) equals n (Ribeiro &Towsley, 2010). However, large networks seldom have all nodes and edges initially accessible or at least feasibly reachable (Rozemberczki et al, 2020b). A common reason is saving extremely large networks in relatively slow-acess storage mediums.Furthermore, organizations use systems with limited memory and therefore can only load and view a small proportion of a stored network.…”

Section: Introductionmentioning

confidence: 99%

“…The quality of sampled nodes can be evaluated by a metric that summarizes connectivity, clustering, degrees, or other characteristics of the sample. Many algorithms follow this general heuristic of starting at a single node 𝑠 and iteratively adding unvisited nodes 𝑣 in 𝑉 adjacent to nodes already in the sample (Rozemberczki et al, 2020b). To compare different exploration-based sampling algorithms, establishing some time or resource bound 𝐵 per sample creates an even field for comparisons.…”

Section: Introductionmentioning

confidence: 99%

“…Two problems current exploration-based network sampling algorithms face are addressed in this paper. First, targeting high-degree nodes in S was not deeply addressed during the development of many sampling algorithms (Rozemberczki et al, 2020b). In many real-world applications, node degree constitutes a value of importance (Rozemberczki et al, 2020b).…”

Section: Introductionmentioning

confidence: 99%

“…First, targeting high-degree nodes in S was not deeply addressed during the development of many sampling algorithms (Rozemberczki et al, 2020b). In many real-world applications, node degree constitutes a value of importance (Rozemberczki et al, 2020b). Actual meaning ranges from someone's social circle to the number of highways intersecting a city.…”

Section: Introductionmentioning

confidence: 99%

“…Targeting high-degree nodes provides direct benefit when sampling networks that model these applications. Second, a network sampling algorithm may linger in proximity to the start node 𝑠 rather than reach nodes farther away (Rozemberczki et al, 2020b). This renders 𝑆 a weaker representation of 𝐺 and forfeits opportunities to discover higher degree nodes farther away from 𝑠.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

Predictive Sampling Method for Spread Models in Networks

Qin

2021

UFJUR

View full text Add to dashboard Cite

This paper proposes a novel, exploration-based network sampling algorithm called caterpillar quota walk sampling (CQWS) inspired by the caterpillar tree. Network sampling identifies a subset of nodes and edges from a network, creating an induced graph. Beginning from an initial node, exploration-based sampling algorithms grow the induced set by traversing and tracking unvisited neighboring nodes from the original network. Tunable and trainable parameters allow CQWS to maximize the sum of the degrees of the induced graph from multiple trials when sampling dense networks. A network spread model renders effective use in various applications, including tracking the spread of epidemics, visualizing information transmissions through social media, and cell-to-cell spread of neurodegenerative diseases. CQWS generates a spread model as its sample by visiting the highest-degree neighbors of previously visited nodes. For each previously visited node, a top proportion of the highest-degree neighbors fulfills a quota and branches into a new caterpillar tree. Sampling more high-degree nodes constitutes an objective among various applications. Many exploration-based sampling algorithms suffer drawbacks that limit the sum of degrees of visited nodes and thus the number of high-degree nodes visited. Furthermore, a strategy may not be adaptable to volatile degree frequencies throughout the original network architecture, which influences how deep into the original network an algorithm could sample. This paper analyzes CQWS in comparison to four other exploration-based network in tackling these two problems by sampling sparse and dense randomly generated networks.

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%