Advances in Network Clustering and Blockmodeling 2019
DOI: 10.1002/9781119483298.ch11
|View full text |Cite
|
Sign up to set email alerts
|

Bayesian Stochastic Blockmodeling

Abstract: This chapter provides a self-contained introduction to the use of Bayesian inference to extract large-scale modular structures from network data, based on the stochastic blockmodel (SBM), as well as its degree-corrected and overlapping generalizations. We focus on nonparametric formulations that allow their inference in a manner that prevents overfitting, and enables model selection. We discuss aspects of the choice of priors, in particular how to avoid underfitting via increased Bayesian hierarchies, and we c… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
154
0
2

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 137 publications
(156 citation statements)
references
References 115 publications
0
154
0
2
Order By: Relevance
“…Two of the recent developments are worth singling out. The first is the nested microcanonical SBM (Peixoto, 2017a), which establishes the MDL (Peixoto, 2013) and its equivalence with the usual Bayesian inference approach, incorporates an efficient MCMC algorithm (Peixoto, 2014a) and a hierarchical structure (Peixoto, 2014b) to model K and circumvent the issue with potential underfitting due to the maximum K detectable in an SBM (Peixoto, 2013). One possible direction is to apply these methods and techniques to hypergraphs as well, as they are still a growing field to our knowledge.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Two of the recent developments are worth singling out. The first is the nested microcanonical SBM (Peixoto, 2017a), which establishes the MDL (Peixoto, 2013) and its equivalence with the usual Bayesian inference approach, incorporates an efficient MCMC algorithm (Peixoto, 2014a) and a hierarchical structure (Peixoto, 2014b) to model K and circumvent the issue with potential underfitting due to the maximum K detectable in an SBM (Peixoto, 2013). One possible direction is to apply these methods and techniques to hypergraphs as well, as they are still a growing field to our knowledge.…”
Section: Discussionmentioning
confidence: 99%
“…(We refrain from calling it the marginal likelihood, which we refer to the likelihood with Z also integrated out.) Furthermore, this π(Y|Z, λ) can be split into the product of the two underbraced terms, which is the joint likelihood of a microcanonical model (Peixoto, 2017a). It is termed "microcanonical" because of the hard constraints imposed, as Y and Z together fix the value of E, and therefore π(Y|Z, λ) = π(Y, E|Z, λ) = π(Y|E, Z, λ) × π(E|Z, λ).…”
Section: Microcanonical Sbmmentioning
confidence: 99%
“…Another drawback of this approach is that the stochastic block model requires the selection of the number of communities, because selecting a large number of blocks always leads to a high likelihood of generating the observed network. Therefore, recent works [23,24,26] adopt Bayes model selection to find the appropriate number of communities in a network. According to Occam's Razor, this approach also minimizes the description length (MDL) of the block model [25,26] so that community detection algorithm finds the most suitable number of communities.…”
Section: Discussionmentioning
confidence: 99%
“…Alternatively, according to the Occam's Razor, the model inference process can take into account the complexity of the model, which can be measured by the model description length [25]. Other work [26] also uses the Bayesian model selection to determine the number of the communities in a network.…”
Section: Model Inferencementioning
confidence: 99%
“…Community detection 132 We combined a stochastic block model (SBM) [7] and a consensus clustering approach 133 to uncover the mesoscopic block structure of the correlation networks. In the 134 microcanonical formulation of the SBM that we used, one minimizes the description 135 length of the network [8], such that one partitions the nodes in the network G into B by a partition b is given by the following Bayesian posterior probability:…”
mentioning
confidence: 99%