“…For example, Hawkes processes [Hawkes, 1971] produce clusters via a self-excitatory model: each event increases the probability of subsequent events. There has been a resurgence of interest in these models, and recent work has generalized the basic Hawkes process [Wang et al, 2012, Zhou et al, 2013, Linderman and Adams, 2014, Wu et al, 2019, Dutta et al, 2020, established new theory [Chen et al, 2017a], incorporated neural networks into the model [Mei and Eisner, 2016, Zuo et al, 2020, Zhang et al, 2020a, developed new inference and estimation algorithms [Simma and Jordan, 2010, Halpin, 2012, Rasmussen, 2013, Wheatley et al, 2016, Chen et al, 2017b, Shelton et al, 2018, Kirchner and Bercher, 2018, Zhang et al, 2020b, and specialized them to new domains like email and social media exchanges [Gomez-Rodriguez et al, 2010, Blundell et al, 2012, Farajtabar et al, 2015, Guo et al, 2015, He et al, 2015, Kobayashi and Lambiotte, 2016, Li et al, 2017, Mohler et al, 2018, online education [Jiang et al, 2021], crime modeling [Mohler et al, 2011], biology [Carstensen et al, 2010, Verma et al, 2021, healthcare , and epidemiology [Choi et al, 2015, Garetto et al, 2021, Chiang et al, 2021. One advantage of Hawkes processes is that parameters can often be inferred by simple maximum likelihood estimation.…”