Geographical kernel weighting is proposed as a method for deriving local summary statistics from geographically weighted point data. These local statistics are then used to visualise geographical variation in the statistical distribution of variables of interest. U nivariate and bivariate summary statistics are considered, for both moment-based and order-based approaches. Several aspects of visualisation are considered. F inally, an example based on house price data is presented. #
Geographically weighted regression (GWR) is a spatial statistical technique that recognizes traditional 'global' regression models may be limited when spatial processes vary with spatial context. GWR captures process spatial heterogeneity via an operationalization of Tobler's first law of geography: "everything is related to everything else, but near things are more related than distant things" (1970). An ensemble of local linear models are calibrated at any number of locations by 'borrowing' nearby data. The result is a surface of location-specific parameter estimates for each relationship in the model that may vary spatially, as well as a single bandwidth parameter that provides intuition about the geographic scale of the processes. A recent extension to this framework allows each relationship to vary according to a distinct spatial scale parameter, and is therefore known as multiscale (M)GWR. This paper introduces mgwr, a Python-based implementation for efficiently calibrating a variety of (M)GWR models and a selection of associated diagnostics. It reviews some core concepts, introduces the primary software functionality, and demonstrates suggested usage on several example datasets.
Under the realization that Geographically Weighted Regression (GWR) is a data-borrowing technique, this paper derives expressions for the amount of bias introduced to local parameter estimates by borrowing data from locations where the processes might be different from those at the regression location. This is done for both GWR and Multiscale GWR (MGWR). We demonstrate the accuracy of our expressions for bias through a comparison with empirically derived estimates based on a simulated data set with known local parameter values. By being able to compute the bias in both models we are able to demonstrate the superiority of MGWR. We then demonstrate the utility of a corrected Akaike Information Criterion statistic in finding optimal bandwidths in both GWR and MGWR as a trade-off between minimizing both bias and uncertainty. We further show how bias in one set of local parameter estimates can affect the bias in another set of local estimates. The bias derived from borrowing data from other locations appears to be very small.
The concept of scale is inherent to, and consequential for, the modeling of geographical processes. However, scale also causes huge problems because the results of many types of spatial analysis appear to be dependent on the scale of the units for which data are reported (measurement scale). With the advent of local models and the fundamental difference in their scale of application compared to global models, this issue is exacerbated in unexpected ways. For example, a global model and local model calibrated using data measured at the same aggregation scale can also result in different and sometimes contradictory inferences (the classic Simpson’s Paradox). Here we provide a geographical perspective on why and how contrasting inferences might result from the calibration of a local and global model using the same data. Further, we examine the viability of such an occurrence using a synthetic experiment and two empirical examples. Finally, we discuss how such a perspective might inform the analyst’s conundrum: when the respective inferences run counter to one another, do we believe the local or global model results?
Bandwidth, a key parameter in geographically weighted regression models, is closely related to the spatial scale at which the underlying spatially heterogeneous processes being examined take place. Generally, a single optimal bandwidth (geographically weighted regression) or a set of covariate-specific optimal bandwidths (multiscale geographically weighted regression) is chosen based on some criterion such as the Akaike Information Criterion (AIC) and then parameter estimation and inference are conditional on the choice of this bandwidth. In this paper, we find that bandwidth selection is subject to uncertainty in both single-scale and multi-scale geographically weighted regression models and demonstrate that this uncertainty can be measured and accounted for. Based on simulation studies and an empirical example of obesity rates in Phoenix, we show that bandwidth uncertainties can be quantitatively measured by Akaike weights, and confidence intervals for bandwidths can be obtained. Understanding bandwidth uncertainty offers important insights about the scales over which different processes operate, especially when comparing covariate-specific bandwidths. Additionally, unconditional parameter estimates can be computed based on Akaike weights accounts for bandwidth selection uncertainty.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.