skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Scale and correlation in multiscale geographically weighted regression (MGWR)
Abstract Multiscale geographically weighted regression (MGWR) extends geographically weighted regression (GWR) by allowing process heterogeneity to be modeled at different spatial scales. While MGWR improves parameter estimates compared to GWR, the relationship between spatial scale and correlations within and among covariates—specifically spatial autocorrelation and collinearity—has not been systematically explored. This study investigates these relationships through controlled simulation experiments. Results indicate that spatial autocorrelation and collinearity affect specific model components rather than the entire model. Their impacts are cumulative but remain minimal unless they become very strong. MGWR effectively mitigates local multicollinearity issues by applying varying bandwidths across parameter surfaces. However, high levels of spatial autocorrelation and collinearity can lead to bandwidth underestimation for global processes, potentially producing false local effects. Additionally, strong collinearity may cause bandwidths to be overestimated for some processes, which helps mitigate collinearity but may obscure local effects. These findings suggest that while MGWR offers greater robustness against multicollinearity compared to GWR, bandwidth estimates should be interpreted with caution, as they can be influenced by strong spatial autocorrelation and collinearity. These results have important implications for empirical applications of MGWR.  more » « less
Award ID(s):
2345820 2151970
PAR ID:
10598559
Author(s) / Creator(s):
;
Publisher / Repository:
Springer Science + Business Media
Date Published:
Journal Name:
Journal of Geographical Systems
Volume:
27
Issue:
3
ISSN:
1435-5930
Format(s):
Medium: X Size: p. 399-424
Size(s):
p. 399-424
Sponsoring Org:
National Science Foundation
More Like this
  1. Traffic crashes significantly contribute to global fatalities, particularly in urban areas, highlighting the need to evaluate the relationship between urban environments and traffic safety. This study extends former spatial modeling frameworks by drawing paths between global models, including spatial lag (SLM), and spatial error (SEM), and local models, including geographically weighted regression (GWR), multi-scale geographically weighted regression (MGWR), and multi-scale geographically weighted regression with spatially lagged dependent variable (MGWRL). Utilizing the proposed framework, this study analyzes severe traffic crashes in relation to urban built environments using various spatial regression models within Leon County, Florida. According to the results, SLM outperforms OLS, SEM, and GWR models. Local models with lagged dependent variables outperform both the global and generic versions of the local models in all performance measures, whereas MGWR and MGWRL outperform GWR and GWRL. Local models performed better than global models, showing spatial non-stationarity; so, the relationship between the dependent and independent variables varies over space. The better performance of models with lagged dependent variables signifies that the spatial distribution of severe crashes is correlated. Finally, the better performance of multi-scale local models than classical local models indicates varying influences of independent variables with different bandwidths. According to the MGWRL model, census block groups close to the urban area with higher population, higher education level, and lower car ownership rates have lower crash rates. On the contrary, motor vehicle percentage for commuting is found to have a negative association with severe crash rate, which suggests the locality of the mentioned associations. 
    more » « less
  2. Abstract Political and social processes that shape people's voting preferences might be linked to geographical location, varying from place to place, and operating at local, regional, and national scales. Here, we use a local modeling technique, multiscale geographically weighted regression (MGWR), to examine spatial and temporal variations in the influences of county‐level socio‐economic factors on voter preference during the 2008–2020 U.S. presidential elections. We argue that the local intercept in the MGWR model is an indicator of the effect of spatial context on voter preference and not only can this be separated from the effect of other socio‐economic factors, but it needs to be in order to prevent misspecification bias in the indicators of these other factors. We also identify strong and consistent divisions across the country in how context shapes election results. 
    more » « less
  3. null (Ed.)
    Abstract. Local spatiotemporal nonstationarity occurs in various naturaland socioeconomic processes. Many studies have attempted to introduce timeas a new dimension into a geographically weighted regression (GWR) model,but the actual results are sometimes not satisfying or even worse than theoriginal GWR model. The core issue here is a mechanism for weighting the effectsof both temporal variation and spatial variation. In many geographical andtemporal weighted regression (GTWR) models, the concept of time distance hasbeen inappropriately treated as a time interval. Consequently, the combinedeffect of temporal and spatial variation is often inaccurate in theresulting spatiotemporal kernel function. This limitation restricts theconfiguration and performance of spatiotemporal weights in many existingGTWR models. To address this issue, we propose a new spatiotemporal weightedregression (STWR) model and the calibration method for it. A highlight ofSTWR is a new temporal kernel function, wherein the method for temporalweighting is based on the degree of impact from each observed point to aregression point. The degree of impact, in turn, is based on the rate ofvalue variation of the nearby observed point during the time interval. Theupdated spatiotemporal kernel function is based on a weighted combination ofthe temporal kernel with a commonly used spatial kernel (Gaussian orbi-square) by specifying a linear function of spatial bandwidth versus time.Three simulated datasets of spatiotemporal processes were used to test theperformance of GWR, GTWR, and STWR. Results show that STWR significantlyimproves the quality of fit and accuracy. Similar results were obtained byusing real-world data for precipitation hydrogen isotopes (δ2H) in the northeastern United States. The leave-one-out cross-validation(LOOCV) test demonstrates that, compared with GWR, the total predictionerror of STWR is reduced by using recent observed points. Predictionsurfaces of models in this case study show that STWR is more localized thanGWR. Our research validates the ability of STWR to take full advantage ofall the value variation of past observed points. We hope STWR can bringfresh ideas and new capabilities for analyzing and interpreting localspatiotemporal nonstationarity in many disciplines. 
    more » « less
  4. Geographically Weighted Regression (GWR) is a widely used tool for exploring spatial heterogeneity of processes over geographic space. GWR computes location-specific parameter estimates, which makes its calibration process computationally intensive. The maximum number of data points that can be handled by current open-source GWR software is approximately 15,000 observations on a standard desktop. In the era of big data, this places a severe limitation on the use of GWR. To overcome this limitation, we propose a highly scalable, open-source FastGWR implementation based on Python and the Message Passing Interface (MPI) that scales to the order of millions of observations. FastGWR optimizes memory usage along with parallelization to boost performance significantly. To illustrate the performance of FastGWR, a hedonic house price model is calibrated on approximately 1.3 million single-family residential properties from a Zillow dataset for the city of Los Angeles, which is the first effort to apply GWR to a dataset of this size. The results show that FastGWR scales linearly as the number of cores within the High-Performance Computing (HPC) environment increases. It also outperforms currently available open-sourced GWR software packages with drastic speed reductions – up to thousands of times faster – on a standard desktop. 
    more » « less
  5. Animal-mediated seed dispersal is important for promoting forest regeneration and sustainability. Animal movement influences the distribution of seeds across the environment, resulting in spatially aggregated seed dispersal patterns. Animal seed dispersal patterns likely play an important role in the spatial structuring of tree populations: where a seed disperser moves influences the seed distribution. Environmental parameters that shape a disperser’s movement also influence the spatial distribution pattern of their seed dispersal. Orangutans are highly frugivorous and have been shown to disperse intact viable seeds. GPS locations were recorded for all orangutan defecations (n=1721) from 2014 to 2016 at the Cabang Panti Research Station in Gunung Palung National Park (GPNP), Indonesia. Our pilot research at GPNP measured seeds in fecal samples (n=98 fecal samples) and demonstrated that orangutan fecal samples do have intact seeds in more than 95% of t heir feces. A kernel density map was made using the defecation data to calculate the spatial density distribution of the defecations. A geographically weighted regression model (GWR) analyzed how well spatial parameters (altitude, slope, distance to river, and normalized difference vegetation index) predict the spatial density distribution of orangutan seed dispersal. All parameters in the GWR were statistically significant (R2=0.80, p<0.001) and showed low values for collinearity. The results show that orangutan seed dispersal is aggregated in space and the seed dispersal pattern is significantly shaped by environmental variables. This study provides us a better understanding of how the environment plays a role in determining animal behavior which influences the seed spatial distribution. Funders include the National Science Foundation (BCS-1638823), National Geographic Society, US Fish and Wildlife (F15AP00812), Leakey Foundation, Disney Wildlife Conservation Fund, and Nacey-Maggioncalda Foundation. 
    more » « less