Previous topic Next topic 

If we look more closely at the Moran I index we see that it can be disaggregated to provide a series of local indices. We start with the standard formulation we provided earlier:

Examining the computation of this “global” index provided earlier in Figure 5‑24B, we can see that each row in the computation contributes to the overall sum (Figure 5‑27A), with the covariance component being 16.19 and overall Moran I value 0.03167. Each row item could be standardised by the overall sum of squared deviations and adjusted by the number of cells, n, as shown. The total of these local contributions divided by the total number of joins, ΣΣwij, gives the overall or Global Moran I value. These individual components, or Local Indicators of Spatial Association (LISA), can be mapped and tested for significance to provide an indication of clustering patterns within the study region.

However, these calculations require some adjustment in order to provide Global Moran’s I and LISA values in their commonly used form. The first adjustment is to the weights matrix. It is usual to standardise the row totals in the weights matrix to sum to 1. One advantage of row standardisation is that for each row (location) the set of weights corresponds to a form of weighted average of the neighbours that are included. Row standardisation may also provide a computationally more stable weights matrix. This procedure alters the Global Moran I value when rows have differing totals, which is the typical situation. For the example above the effect is to increase the computed Moran I to 0.0736. The second adjustment involves using (n‑1) rather than n as the row multiplier, and then the sum is re-adjusted by this factor to produce the Global index value. These changes are illustrated in Figure 5‑27B and Figure 5‑28 and correspond to the computations performed within GeoDa, R and SpaceStat (Anselin, personal communication). ArcGIS offers the option to have no weights matrix standardisation, row standardisation (dividing rows by row totals) or total weights standardisation (dividing all cells by the overall cell total). The Geary C contiguity index may also be disaggregated to produce a LISA measure, in the same manner as for the Moran I.

Another statistic, "G", due to Getis and Ord (1992, 1995) may also be used in this way. The latter is supported within ArcGIS as both a global index and in disaggregated form for identifying local clustering patterns, which ArcGIS classifies as a form of hot-spot analysis. The ArcGIS implementation facilitates the use of simple fixed Euclidean distance bands (threshold distances) within binary weights (as described below), but also permits the use of the Manhattan metric and a variety of alternative models of contiguity such as: inverse distance; inverse distance squared; so-called Zones of Indifference  (a combination of inverse distance and threshold distance); and user-defined spatial weights matrices.

Figure 5‑27 Local Moran's I computation

A. C*W: Adjustment by multiplication of the weighting matrix, W

B. C*W1: Adjustment by multiplication of the row-adjusted weighting matrix, W1

 

Figure 5‑28 LISA map, Moran’s I

Unlike Moran’s I and Geary’s C statistics, the Getis and Ord G statistic identifies the degree to which high or low values cluster together. The standard global form of the statistic is:

Whilst the local variant is

The weights matrix, W, is typically defined as a symmetric binary contiguity matrix, with contiguity determined by a (generalised) distance threshold, d. A variant of this latter statistic, which is regarded as of greater use in hot-spot analysis, permits i=j in the expression (so includes i in the summations, with wii=1 rather than 0 in the binary weights contiguity model) and is known as the Gi* statistic rather than Gi. Note that these local statistics are not meaningful if there are no events or values in the set of cells or zones under examination.

The expected value of G under Complete Spatial Randomness (CSR) is:

where n is the number of points or zones in the study area and W is the sum of the weights, which for binary weights is simply the number of pairs within the distance threshold, d. The equivalent expected value for the local variant is:

where n is the number of points or zones within the threshold distance, d, for point/region i. Closed formulas for the variances have been obtained but are very lengthy. They facilitate the construction of z-scores and hence significance testing of the global and local versions of the G statistics. For the local measure both computed values and z-scores may be useful to map.

The Rookcase add-in provides support for all three LISA measures with variable lag distance and number of lags. Further discussion of these methods is provided in several other sections of this Guide: in Section 5.2 covering exploratory spatial data analysis (ESDA); in Section 5.6.5 in the context of regression modelling; and in Sections 4.6 and Chapter 6 relating to grid files, since there are parallels between several of the concepts applied.

  Back to Top    Back to Home  Previous topic Next topic