|
|
Grid or raster data files often have a single attribute associated with each cell position. Datasets of this type can be thought of as triples (x,y,z) where x and y are the cell coordinates and z the attribute values. z-values might be: a colour code in the case of image data; a binary value, for example in the case of converted vector data (without additional attributes); a cell classification number, such as the type of vegetation cover associated with that location; or a measured or calculated data value, such as an interpolated figure for soil heavy metal content in parts per million (ppm) or the height of a surface in metres. In all instances the set of values observed in a single grid of size M-rows and N-columns is limited, either by the coding or classification procedure adopted, or by the fact that ultimately only MxN values can possibly be recorded.
This set of values may be summarised and analysed using basic univariate statistics (non-spatial analysis) and many GIS packages provide a wide number of simple single-grid measures. These include many of the basic statistics described in Table 1‑4, including the arithmetic mean, max, min, count, sum, variety, minority, majority, range, mid-range, variance and standard deviation. Each of these operations applies to the set of individual cells of a single grid and may be described as local statistics.
Simple statistical operations may be applied to groups of cells, for example the immediate 8 neighbourhood (or window) of each cell or to individual cells across multiple grids (see also, Section 5.3). Where regular neighbourhoods of cells are involved such operations are termed focal, or if the neighbourhoods are defined using pre-defined (irregular) zones, then the operations are described as zonal. MFWorks, for example, includes a range of focal operations within its SCAN facility based on a sample window size and shape (square or circular). It allows for windows of varying sizes specified in terms of grid cells or measured distance, with facilities to include masked-off regions and annular-shaped windows. A matching set of facilities is provided via its SCORE option for zonal analysis.
There are a large number of basic grid operations supported for image and general raster files. These include local, focal and zonal operations depending on the scope of the operation. Such operations may be applied to a single grid, or to a number of input grids, depending on the operation in question. The set of possible operations of this type are often referred to as Map Algebra. Originally this term was introduced by Tomlin (1983) as the process of map combination for co-registered layers with rasters of identical size and resolution. Combinations involved arithmetic and Boolean operations. However the term is now used more widely by many suppliers. For example, ArcGIS describes the set of all operations performed using its Spatial Analyst option as “Map Algebra”. More specifically it divides such functions into five main categories, the three above plus Global and Application-specific:
· Local functions, which include mathematical and statistical functions, reclassification, and selection operations
· Focal functions, which provide tools for neighbourhood analysis
· Zonal functions, which provide tools for zonal analysis and the calculation of zonal statistics
· Global functions, which provide tools for full raster layer or raster dataset analysis, for example the generation of Cost Distance rasters
· Application functions, specific to tasks such as hydrology and geometric transformation
MFWorks, a software package that is a direct descendent of Tomlin’s original Map Analysis Package (MAP) software (now no longer under development however), includes a similarly broad range of grid functions. Its Operations menu includes the following:
· Classification: tools to re-code, slice (classify) single input layers, and to combine multiple layers using a range of local and focal operations
· Statistical: a number of statistical tools for grid processing, including focal (scan) and zonal (score) statistical calculations
· Proximity: tools to identify clumps of similar cells, to generate grid Voronoi polygons, and to generate cost distance grids
· Surface: computation of surface characteristics, such as gradient and orientation (focal functions)
· Network: tools to compute the number and pattern of stream networks
· Zone/area: tools to measure areas, boundaries of areas and to facilitate the creation of partitions
· Flow characterisation: hydrographic tools, such as identifying flow directions, drainage patterns, flows and accumulations across the surface
· Transformation: facilities to re-sample and/or adjust (warp) input grids
· Surface processing: procedures for filling-in pits and defined areas
· Interpolation: a range of different interpolation methods
· Visualise: image processing functions, such as merging input bands and applying filters, such as image sharpening
Similar facilities are provided in many of the mainstream raster-based GIS packages, such as PCRaster, TNTMips and Idrisi, and in the raster-processing toolsets of more generic packages such as GRASS and the Vertical Mapper facility for MapInfo. Many of these are described in greater detail in Chapter 6.
The map algebra concept is capable of being extended in several ways. One example is extension to explicitly handle the time domain coupled with 1, 2 or 3-dimensional spatial datasets. Mennis et al. (2005A, 2005B) have developed such a model and implemented a set of Java classes for handling what they describe as multidimensional map algebra (mma). Performance issues are one of the principal barriers to widespread use of such concepts, but as improved data structures and processing architectures emerge for spatio-temporal datasets, concepts of this kind are likely to be more widely applied. A second example of extension to map algebra ideas is map calculus, put forward by Haklay (2004, 2006). Haklay proposes retaining the functional form of surfaces to be combined, wherever possible, and performing operations on these forms rather than the resultant grids. For example, instead of generating a high resolution grid from a point dataset using some interpolation function, F(), one could retain the functional form F() and only generate the resulting grid as an output when required. This has the potential to provide a more manageable approach to spatio-temporal modelling than the mma procedure in some instances.
|
|