Parent topic Previous topic Next topic 

Given the vast range of spatial analysis techniques that have been developed over the past half century many areas can only be covered to a limited depth, whilst others have been omitted because they are not implemented in current mainstream GIS products. This is a rapidly changing field and increasingly GIS packages are including analytical tools as standard built‑in facilities or as optional toolsets, add‑ins or analysts. In many instances such facilities are provided by the original software suppliers (commercial vendors or collaborative non‑commercial development teams) whilst in other cases facilities have been developed and are provided by third parties. Finally, many products offer software development kits (SDKs), programming languages and language support, scripting facilities and/or special interfaces for developing one’s own analytical tools or variants.

Throughout this Guide there are numerous examples of the use of software tools that facilitate geospatial analysis. In addition, some subsections of the Guide provide summary information about such tools and links to their providers. In addition to the present subsection these include those addressing:

·         GIS and related software

·         Sample software products (and Table 1‑1)

·         Software performance

·         Network analysis software

·         Guidelines for choosing a simulation/modelling (s/m) system

·         Simulation/modelling (s/m) systems for agent-based modelling, and

·         Principal software products cited (full web addresses)

Commercial software products rarely provide access to source code or full details of the algorithms employed. Typically they provide references to books and articles on which the procedure is based, coupled with online help and “white papers” describing their parameters and applications. This means that results produced using one package on a given dataset can rarely be exactly matched to those produced using any other package or through hand‑crafted coding. There are many reasons for these inconsistencies including: differences in the software architectures of the various packages and the algorithms used to implement individual methods; errors in the source materials or their interpretation; coding errors; inconsistencies arising out of the ways in which different GIS packages model, store and manipulate information; and differing treatments of special cases (e.g. missing values, boundaries, adjacency, obstacles, distance computations etc.).

Non‑commercial packages sometimes provide source code and test data for some or all of the analytical functions provided, although it is important to understand that “non‑commercial” often does not mean that users can download the full source code. Source code greatly aids understanding, reproducibility and further development. Such software will often also provide details of known bugs and restrictions associated with functions — although this information may also be provided with commercial products it is generally less transparent. In this respect non‑commercial software may meet the requirements of scientific rigour more fully than many commercial offerings, but is often provided with limited documentation, training tools, cross‑platform testing and/or technical support, and thus is generally more demanding on the users and system administrators. In many instances open source and similar not-for-profit GIS software may also be less generic, focusing on a particular form of spatial representation (e.g. a grid or raster spatial model). Like some commercial software, it may also be designed with particular application areas in mind, such as addressing problems in hydrology or epidemiology.

The process of selecting software tools encourages us to ask: (i) “what is meant by geospatial analysis techniques?” and (ii) “what should we consider to be GIS software?” To some extent the answer to the second question is the simpler, if we are prepared to be guided by self-selection. For our purposes we focus principally on products that claim to provide geographic information systems capabilities, supporting at least 2D mapping (display and output) of raster (grid based) and/or vector (point/line/polygon based) data, with a minimum of basic map manipulation facilities. We concentrate our review on a number of the products most widely used or with the most readily accessible analytical facilities. This leads us beyond the realm of pure GIS. For example: we use examples drawn from packages that do not directly provide mapping facilities (e.g. Crimestat) but which provide input and/or output in widely used GIS map-able formats; products that include some mapping facilities but whose primary purpose is spatial or spatio-temporal data exploration and analysis (e.g. GeoDa, GS+, STARS); and products that are general or special purpose analytical engines incorporating mapping capabilities (e.g. MATLab with the Mapping Toolbox, WinBUGS with GeoBUGS — for more details on these and other example software tools, please see Table 1‑1).

The more difficult of the two questions above is the first — what should be considered as “geospatial analysis”? In conceptual terms, the phrase identifies the sub-set of techniques that are applicable when, as a minimum, data can be referenced on a two-dimensional frame and relate to terrestrial activities. The results of geospatial analysis will change if the location or extent of the frame changes, or if objects are repositioned within it: if they do not, then “everywhere is nowhere”, location is unimportant, and it is simpler and more appropriate to use conventional, aspatial, techniques.

Many GIS products apply the term (geo)spatial analysis in a very narrow context. In the case of vector-based GIS this typically means operations such as map overlay (combining two or more maps or map layers according to predefined rules), simple buffering (identifying regions of a map within a specified distance of one or more features, such as towns, roads or rivers) and similar basic operations. This reflects (and is reflected in) the use of the term spatial analysis within the Open Geospatial Consortium (OGC) “simple feature specifications” (see further Table 4‑2). For raster-based GIS, widely used in the environmental sciences and remote sensing, this typically means a range of actions applied to the grid cells of one or more maps (or images) often involving filtering and/or algebraic operations (map algebra). These techniques involve processing one or more raster layers according to simple rules resulting in a new map layer, for example replacing each cell value with some combination of its neighbours’ values, or computing the sum or difference of specific attribute values for each grid cell in two matching raster datasets. Descriptive statistics, such as cell counts, means, variances, maxima, minima, cumulative values, frequencies and a number of other measures and distance computations are also often included in this generic term “spatial analysis”.

However, at this point only the most basic of facilities have been included, albeit those that may be the most frequently used by the greatest number of GIS professionals. To this initial set must be added a large variety of statistical techniques (descriptive, exploratory, explanatory and predictive) that have been designed specifically for spatial and spatio-temporal data. Today such techniques are of great importance in the social sciences, medicine and criminology, despite the fact that their origins may often be traced back to problems in the environmental and life sciences, in particular ecology, geology and epidemiology. It is also to be noted that spatial statistics is largely an observational science (like astronomy) rather than an experimental science (like agronomy or pharmaceutical research). This aspect of geospatial science has important implications for analysis, particularly the application of a range of statistical methods to spatial problems.

Limiting the definition of geospatial analysis to 2D mapping operations and spatial statistics remains too restrictive for our purposes. There are other very important areas to be considered. These include: surface analysis —in particular analysing the properties of physical surfaces, such as gradient, aspect and visibility, and analysing surface-like data “fields”; network analysis — examining the properties of natural and man-made networks in order to understand the behaviour of flows within and around such networks; and locational analysis. GIS-based network analysis may be used to address a wide range of practical problems such as route selection and facility location, and problems involving flows such as those found in hydrology. In many instances location problems relate to networks and as such are often best addressed with tools designed for this purpose, but in others existing networks may have little or no relevance or may be impractical to incorporate within the modelling process. Problems that are not specifically network constrained, such as new road or pipeline routing, regional warehouse location, mobile phone mast positioning or the selection of rural community health care sites, may be effectively analysed (at least initially) without reference to existing physical networks. Locational analysis “in the plane” is also applicable where suitable network datasets are not available, or are too large or expensive to be utilised, or where the location algorithm is very complex or involves the examination or simulation of a very large number of alternative configurations.

A further important aspect of geospatial analysis is visualisation — the creation and manipulation of images, maps, diagrams, charts, 3D static and dynamic views and their associated tabular datasets (see further, Slocum et al., 2008). GIS packages increasingly provide a range of such tools, providing static or rotating views, draping images over 2.5D surface representations, providing animations and fly-throughs, dynamic linking and brushing and spatio-temporal visualisations. This latter class of tools is the least developed, reflecting in part the limited range of suitable compatible datasets and the limited set of analytical methods available, although this picture is changing rapidly. 3D geometric and photo-realistic visualisation is becoming a sine qua non of modern systems, and we can expect to see an explosion of tools in this area over the coming years. All these facilities augment the core tools utilised in spatial analysis throughout the analytical process (exploration of data, identification of patterns and relationships, construction of models, and communication of results).

GIS software, especially in the commercial sphere, is driven primarily by demand and applicability, as manifest in willingness to pay. Hence, to an extent, the facilities available often reflect commercial and resourcing realities (including the development of improvements in processing and display hardware, and the ready availability of high quality datasets) rather than the status of development in geospatial science. Indeed, there may be many capabilities available in software packages that are provided simply because it is extremely easy for the designers and programmers to implement them, especially those employing object-oriented programming and data models. For example, a given operation may be provided for polygonal features in response to a well-understood application requirement, which is then easily enabled for other features (e.g. point sets, polylines) despite the fact that there may be no known or likely requirement for the facility. Despite this cautionary note, for specific well-defined or core problems, software developers will frequently utilise the most up-to-date research on algorithms in order to improve the quality (accuracy, optimality) and efficiency (speed, memory usage) of their products. For futher information on algorithms and data structures, see the online NIST Dictionary of algorithms and data structures.

Furthermore, the quality, variety and efficiency of spatial analysis facilities provide an important discriminator between commercial offerings in an increasingly competitive and open market for software. However, the ready availability of analysis tools does not imply that one product is necessarily better or more complete than another — it is the selection and application of appropriate tools in a manner that is fit for purpose that is important. Guidance documents exist in some disciplines that assist users in this process, e.g. Perry et al. (2002) dealing with ecological data analysis, and to a significant degree we hope that this Guide will assist users from many disciplines in the selection process.

  Back to Top    Back to Home Parent topic Previous topic Next topic