You are on page 1of 36

Spatial Information

Spatial information is information in 2, 3 or 4 dimensions. It is information where location has some importance or benefit and is not necessarily about locations on the surface of the Earth (e.g., can be a body organ or system). "Spatial" has to do with any multi-dimensional frame, e.g.,:

Medical images are referenced to the human body; Engineering drawings are referenced to a mechanical object; Architectural drawings are referenced to a building.

A location can be a body organ or system. Screenshot of BodyViewer, an ArcView extension developed by GeoHealth, Inc. BodyViewer allows users to map any database that contains a geographic reference (e.g., postal codes, street addresses, census tract numbers) and an ICD-9 or ICD-10 code (diagnoses).

Geographic information is a subset of spatial information, though the terms are often used interchangeably. "Geographic" is concerned with planet Earth: its twodimensional surface, its three-dimensional atmosphere, oceans, and sub-surface. (Bryan, 2000)

Estimates are that 80% of all data has a spatial component. This data can be queried and analysed to answer questions such as "How many (e.g., healthcare facilities, patients with a specific profile, etc.)?" "What kind/type (e.g., type of healthcare facility: GP surgery, district hospital, teaching hospital, specialised centre, etc.)?" "Where are they located (e.g., relationships between populations, their locations and available/planned healthcare facilities)?" (See also: 'Spatial
Analysis'.)

Geographically referenced data refers to data referenced by location on Earth (e.g., latitude/longitude, northing/easting) in some standard format.

Geographic References and Geocoding


Geographic information contains either an explicit geographic reference, such as a latitude and longitude or national grid co-ordinate, or an implicit reference such as an address, postal code, census tract name, forest stand identifier, or road name. An automated process called geocoding is used to create explicit geographic references (multiple locations) from implicit references (descriptions such as addresses). These geographic references allow us to locate features, such as a business or forest stand, and events, such as an earthquake or disease outbreak, on the earth's surface for analysis. Geographic information systems rely on two interrelated types of databases:

The Spatial Database


Describes the location and shape of geographic features, and their spatial relationship to other features. The information contained in the spatial database is held in the form of digital co-ordinates, which describe the spatial features. These can be points (for example, hospitals), lines (for example, roads), or polygons (for example, administrative districts). Normally, the different sets of data will be held as separate layers, which can be combined in a number of different ways for analysis or map production.

The Attribute Database


The attribute database is of a more conventional type; it contains data describing characteristics or qualities of the spatial features (i.e., descriptive information): land use, type of soil, distance from the regional centre, or, using the same examples as in the preceding paragraph, number of beds in the hospital, type of road, population of the administrative districts. Thus, we could have health districts (polygons) and health care centres (points) in the spatial database, and characteristics of these features in the attribute database, for instance persons having access to clean water, number of births, number of 1 year old children fully immunised, number of health personnel, and so on. (Loslier, 1995 in GIS for Health and the Environment) GIS links spatial data with geographic information about a particular feature on a map. The information is stored as attributes of the graphically represented feature.

Example: A line that denotes a road tells you nothing but its location. An attribute table stores all relevant (descriptive) information about this feature, which can be queried and displayed in many formats based on the users needs.

Points, Lines and Polygons


As mentioned above, GIS attempts to describe all features in geometric terms.

Point: discrete location represented as a co-ordinate pair (e.g., sampling locations, disease cases, hospitals, and town centroids). Line (Arc): set of ordered co-ordinates represented by a string of coordinates (e.g., streams, power and pipelines, and transportation routes). Polygon (Area): closed feature whose boundary encloses a homogeneous area represented by a closed string of co-ordinates which encompass an area (e.g., land use, lakes, census tracts, hospital catchment area, and town boundaries).

Many features can be described by either a point or a polygon. Similarly, lines can be of a specific width. Map scale and resolution define the conditions for appropriate application of these feature types. The uses of co-ordinate based analysis are only limited by the imagination of the user.

Vector and Raster GIS

There are two major methods to input, store and visualise mapped data in GIS. Geographic Information Systems which store map features in vector format store points, lines and polygons with high accuracy. They are preferred in urban

applications where legal boundaries and the analysis of networks are important. Applications of urban GIS include location and allocation of critical resources such as hospitals, study of disease outbreak patterns and crime analysis. Raster Geographic Information Systems, which store map features in raster or grid format, generalise the location of features to a regular matrix of cells. Raster GIS data structures are preferred for digital elevation modelling (DEM records terrain elevations for ground positions at regularly spaced horizontal intervals see USA photo below), statistical analysis, remotely sensed data, simulation modelling, and natural resource applications like sedimentation and water quality studies.

In raster-based analysis, the areas of analysis are divided into squares of uniform size (cells). Each cell characterises the feature of interest within this area with a single value. Digital image data, including aerial photos and satellite imagery, are stored in raster format (as pixels). GRID cell-based modelling uses the raster format to determine routing patterns and terrain.

An aerial photo

A digital elevation model (DEM) of the USA

Vector data on the other hand, are coordinate-based data structures commonly used to represent linear features (polygons can be formed by closed strings of co-ordinates). Each feature in this format is represented as a list of ordered x,y co-ordinates. Computer algorithms exist that can convert data of one type to the other.

Thematic Mapping
Maps in Geographic Information Systems are represented thematically. A standard topographic map will show roads, rivers, contour elevations, vegetation, human settlement patterns and other features on a single map sheet. In a GIS these features are categorised separately and stored in different map themes or overlays. For example, roads will be stored in a separate overlay. Likewise, rivers and streams will each be stored as a separate theme. This way of organising data in the GIS makes maps much more flexible to use since these themes can be combined in any manner that is useful (individual themes can be 'turned on' and 'off' as needed).

Choropleth Maps
These are thematic maps portraying properties of a surface using area symbols such as shading. Area symbols on a choropleth map usually represent categorised classes of the mapped phenomenon, e.g., population density.

Examples of choropleth maps

Maps Defined
A map is a graphic representation of some part of the earths surface. A map usually contains a series of themes or coverages that are often combined to form the final product. A map also contains descriptive information (e.g., legend) to help readers interpret the details on the map.

Map Scale
The map scale tells the user how the map relates to the real world features it represents.

Scale: describes the relation between a single map unit to the number of same units in the real world, e.g., 1:1000 (1 inch on the map = 1000 inches in the real world - see also 'Resolution' below). Scale Bar: compares the map units to an established real-world unit of measure, e.g., 1 inch = 2.5 miles. It helps users measure real-world distances on the map.

The term map is also used to describe a GIS Project or View. A map can provide an interpretation of features on the earths surface. Scale, map units and data layers (themes or coverages) are an inherent part of a GIS and allow users to conduct spatial queries and measure distances in their projects when needed.

Co-ordinate Systems
In a GIS, locations on the earths surface described by points, lines, and polygons are defined by a series of x,y co-ordinates. Co-ordinate systems can be selfdescribed or in units that relate to the real world. Decimal degrees; degrees,

minutes, seconds; metres; and feet are all examples of units of measure in a coordinate system. A degree or is a unit of measurement equal to 1/360 of a circle. A degree of latitude on the earth's surface is about 69 miles. A degree of longitude is about 69 miles at the equator and undefined at the poles, but any point on the surface rotates through a degree of longitude in about 4 minutes of time. Minute or ' is the sixtieth part of a degree of angular measurement, often represented by the sign ' as in 12 30', read 12 degrees, 30 minutes. Arcsecond or " is the sixtieth part of a minute of angular measure often represented by the sign ", as in 30", which is read 30 seconds. Decimal degrees are the decimal representation of fractions of degrees. Many paper maps express co-ordinates in degrees, minutes, seconds (e.g., 40 30'), where minutes and seconds are fractions of degrees. 30 minutes equal half a degree, and 30 seconds equal half a minute. GIS software, however, expresses coordinates in decimal degrees (e.g., 40.50 degrees), where fractions of degrees are expressed as decimals. Thus, the longitude: 40 degrees, 30 minutes, would be expressed in ArcView as 40.5 degrees. Try this: Online Conversions of Degrees, Minutes, Seconds and Decimal Degrees for Co-ordinates by the Audio Services Division of the Federal Communications Commission, US x,y co-ordinates define the location of map features. Co-ordinate systems must be consistent between map layers. For any database to be useful for spatial analysis, the database must be registered to a recognised global co-ordinate system. A co-ordinate system consists of:

A Spheroid: a mathematical description of the earths shape. A Map Projection: a mathematical conversion from spherical to planar coordinates.

Map Projections

The accuracy of measurements, e.g., of distance, done on a map is affected by the type of map projection used to draw this map.

Resolution

The accuracy with which a given map scale can depict the location and shape of map features is known as resolution. The larger the map scale, the higher the possible resolution. As map scale decreases, resolution diminishes and feature boundaries must be smoothed, simplified, or not shown at all. Resolution plays a large role in GIS, especially in raster-based modelling.

Resolution plays a large part in the ability of a map to accurately describe the earths features. It is essential that the user be mindful of the scale of the data layers. Serious errors can result if the theme lacks sufficient resolution to effectively describe an area of interest. A GIS does not tell its user that she has made an error in choosing the right data layer for her project.

Topology
Topology is a mathematical procedure for explicitly defining spatial relationships.

Arcs connect to each other at nodes (connectivity), Arcs that connect to surround an area define a polygon (area definition), and Arcs have direction and right and left sides (contiguity).

Connectivity: Arc-Node Topology

Points along the arc that define its shape are called vertices. Endpoints of arcs are called nodes. Arcs join only at nodes.

Area Definition: Polygon-Arc Topology

Polygons are represented as a series of x,y co-ordinates that connect to define an area. The GIS also stores the list of arcs that make up the polygon.
Contiguity

Every arc has a direction. The GIS maintains a list of polygons on the left and right side of each arc. The computer uses this information to determine which features are next to one another.

Getting Data into a GIS - Sources of Electronic Data Files

This can be done by:


Digitising hard copy maps (see 'Digitisers' below); Keyboard entry of co-ordinate data (co-ordinates are added as a series of
numbers defining the location of a point, the shape of a line, or the co-ordinates that define a closed area (polygon); very accurate; requires minimal conversion, but can be time intensive);

Electronic entry using a data file; Scanning a map manuscript; and Converting or reformatting existing data.

The Global Positioning System (GPS) can be also used in creating maps.

Electronic data files are the easiest way to get data into a GIS. Ready-to-use data sources include:

DXF: Auto-CAD files/Scanning products; DLG: Digital Line Graphs available from the USGS (US Geological Survey URI: http://www.usgs.gov/pubprod/index.html); TIGER: 1990+ Census files (US, URI: http://tiger.census.gov/); Ordnance Survey Maps: Britain's National Mapping Agency (URI: http://www.ordsvy.gov.uk); Bartholomew Mapping Solutions - Free Samples for ArcView and MapInfo (London, Great Britain and Europe) SHP: ArcView Shape files (many available for download from Health Geomatics 'Resources' page); Can also add point data using dBase file with x,y co-ordinates in decimal degrees.

Digitisers

A digitiser converts spatial features on a hard copy map into digital format. Point, line and area features are converted into x,y co-ordinates. The process involves manually tracing all features of interest using an electronic stylus. Good base maps must be used. After digitising, a procedure known as transformation converts digitiser units to a real-world co-ordinate system. Tics are used to provide the relationship between the two co-ordinate systems.

Product: Coverage
This term is used in a GIS to describe a spatial dataset that has a particular theme. A coverage consists of topologically linked geographic features. For maximal analytical power, each theme should exist as a separate coverage; different feature types can coexist in a coverage if they describe the same data.

Four Main Types of Maps/Data Exist:

Base Maps: include streets and highways; boundaries for census, postal, and political areas; rivers and lakes; parks and landmarks; place names; and USGS raster maps. Business Maps and Data: include data related to census/demography, consumer products, financial services, healthcare, real estate, telecommunications, emergency preparedness, crime, advertising, business establishments, and transportation. Environmental Maps and Data: include data related to the environment, weather, environmental risk, satellite imagery, topography, and natural resources. General Reference Maps: world and country maps and data that can be a foundation for a GIS database.

Data Availability and Quality Issues


One potential problem with GIS is the availability and quality of data needed by such systems to use and analyse in order to produce meaningful results. The data provided may be inaccurate or incomplete or inappropriate for some particular use, e.g., not of the appropriate geographic scale needed in some situation (Albert et al, 2000). Sometimes, due to patient privacy and confidentiality issues, researchers do not have access to point patient data and census tract data (polygon; aggregated data) is all that is available. Data is usually accompanied by descriptions (metadata). Spatial data quality standards are now in place to help users understand what is out there and the intended purpose of the dataset.

==========================================

Spatial Analysis and GIS Functions


Please be patient! This page takes about 5 minutes to download over a 56.6K modem (13 seconds over T1), but is worth waiting for it.

Spatial Analysis in ArcView (executable video adapted from ESRI educational material) Spatial analysis involves data queries performed on geo-referenced information to answer complex questions, like "How many people live within one mile of a hazardous waste site in the United Kingdom?"

From Simple Questions to Analytic Questions


Once we have a functioning GIS containing our geographic information, we can begin to ask simple questions such as:

How far is it between two places? (measuring distances) Where is land zoned for industrial use?

And analytical questions such as:


Where are all the sites suitable for building new houses (or new healthcare facilities)? If we build a new highway here, how will traffic be affected?

Spatial Analysis not just a map. With GIS, users can turn data into information, ask questions and interact with the system

GIS provides both simple point-and-click query capabilities and sophisticated analysis tools to provide timely information to managers and analysts alike. GIS technology really comes into its own when used to analyse geographic data to look for patterns and trends and to undertake "what if" scenarios. By applying the latest GIS to public health research, it is possible to confirm an existing hypothesis about the cause of a certain disease, to identify previously unstudied, preventable causes of a disease and to determine people at risk (risk assessment). Modern GIS have many powerful analytical tools, but two are especially important: proximity analysis and overlay analysis.

Proximity Analysis

How many houses lie within 100 m of this water main? What is the total number of patients within 10 km of this healthcare facility? What proportion of identified cases lies within 500 m of a suspected well (as source of infection)?

To answer such questions, GIS technology uses a process called buffering to determine the proximity relationship between features.

Overlay Analysis
The integration of different data layers involves a process called overlay. At its simplest, this could be a visual operation, but analytical operations require one or more data layers to be joined physically. This overlay, or spatial join, can for example link land-use and environmental data to population and disease data. Let's consider this example: we want to assess the risk to people living in a certain area regarding their exposure to some carcinogenic pesticide that had been sprayed and polluted the soil. We can use GIS to calculate the distance from the residential parcels to the likely sources of pollution.

But since forests reduce drift from aerial spraying of pesticides by capturing some of the spray in their foliage, we should also overlay data about land use (e.g., forests, homes, etc.) with data about sources of pollution (e.g., sprayed areas, cranberry bogs, etc.) to see where forests come between pollution and homes (protecting homes from pollution). GIS could be then used to identify the

areas with the greatest risk of exposure by selecting residential land-use polygons that intersect or are adjacent to pesticide sources.

Data Linkage for Analysis

Multiple databases can be linked and related

Analysis requires data linkage, within the same dataset and/or in a second dataset. GIS uses geography, or space, as the common key element between datasets. Information is linked only if it relates to the same geographic area. GIS can communicate with conventional DBMS.

Spatio-Temporal Analysis
By adding a temporal (time) dimension to spatial data and analysis, we can track changes that might occur regarding some variable/condition within the same location with time. Also the variable/condition we are studying might change locations with time, or extend beyond the original location to involve additional ones.

Spatial Analytic Techniques for Medical Geographers


(Albert and Gesler, 2000)

See also:

Diggle P. Spatial Statistics for Environmental Epidemiology. Medical Statistics Unit, Lancaster University, UK. 2000 The GeoMed Advisor: A Disease Clustering Advisor: The advisor will ask you a
series of questions about your data and the kind of clustering you wish to detect, then suggest an appropriate test (Source: UMICH.EDU, US)

Visualisation
For many types of geographic operation the end result is best visualised as a map or graph. Maps are very efficient at storing and communicating geographic

information. While cartographers have created maps for millennia, GIS provides new and exciting tools to extend the art and science of cartography. Map displays can be integrated with reports, three-dimensional views, photographic images, and other output such as multimedia.

GIS combines various display methods, all of them very useful in supporting the decision maker

In the Health Sector, GIS Can Answer the Following Questions and Serve the Following Functions:

Some Real-Life Examples:

Site location and client distance

Modelling future trends

Clusters and comparison of data

Environmental monitoring and modelling - toxic plume

Types of GIS
N.B.: The following GIS types are not necessarily mutually exclusive and a GIS application can be always classified under more than one type.

Four-dimensional GIS

While spatio-temporal geo-representations can handle two dimensions of space and one of time, four-dimensional GIS are designed for three dimensions of space and one of time.

Multimedia/hypermedia GIS
Multimedia/hypermedia GIS allow the user to access a wide range of georeferenced multimedia data (e.g., simulations, sounds and videos) by selecting resources from a georeferenced image map base. A map serving as the primary index to multimedia data in a multimedia geo-representation is termed a hypermap. Multimedia and virtual geo-representations can be stored either in extended relational databases, object databases or in application-specific data stores. Raper (2000) mentions several examples including a multimedia atlas that has been developed for the tourist information system in China.

Imagine having a real-time (using telecommunication links) multimedia GIS that can audio-visually monitor different locations at the same time; one application of such system could be as a decision support tool to improve the analysis of traffic problems and the assessment of noise in urban planning and management. Since video is a spatio-temporal projection of the world in imagery, it can be considered capable of fully multi-dimensional geo-representation. It can be used for mapping from moving platforms, for target positioning, for the measurement of geo-phenomena such as air pollution, and as a process monitoring system. Video imaging has also become a recognised and valuable new technique of remote sensing since the 1990s.

Web GIS
Widespread access to the Internet, the ubiquity of browsers and the explosion of commodified geographic information has made it possible to develop new forms of multimedia geo-representations on the Web. ESRI, for example, provides a good range of Internet mapping solutions based on their RouteMAP IMS and ArcIMS (Internet Mapping Server) architectures. Many current geomatics solutions are Web-based overtaking the traditional Desktop environment and most future one are expected to follow the same direction. ESRI promotional video on the use of GIS in crime- and fire-fighting and by insurance companies and even people looking for a house to buy; the last part of this video, "trying to locate a house to buy", describes www.realtor.com, a Web GIS (Format: RealVideo; Running Time: 2:42 min. - Source: ESRI, US)

Related Links: HealthQuery The National geographic MapMachine ATM Locator

Virtual Reality GIS


Virtual Reality GIS have been developed to allow the creation, manipulation and exploration of geo-referenced virtual environments, e.g., using VRML modelling (Virtual Reality Modelling Language). Virtual Reality GIS can be also Web-based. Applications include 3D simulation for planning (to experiment with different scenarios).
Related Links:

An Investigation into User Requirements for Advanced Spatial Decision Support Systems (University of Nottingham, UK)

This project (now completed) was funded by the UK Economic and Social Research Council (ESRC) with the aim to assist users in environmental design tasks, using three key

technologies: Geographical Information Systems, Virtual Reality and Computer-Supported Cooperative Work.

Web3D Consortium

Real-time GIS
With the availability of real-time positioning systems and cost-effective mobile telecommunications, it has become possible to develop real-time GIS that monitor, transmit, record and analyse the movement of mobile agents such as vehicles, people or animals (telegeomonitoring). In transportation for example, many organisations need to monitor the position of their vehicles for scheduling or safety reasons. Each mobile agents transmits positioning information to the appropriate control centre(s), where it is entered into a database of timestamped positions. Some applications can monitor the proximity of the mobile agents to specified locations or advise on alternative routes based on traffic reports. Imagine such applications in the context of an ambulance fleet of cars. Real-time GIS can also include location-based services, where a moving agent receives information depending on its location (GMS - Geographic Messaging Services), e.g., in tourism, giving and updating relevant details of attractions to tourists as they approach different areas.

Reference:
Raper J. Multidimensional Geographic Information Science. London: Taylor and Francis. 2000 [ISBN 0-7484-0507-0]

You might also like