geogra ph i cal data

32
Geographical Data Types, relations, measures, classifications, dimension, aggregation

Upload: anne

Post on 16-Jan-2016

25 views

Category:

Documents


0 download

DESCRIPTION

Geogra ph i cal Data. Type s , relati ons , m easures , classificati on s, dimensi on , aggregati on. To be seen on maps. urban. grass. water. te x t (nam e , e levation ). dik e. Topogra ph ic map. C lassif ied isolin e map. To be seen on maps. Choropleth map: - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Geogra ph i cal Data

Geographical Data

Types, relations, measures, classifications, dimension,

aggregation

Page 2: Geogra ph i cal Data

To be seen on maps

Topographic map

urban

grass

water

dike

text(name,elevation)

Classified isoline map

Page 3: Geogra ph i cal Data

To be seen on maps

Choropleth map:Map with administrativeboundaries which shows per region a value by a color or shade

Use of pesticide 1_3_Dper county

Page 4: Geogra ph i cal Data

Maps show ...• Relation of place (geographic location) to a value

(here 780 mm precipitation) or name (here is Minnesota).

• An abstraction (model, simplification) of reality • A combination of themes (different sorts of data)• Connections (subway maps)

Tokyo subway map

Page 5: Geogra ph i cal Data

Scales of measurement

• Nominal scale• Ordinal scale• Interval scale• Ratio scale• ( Angle/direction, vector, … )

Classification of types of data by statistical properties (Stevens, 1946)

Page 6: Geogra ph i cal Data

Nominal scale

• Administrative map (names of the countries)• Landuse map (names of landuse: urban, grass,

forest, water, …)• Geological map (names of soil types: sand,

clay, rock, …)

Finite number of classes, each with a name.

Testing is possible for equivalence of name.

Page 7: Geogra ph i cal Data

Ordinal scale

• School type (VMBO, HAVO, VWO)• Wind force on schale of Beaufort (0=no

wind, ... 6=heavy wind, …, 9=storm, ...)• Questionnaire-answers (disagree, partly

disagree, neutral, partly agree, agree)

Finite number of classes, each with a name

Testing for equivalence of name and for order

Page 8: Geogra ph i cal Data

Interval scale

• Temperature in degrees Celsius or Fahrenheit• Time/year on Christian calendar

Unbounded number of classes, each with a value

Testing for equivalence, for order and for difference(a unit distance exists)

Page 9: Geogra ph i cal Data

Ratio scale

• Measurements: concentration of lead in soil• Counts: population, number of airports• Percentages: unemployment percentage,

percent of landuse type forest

Unbounded number of classes, each with a value

Testing for equivalence, for order, for difference and for ratio (a natural zero exists)

Page 10: Geogra ph i cal Data

Examples

Page 11: Geogra ph i cal Data

Overview

nominal Categories equivalence number of occurrences, mode

ordinal Categories … and order … and median

interval Unbounded … and difference

… and average

ratio Unbounded … and ratio

two data collection

Page 12: Geogra ph i cal Data

Other scales

• Angle (wind direction, direction of spreading)• Vector: angle and value (primary wind

direction and speed)• Categorical scales with partial membership

(fuzzy sets; points on indeterminate boundary between “plains” and “mountains”; location of coast line: tide)

Page 13: Geogra ph i cal Data

Example

Page 14: Geogra ph i cal Data

Classification schemes

Data on nominal scale: hierarchical classification schemes

nature

water

working

houses

flats

cattle

plants fruit

urban

agriculture

living

landuse

Page 15: Geogra ph i cal Data
Page 16: Geogra ph i cal Data

Classification schemes

Data on interval and ratio scales

• Fixed intervals

• Fixed intervals based on spread

• Quantiles: equal representatives

• “Natural” boundaries

[1-10], [11-20], [21-30]

[4-11], [12-19], [20-27]

4, 5, 5, 8, 12, 14, 17, 23, 27

[4-5], [8-14], [17-27]

[4-5], [8-17], [23-27]

Page 17: Geogra ph i cal Data

Classification schemes, cont’d

• Statistical boundaries: average , standard deviation , then e.g. boundaries - 2, - , , + , + 2

• Arbitrary

Page 18: Geogra ph i cal Data

Two classifications

Four equal intervalsQuartiles

Counties of Arizona, total population

Page 19: Geogra ph i cal Data

Why is choice of classification important?

• Visualization often needs classification• Choice of class intervals influences

interpretation

Think of a report that addresses air pollution due to a factory made by the board of the factory or by an environmental organization

Page 20: Geogra ph i cal Data

Data: object and field view

• Object view: discrete objects in the real world– road– telephone pole– lake

• Field view: geographic variable has a “value” at every location in the real world– elevation– temperature– soil type– land cover

Page 21: Geogra ph i cal Data

Reference system

• Data according to the scales of measurement are attribute values in a reference system

• A geographical reference system is spatial, temporal or both

At 12 noon of August 26, 1999 , a temperature of 17.6 degrees Celsius is measured at 5 degrees longitude and 53 degrees latitude

Page 22: Geogra ph i cal Data

Spatial objects

• Points; 0-dimensional, e.g. measurement point

• (Polygonal) line; 1-dimensional, e.g. border between Bolivia and Peru

• Polygons; 2-dimensional, e.g. Switzerland

• Sets of points, e.g. locations of accidents• Systems of lines (trees, graphs), e.g. street

network• Sets of polygons, subdivisions, e.g. island

group, provinces of Nederland

Page 23: Geogra ph i cal Data

Dependency of dimension

• Dimension of an object can be scale dependent: Rhine river at scale 1 on 25.000 is 2-dim.; Rhine at scale 1 on 1.000.000 is 1-dim.

• Dimension of an object can be application dependent: Rhine as transport route is 1-dim.(length is relevant; not the surface area); Rhine as land cover in Nederland is 2-dim.

Page 24: Geogra ph i cal Data

The third dimension

• Elevation can be considered an attribute on the ratio (!?) scale at (x,y)-coordinates

• For civil engineering: crossing of street and railroad can be at the same level, or one above the other

• Data on subsurface layers and their thickness

Page 25: Geogra ph i cal Data

The time component

• Same region, same themes, different dates: Allows computation of change

• Trajectories give the locations at certain times for moving objects

Page 26: Geogra ph i cal Data

Level of aggregation

Income of an individual

Average income in a municipality

Average income in a province

Average income in a country

Higher level of aggregation

Page 27: Geogra ph i cal Data

Various aggregations in the Netherlands

• Prinvines (12)• Municipalities (441)• COROP regions (40)• Water districts (39)• Economic-geographic regions (129)• 2- and 4-number postal codes• Macro-regions (4 of 5; provinces joined)• Labor exchange district (127), planning region

(43), nodal region (80), ...

Page 28: Geogra ph i cal Data

Aggregation: dangers

• MAUP: modifiable areal unit problem

0 - 12 - 45 -

Located occurrences of a rare disease

clustering?

Page 29: Geogra ph i cal Data

Aggregation: dangers

• MAUP: modifiable areal unit problem

0 - 12 - 45 -

Aggregation boundarieshave got nothing to do with mapped theme

Located occurrences of a rare disease

clustering?

Page 30: Geogra ph i cal Data

Aggregation: dangers

• Not enough aggregation: privacy violations(e.g. AIDS-cases with complete postal code)

• Correction for population spread is necessaryin case of data on people

0 - 12 - 45 -

Located occurrences of a rare disease

clustering?

Page 31: Geogra ph i cal Data

Huntington’s disease,1800-1900

Page 32: Geogra ph i cal Data

Summary

• Data is geometry, attribute, and time• Data is coded in a reference system• Attribute data is usually on one of the standard

scales of measurement• Classification of interval and ratio data is needed

for mapping (isoline or choropleth) and histograms• The object view and field view exist• Geometric data has a dimension (point, line, area),

but this may depend on scale and application• Data is often spatially aggregated