1 | Basics of GIS | 2 |
2 | Spatial data | 4 |
3 | Spatial reference systems | 3 |
4 | Spatial data basic analysis | 3 |
5 | Data visualization | 2 |
Learning outcomes:
# | Content |
1.1 | What is GIS? |
1.2 | Uses of GIS |
Geographic Information System (GIS)
"is a computer system capable of capturing, storing, analyzing, and displaying geographically referenced information; that is, data identified according to location. Practitioners also define a GIS as including the procedures, operating personnel, and spatial data that go into the system."
"is a computer-based tool for mapping and analyzing things that exist and events that happen on earth. GIS technology integrates common database operations such as query and statistical analysis with the unique visualization and geographic analysis benefits offered by maps."
The spatial (geographic) part differentiates a GIS from a standard computer database.
A GIS consists of five key components: hardware, software, data, people, and methods.
The main goal of GIS is to provide spatial information to decision makers.
Data means simply facts or figures - pieces of information, but not information itself.
Data is collected and stored in databases. When data are processed, interpreted, organized, structured or presented so as to make them meaningful or useful, they are called information. Information provides context for data.
In a GIS, spatial analysis and modelling are the main source of information.
Spatial analysis - a set of methods and tools for performing operations on spatial data in order to obtain additional information.
Geospatial software and tools:
|
# | Content |
2.1 | Definition and properties |
2.2 | Vector data model |
2.3 | Raster data model |
2.4 | Comparing vector and raster data models |
Spatial object means an abstract representation of a real-world phenomenon related to a specific location or geographical area.
Spatial data means any data with a direct or indirect reference to a specific location or geographical area.
Spatial data set means an identifiable collection of spatial data.
Data model is a set of guidelines to convert the real world (called entity) to the digitally and logically represented spatial objects consisting of the attributes and geometry.
The attributes are managed by thematic or semantic structure while the geometry is represented by geometric-topological structure.
Spatial data describes shape, location, spatial relationships and attributes of features related to the Earth's surface.
Two common spatial data models:
A vector data model defines discrete objects such as fire hydrants, rivers, lakes.
A vector data models divided into three basic types:
All three of these types of vector data are composed of coordinates and attributes attached to the geometry.
|
|
|
Topology is required to determine spatial relationships between objects in a GIS.
If the features are deformed (e.g. through projections or datum transformations), some properties change: area, shape, direction, distance, relative proximities.
Other properties (topological properties) remain constant after distortion: adjacency, containment, connectivity.
Benefits of topology:
Three examples of properties that remain constant after deformation
An attribute is a nonspatial information about a geographic feature in a GIS, usually stored in a table and linked to the feature by a unique identifier (ID). A database or tabular file containing information about a set of geographic features, usually arranged so that:
The attribute values can be used to find, query, analyze and symbolize features. |
Each column in the database may contain different type of data.
Basic data types:
Vector data sources
(A - GPS measurements, B - list of coordinates, C - digitizing and conversion tools e.g. raster to vector, D, E - existing databases)
A raster data model defines continous data and phenomena.
Raster's are:
A raster consists of a matrix of cells (or pixels) organized into rows and columns (or a grid) where each cell contains one value representing information such as temperature, elevation, or spectral data. Pixel - smallest visible element of an image. Grid - 2-D object feature that represents a single element of a continous surface. |
Cells are identified by their positions in the grid. Raster data is georeferenced by: |
A spatial resolution refers to the dimension of the cell size representing the area covered on the ground. Higher resolution means better feature quality but it means also bigger raster file size.
A raster dataset contains one or more layers called bands.
A band is represented by a single matrix of cell values.
For example, a digital elevation model (DEM) is a single-band raster (has one band holding elevation values) while satellite imagery is a multispectral image and has multiple bands.
Three main ways to display single-band raster datasets:
Three ways to display raster dataset (binary image, grayscale image and color image)
Raster data can also have attributes only if pixels are represented using a small set of unique integer values. Raster datasets that contain attribute tables typically have cell values that represent or define a class, group, category, or membership.
In raster datasets, each row of an attribute table corresponds to a certain zone of cells having the same value.
The attribute tables can be used to analyze datasets and symbolize raster cells.
Note: Not all GIS raster data formats can store attribute information.
An example of raster dataset with attribute table
(A - orthophoto, B - satellite imagery, C - DEM, D - scanned maps and plans, E - conversion and analysis tools e.g. vector to raster, interpolation)
properties | vector | raster |
---|---|---|
depict | discrete features | continous data |
geometry | coordinates | cells organized into a grid |
attributes | attribute table (with many attributes) | cell value (only one attribute) |
analysis | geoprocessing | map algebra, overlays |
data structure | more complex | more simple |
size | compact data structure – little storage space | greater storage needed |
file formats | ESRI Shapefile, GML, KML, geoJSON, GPX | geoTIFF, IMG, grid |
# | Content |
3.1 | Definition |
3.2 | Geographic Coordinate System |
3.3 | Projected Coordinate System |
A spatial reference describes where features are located in the real world. It is a key component of spatial data and applications, and differentiates a GIS from a standard databases.
A spatial reference system (SRS) or coordinate reference system (CRS) is a coordinate-based local, regional or global system used to locate geographical entities.
Champ de Mars, 5 Avenue Anatole France, 75007 Paris, France
48° 51' 29.1348''N; 2° 17' 40.8984''E = 48.858093; 2.294694
448256.00 m E; 5411928.00 m N
Two common types of coordinate systems used in a GIS:
Coordinate systems (both geographic and projected) provide a framework for defining real-world locations.
A geographic coordinate system is a reference system for identifying locations on the curved surface of the earth. Locations on the earth’s surface are measured in angular units from the center of the earth relative to two planes: the plane defined by the equator and the plane defined by the prime meridian (which crosses Greenwich England). A location is therefore defined by two values: a latitudinal value and a longitudinal value.
Latitude and Longitude (Wikimedia Commons)
|
Latitude and Longitude - relationships
|
Geographic Coordinate Systems can only be used to measure angles, not distances or areas.
Projection is a method by which the curved surface of the Earth is portrayed on a flat surface. This requires a systematic mathematical transformation of the Earth’s graticule of lines of longitude and latitude onto a plane. All projection types can be aggregated into three groups: planar, cylindrical and conical.
The idea of projection - mapping the Earth on a flat surface
In a projected coordinate system, locations are identified by X, Y coordinates on a grid.
Unlike a geographic coordinate system, a projected coordinate system can be used to measure distances and areas.
A planar projection maps the Earth's surface to a flat surface (QGIS Documentation)
Planar projections can be of three different types:
Three different types of planar projection (polar, equatorial, oblique)
A cylindrical projection maps the Earth's surface to a flat surface (QGIS Documentation)
A conical projection maps the Earth's surface to a flat surface (QGIS Documentation)
The EPSG codes are 4-5 digit numbers that represent coordinate reference system definitions.
Most geographic information systems use EPSG codes as Spatial Reference System Identifiers (SRIDs) and EPSG definition data for identifying projections and performing transformations between these systems.
Common EPSG codes:
# | Content |
4.1 | Attribute query |
4.2 | Spatial query |
4.3 | Geoprocessing |
Query is a request to select features or records from a database. Often written as a statement or logical expression.
Attribute query is the process of searching and retrieving records of features in a database based on desired attribute values.
Typically, attribute query is performed using a criteria-based query language, most commonly SQL.
SQL (eng. Structured Query Language) is a domain-specific language used in programming and designed for managing data held in a relational database management system (RDBMS).
In most cases, the desired information can be given as a set of criteria based on the available attributes. These criteria are formatted in the appropriate query language as a Boolean expression, which can be validated as either true or false for each record in the database.
Individual criteria can be constructed and combined using:
|
An attribute query consists of three basic elements: attribute field, operator, and attribute value.
1 element of the query is ALWAYS:
SELECT * FROM "name of field" WHERE operator 'value'
Examples:
Several elements of the query can be combined with operators: OR and AND.
LIKE - search for a specified pattern in a column
BETWEEN ... AND ... - select values within a given range (concerns numbers, text, or dates)
IN (..., ...) - specify multiple values
IS (NOT) NULL - test for (non)empty values
OR - result of query contains features which have one or another attribute value
"owner" = 'Smith' OR "owner" = 'Jones'
AND - result of query contains features which have two attributes at the same time
"owner" = 'Smith' AND "street" = 'Lisson St.'
("owner" = 'Evans' OR "owner" = 'Smith') AND "street" = 'Lisson St.' |
"owner" = 'Evans' OR ("owner" = 'Smith' AND "street" = 'Lisson St.') |
Query is a request to select features or records from a database. Often written as a statement or logical expression.
Spatial query is the process of searching and retrieving records of features in a database based on location or spatial relationship.
Spatial relationships between objects: Theoretical example: different spatial relations between objects A (buildings) and objects B (landslides) |
Select: buildings located outside of the landslide Spatial relation: objects A and B do not have any common space Result of request: 17 buildings |
Select: buildings located partially within the landslide Spatial relation: objects A and B overlap partially, but are not completely contained by each other Result of request: 4 buildings |
Select: buildings located either fully or partially within the landslide Spatial relation: objects A and B share any portion of space Result of request: 11 buildings |
Select: buildings containing the landslide (theoretical case) Spatial relation: objects A contain objects B within their boundaries Result of request: 2 buildings |
Select: buildings located fully within the landslide Spatial relation: objects A are completely inside objects B Result of request: 5 buildings |
Select: buildings touching the border of the landslide Spatial relation: objects A and B have at least one point in common, but their interiors do not intersect Result of request: 1 building |
Select: buildings with the same geometry as the landslide (theoretical case) Spatial relation: objects A and B have strictly equal geometries Result of request: 1 building |
A buffer is a zone around a map feature measured in units of distance or time.
A buffer is useful for proximity analysis.
Buffer tool and its parameters
# | Content |
5.1 | Data visualization |
5.2 | Creating maps |
Data visualization is the graphic representation of data.
Spatial data is usually presented in the form of maps. There are many different types of maps to show different types of information.
In a GIS, one of the most common types of maps are thematic maps that can represent a variety of information including climate, vegetation, population, and many others.
A single vector layer can be represented in different ways.
A GIS software provides many different methods of symbolization, e.g.:
You can also set other layer display properties such as:
Single symbol applies the same symbol to all features in a layer. This method of symbology is used for representing a layer with only one category.
Properties of symbol according geometry types: |
Unique values symbolize qualitative categories of values. Unique value symbology can be based on one or more attribute fields in the dataset, or you can write an expression to generate values on which to symbolize.
Graduated colors symbology is used to show a quantitative differences in features by varying the color of symbols. Data is classified into ranges that are each assigned a different color to represent the range.
An example of graduated color map
Graduated symbols symbology is used to show a quantitative differences in features by varying the size of symbols. Data is classified into ranges that are each assigned a symbol size to represent the range.
An example of graduated symbol map
A chart is a type of statistical graphic that represents data.
Chart symbology show quantitative differences between attributes, with each part of the chart representing an attribute value that contributes to the overall whole set of values.
An example of chart map
A map can be composed of many different map elements.
They may include:
Thank you for your attention!