Data models and data specifications

Data models and data specifications


Use of models, scope and specification objectives. Modelling frameworks and applied examples


SADL KU Leuven

WELCOME!

With the following slides and interactive material you will be able to take part in the journey to discover Spatial Data Infrastructures (SDI) its components and benefits through the observation of multiple examples and exercises. 


You can navigate through the course by pressing the navigation arrows at the bottom of each slide or using your arrow keys on your keyboard. You can move horizontally (← →) for viewing each theme and vertically (↑↓) to view extra recommended information


navigation arrows

MetadataData models and data specifications

#Contents
1Motivation and Background
2Scope & objectives of data specifications for SDI’s
3The modelling framework for data specifications (ISO 19131)
4Development of data specifications
5Examples of data models: ISO 19152

SDI components (example of INSPIRE)


Interoperability of data

Motivation and Background

The starting point


  • Access to spatial data in various ways: copies via CD


  • User has to deal with interpreting heterogeneous data in different formats, identify, extract and post-process the data needed



→ Lack of interoperability

Examples of incompatibility and inconsistency of spatial data

Semantic and schematic differences

Semantic and schematic differences

Levels of heterogeneity (1)

Syntactic heterogeneity


Data may be implemented in a different syntax of different paradigms, such as relational or object- oriented models. Syntactic heterogeneity is also related to the geometric representation of geographic objects, e.g., raster and vector representations.


Structural or schematic heterogeneity


Objects in one database are considered as properties in another, or object classes can have different aggregation or generalisation hierarchies, although they might describe the same Real World concepts.

Levels of heterogeneity (2)

Semantic heterogeneity


A Real World concept may have more than one meaning to comply with various disciplines, giving as a consequence semantic heterogeneity.


e.g. Different classifications/definitions of roads when viewed from different perspectives: traffic network route directions, spatial planning… <> 1 on 1 match

Interoperability of data (1)

Technical interoperability

should guarantee that system components can interoperate


Semantic interoperability

should guarantee that data content is understood by all in the same way

Interoperability of data (2)

…what a SDI aiming at




Provide access to spatial data via network services and according to a harmonised data specification to achieve interoperability of data


Datasets used within organizations may remain unchanged


Data or service providers have to provide a transformation between their internal model and the harmonised data specification

How?


Facilitate data use and interoperability by adopting common cross-domain models to exchange data








DATA INTEROPERABILITY

Implementation alternatives



Conclusion: common Data Specifications is the goal

Member States should make data available within the scope of INSPIRE using


  • the same spatial object types (and definitions)

  • the same attributes (and definitions, types, code lists) and relationships to other types, e.g. BuildingHeight, BuildingSize

  • a common encoding (GML application schemas)

  • common portrayal rules



This facilitates interoperability and pan-European/cross-border applications (e.g. information systems, reporting systems, forecasting models)

Targeted benefit


Source: EC Joint Research Centre

Example: key requirements of the INSPIRE directive (1)

Art 3(7): “Interoperability means the possibility for spatial data sets to be combined, and for services to interact, without repetitive manual intervention, in such a way that the result is coherent and the added value of the data sets and services is enhanced”

Art 7(1): “Implementing rules laying down technical arrangements for the interoperability and, where practicable, harmonisation of spatial data sets and services … shall be adopted…. Relevant user requirements, existing initiatives and international standards for the harmonisation of spatial data sets, as well as feasibility and cost-benefit considerations shall be taken into account in the development of the implementing rules.”

Example: key requirements of the INSPIRE directive (2)




Art 8(2): The implementing rules shall address the following aspects of spatial data:


(a) a common framework for the unique identification of spatial objects, to which identifiers under national systems can be mapped in order to ensure interoperability between them;


(b) the relationship between spatial objects;


(c) the key attributes and the corresponding multilingual thesauri commonly required for policies which may have an impact on the environment;


(d) information on the temporal dimension of the data;


(e) updates of the data.


02 | Scope & objectives of data specifications for SDI’s

Thematic scope



SDI data scope

Scope is spatial data – not all kinds of thematic/descriptive data









Re-use the INSPIRE data specs for own usage


  • Extensions

  • Additional constraints

  • Re-use of common objects

Exercise 1: Find your scope

Go to INSPIRE website https://inspire.ec.europa.eu/inspire-tools

Use the tool “Find your scope” (toolkit):


  1. In catalogue of INSPIRE objects:


    • find “zone” --> limit to only “Spatial object type” --> narrow search “terrestrial zone”

    • Which Object, INSPIRE Data Theme, Application Schema

    • What are the other possible specialisations of TransportArea?


  2. To find the Spatial object that should be used for a dataset that stores the locations of stations where magnetic measurements are performed. (use “Direct Search”)


  3. Find your own scope…

1

Example of INSPIRE

Exercise 1: Result (1)

PortArea – Transport Networks – Water TN

Exercise 1: Result (2)

Direct search: “magnetic field”

Relevant objects? Observed Event (NZ) vs Geoph Station (GE - Geophysics)

03 | The modelling framework for data specifications (ISO 19131)

Data harmonisation and data specs aspects

Example from

INSPIRE

Harmonisation General Principles

The modelling framework

Harmonisation Schemas

Harmonisation Translations

Harmonisation Identification

Harmonisation Data Quality

Harmonisation Other aspects

Some wrap-up questions

The modelling framework


Where does the abbreviation GCM stands for (used in the INSPIRE context)?


How many thematic domains, divided in how many annexes, are addressed by the INSPIRE directive?


Which kind of heterogeneity?







What are next from “Encoding” and “Harmonised vocabularies” the other two major cornerstones of data interoperability?

04 | Development of data specifications

What is a data specification?


DATA SPECIFICATION

II

Synonym to data product specification



Detailed description of a data set or data set series together with additional information that will enable it to be created, supplied to and used by another party


[ISO 19131]

Data Specification




Step-wise methodology

Use case development

Step 1


Major sources are:


  • European environmental policies

  • User requirements survey

  • SDIC/LMO reference material

  • EU-funded initiatives and projects

Identification of user requirements and spatial object types

Step 2


Identify requirements on:

the data content

metadata, data quality, portrayal and other elements of the data specification

As-is analysis

Step 3


Analyse the current situation regarding spatial data sets for the theme, based on:


  • the reference material

  • existing internationally standardised data specifications

  • expertise/field experts

Example of As-is analysis

Gap analysis

Step 4


Compare identified data sources with identified user requirements

Data specification development

Step 5


The data specifications must be designed to ensure easy mapping between existing data and the harmonised data specification.


Consider:


  • No excessive costs

  • No collection of new data!

Implementation, validation and Cost-Benefit Analysis

Development

Step 6-7


  • review process

  • test under real world conditions

  • analyse costs and benefits


  • Final round of harmonisation

Which level of harmonisation is “just right”?

Requires:


  • an iterative process

  • well-established requirements

  • good understanding of the existing geographic information

  • testing and validation

Result

Data specification for all Annex Themes


  • Textual description of the data model

  • UML model

  • GML application schema

05 | Examples of data models: ISO 19152

CP - Scope


  • The scope of the cadastral information in the INSPIRE context is limited to the geographic side of the cadastral information systems (land administration)

  • INSPIRE does not aim at harmonising the concepts of ownership and rights related to the parcels

  • Cadastral parcels should serve the purpose of generic information locators. Having included the reference to the national registers as a property (attribute) of the INSPIRE parcels, national data sources can be reached.

CP - Bacground


  • All countries run a register

    • Usually a partition of the country with exceptions

  • Basic unit of the system is the parcel

  • The cadastral parcels should be, as much as possible, single areas of Earth surface (land and/or water) under homogenous real property rights and unique ownership, where real property rights and ownership are defined by national laws.

CP - Basic components


  • Parcel (basic unit)

  • Subdivision (municipalities, sections, districts, parishes, urban or rural blocks, etc)

    • Carry information for the parcels inside the subdivision: accuracy or scale

  • Cadastral boundaries

    • Only neccessary if spatial accuracy is associated with them

CP - Application schema

CP – Feature types

  • CadastralParcel (mandatory)

  • CadastralZoning (auxiliary)

  • CadastralBoundary (auxiliary)

  • BasicPropertyUnit (auxiliary)



Core Profile

CP - Requirements & Recommendations

CP - Requirements & Recommendations

CP - Requirements & Recommendations

CP - Geometry

0-, 1-,2-,2,5 dimensional geometries

CP - Enumerations/codelists

Land Administration Domain Model

ISO 19152

Major packages of the data model


  • Party

  • Administrative

  • Spatial unit

  • Surveying and representation


Reference list




  • ISO/TC 211 Geographic information/Geomatics. (2007). ISO 19131:2007. ISO. Retrieved October 19, 2020, from https://www.iso.org/cms/render/live/en/sites/isoorg/contents/data/standard/03/67/36760.html

  • Brodeur, J., & Badard, T. (2008). Modeling with ISO 191xx Standards. In S. Shekhar & H. Xiong (Eds.), Encyclopedia of GIS (pp. 705–716). Springer US. https://doi.org/10.1007/978-0-387-35973-1_811

  • ISO, I. (2015). 19109: 2015 Geographic information-Rules for application schema. International Organisation for Standardization.