In addition to metadata and standardised metadata schemas, research infrastructures can also use other forms of “knowledge representation system” to enhance the researcher’s experience of the interoperable data they present. When we talk about ‘Knowledge Representation Systems’ in research infrastructures, we usually mean a specific category of hierarchical systems of terms known more commonly as an ‘ontology’.  Before the digital age, philosophers referred to an ontology as “the study of the kinds of things that exist”.  Ontologies are similar to taxonomies, another knowledge organisation framework you probably remember from early lessons in biology.

Ontologies as Taxonomies

According to the Linnean Taxonomy, the relationships between kinds of animal can be described according to the hierarchy of:

Kingdom – Phylum – Class – Order – Family – Genes – Species.

A Golden Retriever is a dog, a dog is a canine etc..  Similar hierarchical thinking underlies a taxonomy as an ontology, although the relationships between the levels tend to be more clear and unique in an ontology.  In either case, the structure allows for a certain amount of reasoning to occur.  I may not have recorded in my data whether a hydra is a plant or an animal, but if I know that it is of the class ‘hydrozoa’, then I can work backwards up the taxonomy to determine its kingdom (animal).

As with metadata and metadata standards, we must remain mindful in using ontologies that we do not use them in ways that overestimate their applicability or underestimate the effect of applying them.  Ontologies are not objects in themselves, they are more like placeholders, or descriptors applied by human agents to specific objects.  These placeholders that make a definite statement regarding the position of an object in the world, and indeed, about the overall structure of that world.  As such, they model intelligent reasoning (I know a hydra is an animal) but only on a fragmentary basis (assuming that we can speak of ‘animals’ as a coherent group). All of this said, ontologies are exceptionally powerful for facilitating efficient computation, most often seen as effective searches where data may not be as well structured or as complete as would be ideal.  The ontology helps us fill in gaps in our knowledge or in our descriptive records, and for that they are a powerful addition to the suite of tools an infrastructure can use to support interoperability.

Simplified view of how to create an ontology

People who create ontologies like pizza: who doesn’t?

The following examples are based on a (now relatively infamous) exercise on ontology building.  Ontology engineering is a pretty complex discipline, but this will give you some sense of what it is like (and by extension, how it works and what its limitations are).

First of all, ontologies are based on a structure called a “triple.” Triples are like statements of fact, having a ‘subject,’ ‘verb’ and ‘object.’

“Pizza1 (S) hasCheeseTopping (V) MozzarellaCheese (O)” (= Pizza 1 has mozzarella cheese)

From this we can also infer: “MozzarellaCheese (S) isToppingof (V) Pizza1 (O)”

This may seem largely the same, but being able to include inverses in our reasoning gives another layer to functionality.  In this way we can start to define the parameters that define pizzas:

INFRA101 Pizza Ontology

In addition to lists of toppings, and indeed types of pizza bases, the ontology can also manage types of pizza which may be defined by certain rules (margherita has cheese and tomato only; vegetarian has no meat, etc.) or types of toppings with certain rules (spicy beef is very spicy, pineapple is not spicy).

You can see how some of these relationships are assigned in a common ontology editor called Protege in the following image:

INFRA101 Ontologies image

By the time you have finished establishing the full range of categories and properties, you will have something that looks a bit like this:

INFRA101 Ontologies image 2


‘What is Ontology? Introduction to the word and the concept’.  Kent Lofgren.
(YouTube video)
https://youtu.be/XTsaZWzVJ4c

’05-04 Web Ontology Language – OWL’ Lecture by Dr. Harald Sack, 4th video in a lecture, that is part 5 of a series of lectures.
(YouTube video)
https://youtu.be/LdsYkpFvYxU

‘A Simple Protege Tutorial’ playlist, 10 short videos in total.
Dr. Noureddin Sadawi
(YouTube video Playlist)
https://youtu.be/R9ERlUgvgwM?list=PLea0WJq13cnAfCC0azrCyquCN_tPelJN1  

‘Pizza Ontology’
Atif Khan
PDF / PowerPoint presentation
https://cs.uwaterloo.ca/~a78khan/docs/PizzaOntologyReview.pdf

‘Pizzas in 10 Minutes’
A Quick Demonstration of Handy Shortcuts and Features.
Alan Rector
Wiki page
http://protegewiki.stanford.edu/wiki/Protege4Pizzas10Minutes

‘Knowledge Representation and Reasoning’
Ronald J. Brachman & Hector J. Levesque
(Elsevier 2004)
Book
http://www.cin.ufpe.br/~mtcfa/files/in1122/Knowledge%20Representation%20and%20Reasoning.pdf

‘What is a Knowledge Representation?’
This paper appeared as R. Davis, H. Shrobe, and P. Szolovits. What is a Knowledge Representation?
AI Magazine, 14(1):17-33, 1993
Journal article
http://groups.csail.mit.edu/medg/ftp/psz/k-rep.html

Team Preservation and the Deadly Cryptic Conundrum
WePreserve
YouTube Video
https://youtu.be/Yun9hkPPF9M


You have completed ‘Interoperability’

 


 


Your progress through the “Introduction to Research Infrastructures” module

40%