Chapter 9. Basic OWL

In previous chapters, we saw how RDFS-Plus as a modeling system provides considerable support for distributed information and federation of information. Simple constructs in RDFS-Plus can be combined in various ways to match properties, classes, and individuals. We saw its utility in application to social networking (FOAF) and knowledge organization (SKOS); although RDFS-Plus has provided considerable and valuable infrastructure for these projects, we also identified capabilities required by these systems that RDFS-Plus cannot provide. In this chapter, we go further into the modeling capabilities of OWL, beyond RDFS-Plus, which provides a systematic treatment of information description. OWL provides constructs for describing information structure that will satisfy many of the outstanding requirements of FOAF and SKOS, as well as a number of more general information integration issues.

We continue our presentation of OWL with a treatment of owl:Restriction. This single construct opens up the representational power of OWL by allowing us to describe classes in terms of other things we have already modeled. As we shall see, this opens up whole new vistas in modeling capabilities.

RESTRICTIONS

Suppose we have defined in RDFS a class we call BaseballTeam, with a particular subclass called MajorLeagueTeam, and another class we call BaseballPlayer. The roster for any particular season would be represented as a property playsFor that relates a BaseballPlayer to a BaseballTeam. Certain players are special in that they play for a MajorLeagueTeam. We’d like to define that class and call it MajorLeaguePlayer. If we are interested in the fiscal side of baseball, we could also be interested in the class of Agents who represent Major League Players, then in the bank accounts controlled by those Agents, and so on.

One of the great powers of the Semantic Web is that information that has been specified by one person in one context can be reused either by that person or by others in different contexts. There is no expectation that the same source who defined the roster of players will be the one that defines the role of the agents or of the bank accounts. If we want to use information from multiple sources together, we need a way to express concepts from one context in terms of concepts from the other. In OWL, this is achieved by having a facility with which we can describe new classes in terms of classes that have already been defined. This facility can also be used to model more complex constructs than the ones we’ve discussed so far.

We have already seen how to define simple classes and relationships between them in RDFS and OWL, but none of the constructs we have seen so far can create descriptions of the sort we want in our Major League Baseball Player example. This is done in OWL using a language construct called a Restriction.

Consider the case of a MajorLeaguePlayer. We informally defined a MajorLeaguePlayer as someone who plays on a MajorLeagueTeam. The intuition behind the name Restriction is that membership in the class MajorLeaguePlayer is restricted to those things that play for a MajorLeagueTeam. Since a Restriction is a special case of a Class, we will sometimes refer to a Restriction as a Restriction Class just to make that point clear.

More generally, a Restriction in OWL is a Class defined by describing the individuals it contains. This simple idea forms the basis for extension of models in OWL: If you can describe a set of individuals in terms of known classes, then you can use that description to define a new class. Since this new class is now also an existing class, it can be used to describe individuals for inclusion in a new class, and so on. We will return to the baseball player example later in this chapter, but first we need to learn more about the use of restriction classes.

Example: Questions and Answers

To start with, we will use a running example of managing questions and answers, as if we were modeling a quiz, examination, or questionnaire. This is a fairly simple area that nevertheless illustrates a wide variety of uses of restriction classes in OWL.

Informally, a questionnaire consists of a number of questions, each of which has a number of possible answers. A question includes string data for the text of the question, whereas an answer includes string data for the text of the answer. In contrast to a quiz or examination, there are typically no “right” answers in a questionnaire. In questionnaires, quizzes, and examinations, the selection of certain answers may preclude the posing of other questions.

This basic structure for questionnaires can be represented by classes and properties in OWL. Any particular questionnaire is then represented by a set of individual questions, answers, and concepts, and particular relationships between them.

FIGURE 9-1 Question, answer, and the properties that describe them.

The basic schema for the questionnaire is as follows and is shown diagrammatically in Figure 9-1. Throughout the example, we will use the namespace q: to refer to elements that relate to questionnaires in general, and the namespace d: to refer to the elements of the particular example questionnaire.

q:Answer a owl:Class.

q:Question a owl:Class.

q:optionOf a owl:ObjectProperty;

rdfs:domain q:Answer;

rdfs:range q:Question;

owl:inverseOf q:hasOption.

q:hasOption a owl:ObjectProperty.

q:answerText a owl:DatatypeProperty;

rdfs:domain q:Answer;

rdfs:range xsd:string.

q:questionText a owl:FunctionalProperty,

owl:DatatypeProperty;

rdfs:domain q:Question;

rdfs:range xsd:string.

A particular questionnaire will have questions and answers. For now, we will start with a simple questionnaire that might be part of the screening for the helpdesk of a cable television and Internet provider:

What system are you having trouble with?

Possible answers (3): Cable TV, High-Speed Internet, Both

What television symptom(s) are you seeing?

Possible answers (4): No Picture, No Sound, Tiling, Bad Reception

This is shown as follows and graphically in Figure 9-2.

FIGURE 9-2 Some particular questions and their answers.

d:WhatProblem a q:Question;

q:hasOption d:STV, d:SInternet, d:SBoth;

q:questionText "What system are you having trouble with?".

d:STV a q:Answer;

q:answerText "Cable TV".

d:SInternet a q:Answer;

q:answerText "High-speed Internet".

d:SBoth a q:Answer;

q:answerText "Both".

d:TVsymptom a q:Question;

q:questionText "What television symptoms are you having?";

q:hasOption d:TVSnothing, d:TVSnosound, d:TVStiling,

d:TVSreception.

d:TVSnothing a q:Answer;

q:answerText "No Picture".

d:TVSnosound a q:Answer;

q:answerText "No Sound".

d:TVStiling a q:Answer;

q:answerText "Tiling".

d:TVSreception a q:Answer;

q:answerText "Bad reception".

Consider an application for managing a questionnaire in a web portal. This application performs a query against this combined data to determine what question(s) to ask next. Then for each question, it presents the text of the question itself and the text of each answer, with a select widget (e.g., radio button) next to it. We haven’t yet defined enough information for such an application to work, and we have made no provisions to determine which questions to ask before any others or how to record answers to the questions. We start with the latter.

We first define a new property hasSelectedOption, a subproperty of hasOption:

q:hasSelectedOption a owl:ObjectProperty;

rdfs:subPropertyOf q:hasOption.

When the user who is taking a questionnaire answers a question, a new triple will be entered to indicate that a particular option for that question has been selected. That is, if the user selects “Cable TV” from the options of the first question d:WhatProblem, then the application will add the triple

d:WhatProblem q:hasSelectedOption d:STV.

to the triple store. Notice that there is no need to remove any triples from the triple store; the original q:hasOption relationship between d:WhatProblem and d:STV still holds. As we develop the example, the model will provide ever-increasing guidance for how the selection of questions will be done.
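To make the bookkeeping concrete, here is a minimal sketch in Python that treats the triple store as a plain set of (subject, predicate, object) tuples. It is an illustration only, not an RDF library; the function name record_selection and the SUBPROPERTY_OF table are our own hypothetical names. Recording a selection only adds triples, and the rdfs:subPropertyOf rule guarantees that q:hasOption holds for every selected option.

```python
# Illustration only: triples are plain tuples, and the names below are hypothetical.
SUBPROPERTY_OF = {"q:hasSelectedOption": "q:hasOption"}

def record_selection(store, question, answer):
    """Return a new store with the selection triple and its subproperty consequences."""
    new_store = set(store)
    new_store.add((question, "q:hasSelectedOption", answer))
    # rdfs:subPropertyOf rule: (s, p, o) and p subPropertyOf q  =>  (s, q, o)
    for s, p, o in list(new_store):
        if p in SUBPROPERTY_OF:
            new_store.add((s, SUBPROPERTY_OF[p], o))
    return new_store

store = {
    ("d:WhatProblem", "q:hasOption", "d:STV"),
    ("d:WhatProblem", "q:hasOption", "d:SInternet"),
    ("d:WhatProblem", "q:hasOption", "d:SBoth"),
}
updated = record_selection(store, "d:WhatProblem", "d:STV")
```

Every triple of the original store is still present in updated; the only addition is the q:hasSelectedOption triple.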

Adding “Restrictions”

The language construct in OWL for creating new class descriptions based on descriptions of the prospective members of a class is called the Restriction (owl:Restriction). An owl:Restriction is a special kind of class (i.e., owl:Restriction is a rdfs:subClassOf owl:Class). A Restriction is a class that is defined by a description of its members in terms of existing properties and classes.

In OWL, as in RDF, the AAA slogan holds: Anyone can say Anything about Any topic. Hence, the class of all things in OWL (owl:Thing) is unrestricted. A Restriction is defined by providing some description that limits (or restricts) the kinds of things that can be said about a member of the class. So if we have a property orbitsAround, it is perfectly legitimate to say that anything orbitsAround anything else. If we restrict the value of orbitsAround by saying that its object must be TheSun, then we have defined the class of all things that orbit around the sun (i.e., our solar system).

Kinds of Restrictions

OWL provides a number of restrictions, three of which are owl:allValuesFrom, owl:someValuesFrom, and owl:hasValue. Each describes how the new class is constrained by the possible asserted values of properties.

Additionally, a restriction class in OWL is defined by the keyword owl:onProperty. This specifies what property is to be used in the definition of the restriction class. For example, the restriction defining the objects that orbit around the sun will use owl:onProperty orbitsAround, whereas the restriction defining major league players will use owl:onProperty playsFor.

A restriction is a special kind of a class, so it has individual members just like any class. Membership in a restriction class must satisfy the conditions specified by the kind of restriction (owl:allValuesFrom, owl:someValuesFrom, or owl:hasValue), as well as the onProperty specification.

owl:someValuesFrom

owl:someValuesFrom is used to produce a restriction of the form “All individuals for which at least one value of the property P comes from class C.” For example, one could define the class AllStarPlayer as “All individuals for which at least one value of the property playsFor comes from the class AllStarTeam.” This is what the restriction looks like:

[a owl:Restriction;

owl:onProperty :playsFor;

owl:someValuesFrom :AllStarTeam]

Notice the use of the […] notation. As a reminder from Chapter 3, this refers to an anonymous node (a bnode) described by the properties listed here; that is, this refers to a single bnode, which is the subject of three triples, one per line (separated by semicolons).
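As a reminder of what the bracket notation expands to, the following Python sketch builds the three triples that share one blank node as their subject. The helper name restriction_triples and the generated label are hypothetical; real bnode labels are arbitrary, and the keyword a in the listing is shorthand for rdf:type.

```python
import itertools

_counter = itertools.count(1)

def restriction_triples(on_property, kind, target):
    """Return (bnode_label, triples) for one restriction written with [...]."""
    bnode = f"_:r{next(_counter)}"  # hypothetical label; any fresh label would do
    triples = [
        (bnode, "rdf:type", "owl:Restriction"),  # "a owl:Restriction"
        (bnode, "owl:onProperty", on_property),
        (bnode, kind, target),
    ]
    return bnode, triples

bnode, triples = restriction_triples(":playsFor", "owl:someValuesFrom", ":AllStarTeam")
for triple in triples:
    print(triple)
```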

The restriction class defined in this way refers to exactly the class of individuals that satisfy these conditions on playsFor and AllStarTeam. In particular, if an individual actually has some value from the class AllStarTeam for the property playsFor, then it is a member of this restriction class. Note that this restriction class, unlike those we’ve learned about in earlier chapters, has no specific name associated with it. It is defined by the properties of the restriction (i.e., restrictions on the members of the class) and thus it is sometimes referred to in the literature as an “unnamed class.”

EXAMPLE Answered Questions

In the questionnaire example, we addressed the issue of recording answers to questions by defining a property hasOption that relates a question to answer options and a subproperty hasSelectedOption to indicate those answers that have been selected by the individual who is taking the questionnaire. Now we want to address the problem of selecting which question to ask.

There are a number of considerations that go into such a selection, but one of them is that (under most circumstances) we do not want to ask a question for which we already have an answer. This suggests a class of questions that have already been answered. We will define the set of AnsweredQuestions in terms of the properties we have already defined. Informally, an answered question is any question that has a selected option.

An answered question is one that has some value from the class Answer for the property hasSelectedOption. This can be defined as follows:

q:AnsweredQuestion owl:equivalentClass

[a owl:Restriction;

owl:onProperty q:hasSelectedOption;

owl:someValuesFrom q:Answer].

Since

d:WhatProblem q:hasSelectedOption d:STV.

and

d:STV a q:Answer.

are asserted triples, the individual d:WhatProblem satisfies the conditions defined by the restriction class. That is, there is at least one value (someValue) for the property hasSelectedOption that is in the class Answer. Individuals that satisfy the conditions specified by a restriction class are inferred to be members of it. This inference can be represented as follows:

d:WhatProblem a [a owl:Restriction;

owl:onProperty q:hasSelectedOption;

owl:someValuesFrom q:Answer]

and thus, according to the semantics of equivalentClass,

d:WhatProblem a q:AnsweredQuestion.

These definitions and inferences are shown in Figure 9-3.
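The membership test behind this inference can be sketched in a few lines of Python. This is a simplified illustration over tuples, not an OWL reasoner; the function name some_values_from is our own, and the extra d:TVsymptom triples are included only to show a question that is not inferred to be answered.

```python
def some_values_from(triples, on_property, target_class):
    """Subjects with at least one on_property value that is typed as target_class."""
    typed = {s for s, p, o in triples if p == "rdf:type" and o == target_class}
    return {s for s, p, o in triples if p == on_property and o in typed}

triples = {
    ("d:WhatProblem", "q:hasSelectedOption", "d:STV"),
    ("d:STV", "rdf:type", "q:Answer"),
    # d:TVsymptom has an option but no selected option
    ("d:TVsymptom", "q:hasOption", "d:TVStiling"),
    ("d:TVStiling", "rdf:type", "q:Answer"),
}

answered = some_values_from(triples, "q:hasSelectedOption", "q:Answer")
# owl:equivalentClass lets us state the result with the named class
inferred = {(q, "rdf:type", "q:AnsweredQuestion") for q in answered}
```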

owl:allValuesFrom

owl:allValuesFrom is used to produce a restriction class of the form “the individuals for which all values of the property P come from class C.” This restriction looks like the following:

[a owl:Restriction;

owl:onProperty P;

owl:allValuesFrom C]

FIGURE 9-3 Definition of q:AnsweredQuestion and the resulting inferences for d:WhatProblem. Since d:WhatProblem has something (d:STV) of type q:Answer on property q:hasSelectedOption, it is inferred (dotted line) to be a member of AnsweredQuestion.

The restriction class defined in this way refers to exactly the class of individuals that satisfy these conditions on P and C. If an individual x is a member of this allValuesFrom restriction, a number of conclusions can follow, one for each triple describing x with property P. In particular, every value of property P for individual x is inferred to be in class C. So, if individual MyFavoriteAllStarTeam (a member of the class BaseballTeam) is a member of the restriction class defined by owl:onProperty hasPlayer and owl:allValuesFrom StarPlayer, then every player on MyFavoriteAllStarTeam is a StarPlayer. For example, if MyFavoriteAllStarTeam hasPlayer Kaneda and MyFavoriteAllStarTeam hasPlayer Gonzales, then both Kaneda and Gonzales must be of type StarPlayer.

There is a subtle difference between someValuesFrom and allValuesFrom. Since someValuesFrom is defined as a restriction class such that there is at least one member of a class with a particular property, then it implies that there must be such a member. On the other hand, allValuesFrom technically means “if there are any members, then they all must have this property.” This latter does not imply that there are any members. This will be more important in later chapters.
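The difference mirrors the behavior of any and all over an empty collection. The Python sketch below checks the two conditions against a list of values that we assume to be complete; that assumption is ours and is exactly what OWL's open world does not grant, so this illustrates the meaning of the two conditions rather than how a reasoner infers membership. The function names are hypothetical.

```python
def satisfies_some(values, members):
    """At least one value is in the class: fails when there are no values."""
    return any(v in members for v in values)

def satisfies_all(values, members):
    """Every value is in the class: holds vacuously when there are no values."""
    return all(v in members for v in values)

star_players = {":Kaneda", ":Gonzales"}
full_roster = [":Kaneda", ":Gonzales"]
empty_roster = []  # assumption: we somehow know the team has no players at all

print(satisfies_some(full_roster, star_players), satisfies_all(full_roster, star_players))
print(satisfies_some(empty_roster, star_players), satisfies_all(empty_roster, star_players))
```

A team with no players satisfies the allValuesFrom condition but not the someValuesFrom condition.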

EXAMPLE Question Dependencies

In our questionnaire example, we might want to ask certain questions only after particular answers have been given. To accomplish this, we begin by defining the class of all selected answers, based on the property hasSelectedOption we have already defined. We can borrow a technique from Chapter 4 to do this. First, we define a class for the selected answers:

q:SelectedAnswer a owl:Class;

rdfs:subClassOf q:Answer.

We want to ensure that any option that has been selected will appear in this class. This can be done easily by asserting that

q:hasSelectedOption rdfs:range q:SelectedAnswer.

This ensures that any value V that appears as the object of a triple of the form

? q:hasSelectedOption V.

is a member of the class SelectedAnswer. In particular, since we have asserted that

d:WhatProblem q:hasSelectedOption d:STV.

we can infer that

d:STV a q:SelectedAnswer.

Now that we have defined the class of selected answers, we describe the questions that can be asked only after those answers have been given. We introduce a new class called EnabledQuestion; only questions that also have type EnabledQuestion are actually available to be asked:

q:EnabledQuestion a owl:Class.

When an answer is selected, we want to infer that certain dependent questions become members of EnabledQuestion. This can be done with a restriction, owl:allValuesFrom.

To begin, each answer potentially makes certain questions available for asking. We define a property called enablesCandidate for this relationship. In particular, we say that an answer enables a question if selecting that answer causes the system to consider that question as a candidate for the next question to ask:

q:enablesCandidate a owl:ObjectProperty;

rdfs:domain q:Answer;

rdfs:range q:Question.

In our example, we only want to ask a question about television problems if the answer to the first question indicates that there is a television problem:

d:STV q:enablesCandidate d:TVsymptom.

d:SBoth q:enablesCandidate d:TVsymptom.

That is, if the answer to the first question, “What system are you having trouble with?,” is either “Cable TV” or “Both,” then we want to be able to ask the question “What television symptoms are you having?”

The following owl:allValuesFrom restriction does just that: It defines the class of things all of whose values for q:enablesCandidate come from the class q:EnabledQuestion:

[a owl:Restriction;

owl:onProperty q:enablesCandidate;

owl:allValuesFrom q:EnabledQuestion]

Which answers should enforce this property? We only want this for the answers that have been selected. How do we determine which answers have been selected? So far, we only have the property hasSelectedOption to indicate them. That is, for any member of SelectedAnswer, we want it to also be a member of this restriction class. This is exactly what the relation rdfs:subClassOf does for us:

q:SelectedAnswer rdfs:subClassOf

[a owl:Restriction;

owl:onProperty q:enablesCandidate;

owl:allValuesFrom q:EnabledQuestion].

That is, the class SelectedAnswer is a subclass of the unnamed restriction class.

Let’s watch how this works, step by step. When the user selects the answer “Cable TV” for the first question, d:STV is inferred to have type SelectedAnswer, as shown earlier.

d:STV a q:SelectedAnswer.

However, because of the rdfs:subClassOf relation, d:STV is a member of the restriction class, that is, it has the restriction as its type:

d:STV a

[a owl:Restriction;

owl:onProperty q:enablesCandidate;

owl:allValuesFrom q:EnabledQuestion].

Any individual that is a member of this restriction necessarily satisfies the allValuesFrom condition; that is, any individual that it is related to by q:enablesCandidate must be a member of q:EnabledQuestion. Since

d:STV q:enablesCandidate d:TVsymptom.

we can infer that

d:TVsymptom a q:EnabledQuestion.

as desired. We have also asserted the same information for the answer d:SBoth:

d:SBoth q:enablesCandidate d:TVsymptom.

FIGURE 9-4 d:STV enablesCandidate d:TVsymptom, but it is also a member of a restriction on the property enablesCandidate, stipulating that all values must come from the class q:EnabledQuestion. We can therefore infer that d:TVsymptom has type q:EnabledQuestion.

We can see this inference and the triples that led to it in Figure 9-4. Restrictions are shown in the figures using a shorthand called the Manchester Syntax (named after its development at the University of Manchester). The shorthand summarizes a restriction using the keywords all, some, and has to indicate the restriction types owl:allValuesFrom, owl:someValuesFrom, and owl:hasValue, respectively. The restriction property (indicated in triples by owl:onProperty) is printed before the keyword, and the target class (or individual, in the case of owl:hasValue) is printed after the keyword. We see an example of an owl:allValuesFrom restriction in Figure 9-4. It is important to note that this is only a shorthand; all the information needed for inferences is expressed in RDF triples.

Since d:SBoth also enables the candidate d:TVsymptom, the same conclusion will be drawn if the user answers “Both” to the first question. If we were to extend the example with another question about Internet symptoms d:InternetSymptom, then we could express all the dependencies in this short questionnaire as follows:

d:STV q:enablesCandidate d:TVsymptom.

d:SBoth q:enablesCandidate d:TVsymptom.

d:SBoth q:enablesCandidate d:InternetSymptom.

d:SInternet q:enablesCandidate d:InternetSymptom.

FIGURE 9-5 Questions and the answers that enable them.

The dependency tree is shown graphically in Figure 9-5.
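The whole chain of inferences in this example, from a selected option to an enabled question, can be sketched as a small forward-chaining loop in Python. This is a simplified illustration with the two relevant rules hard-coded, not a general OWL reasoner; the function names infer and enabled_questions are hypothetical.

```python
DEPENDENCIES = {
    ("d:STV", "q:enablesCandidate", "d:TVsymptom"),
    ("d:SBoth", "q:enablesCandidate", "d:TVsymptom"),
    ("d:SBoth", "q:enablesCandidate", "d:InternetSymptom"),
    ("d:SInternet", "q:enablesCandidate", "d:InternetSymptom"),
}

def infer(triples):
    """Apply two hard-coded rules until no new triples appear."""
    facts = set(triples)
    while True:
        new = set()
        for s, p, o in facts:
            # Rule 1 (rdfs:range): q:hasSelectedOption rdfs:range q:SelectedAnswer
            if p == "q:hasSelectedOption":
                new.add((o, "rdf:type", "q:SelectedAnswer"))
            # Rule 2 (subClassOf + allValuesFrom): every q:enablesCandidate value
            # of a SelectedAnswer is an EnabledQuestion
            if p == "q:enablesCandidate" and (s, "rdf:type", "q:SelectedAnswer") in facts:
                new.add((o, "rdf:type", "q:EnabledQuestion"))
        if new <= facts:
            return facts
        facts |= new

def enabled_questions(selected_answer):
    """Questions enabled after the user picks one answer to d:WhatProblem."""
    selection = {("d:WhatProblem", "q:hasSelectedOption", selected_answer)}
    facts = infer(DEPENDENCIES | selection)
    return {s for s, p, o in facts if p == "rdf:type" and o == "q:EnabledQuestion"}

print(enabled_questions("d:SBoth"))
```

Selecting d:STV enables only d:TVsymptom, selecting d:SInternet enables only d:InternetSymptom, and selecting d:SBoth enables both.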

EXAMPLE Prerequisites

In the previous example, we supposed that when we answered one question, it made all of its dependent questions eligible for asking. Another way questions are related to one another in a questionnaire is as prerequisites. If a question has a number of prerequisites, all of them must be answered appropriately for the question to be eligible.

Consider the following triples that define a section of a questionnaire:

d:NeighborsToo a q:Question;

q:hasOption d:NTY, d:NTN, d:NTDK;

q:questionText "Are other customers in your building also experiencing problems?".

d:NTY a q:Answer;

q:answerText "Yes, my neighbors are experiencing the same problem.".

d:NTN a q:Answer;

q:answerText "No, my neighbors are not experiencing the same problem.".

d:NTDK a q:Answer;

q:answerText "I don't know.".

This question makes sense only if the current customer lives in a building with other customers and is experiencing a technical problem. That is, this question depends on the answers to two more questions, shown following. The answer to the first question (d:othersinbuilding) should be d:OYes, and the answer to the second question (d:whatIssue) should be d:PTech:

d:othersinbuilding

a q:Question;

q:hasOption d:ONo, d:OYes;

q:questionText

"Do you live in a multi-unit dwelling with other customers?".

d:OYes a q:Answer;

q:answerText "Yes.".

d:ONo a q:Answer;

q:answerText "No.".

d:whatIssue

a q:Question;

q:hasOption d:PBilling, d:PNew, d:PCancel, d:PTech;

q:questionText

"What can customer service help you with today?".

d:PBilling a q:Answer;

q:answerText "Billing question.".

d:PNew a q:Answer;

q:answerText "New account".

d:PCancel a q:Answer;

q:answerText "Cancel account".

d:PTech a q:Answer;

q:answerText "Technical difficulty".

A graphic version of these questions can be seen in Figure 9-6.

Challenge 22 How can we model the relationship between d:NeighborsToo, d:whatIssue, and d:othersinbuilding so that we will only ask d:NeighborsToo when we have appropriate answers to both d:whatIssue and d:othersinbuilding?

FIGURE 9-6 Questions about neighbors have two prerequisite questions.

We introduce a new property q:hasPrerequisite that will relate a question to its prerequisites:

q:hasPrerequisite

rdfs:domain q:Question;

rdfs:range q:Answer.

FIGURE 9-7 Some questions and their prerequisites.

We can indicate the relationship between the questions using this property:

d:NeighborsToo q:hasPrerequisite d:PTech, d:OYes.

This prerequisite structure is shown in graphical form in Figure 9-7.

Now we want to say that we will infer something is a q:EnabledQuestion if all of its prerequisite answers are selected. We begin by asserting that

[a owl:Restriction;

owl:onProperty q:hasPrerequisite;

owl:allValuesFrom q:SelectedAnswer]

rdfs:subClassOf q:EnabledQuestion.

Notice that we can use the restriction class just as we could any other class in OWL, so in this case we have said that the restriction is a subclass of another class. Any question that satisfies the restriction will be inferred to be a member of q:EnabledQuestion by this subclass relation. But how can we infer that something satisfies this restriction?

For an individual x to satisfy this restriction, we must know that every time there is a triple of the form

x q:hasPrerequisite y.

y must be a member of the class q:SelectedAnswer. But by the Open World assumption, we don’t know if there might be another triple of this form for which y is not a member of q:SelectedAnswer. Given the Open World assumption, how can we ever know that all prerequisites have been met?

The rest of this challenge will have to wait until we discuss the various methods by which we can (partially) close the world in OWL. The basic idea is that if we can say how many prerequisites a question has, then we can know when all of them have been selected. If we know that a question has only one prerequisite, and we find one that is satisfied, then it must be the one. If we know that a question has no prerequisites at all, then we can determine that it is an EnabledQuestion without having to check for any SelectedAnswers at all.
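The counting idea can be previewed with a small Python sketch. It rests on two assumptions that are ours and that OWL does not make by default: the parameter declared_count is a hypothetical stand-in for a statement of how many prerequisites a question has (the chapter has not yet introduced a construct for this), and differently named answers are taken to be distinct individuals. The function name is_enabled is also hypothetical.

```python
def is_enabled(triples, question, declared_count):
    """True once declared_count distinct prerequisites are known to be selected.

    Assumes declared_count is the exact number of prerequisites and that
    different names denote different individuals.
    """
    selected = {s for s, p, o in triples
                if p == "rdf:type" and o == "q:SelectedAnswer"}
    known = {o for s, p, o in triples
             if s == question and p == "q:hasPrerequisite"}
    return len(known & selected) >= declared_count

prerequisites = {
    ("d:NeighborsToo", "q:hasPrerequisite", "d:PTech"),
    ("d:NeighborsToo", "q:hasPrerequisite", "d:OYes"),
}
one_selected = prerequisites | {("d:PTech", "rdf:type", "q:SelectedAnswer")}
both_selected = one_selected | {("d:OYes", "rdf:type", "q:SelectedAnswer")}

print(is_enabled(one_selected, "d:NeighborsToo", 2))
print(is_enabled(both_selected, "d:NeighborsToo", 2))
```

With a declared count of zero, the check succeeds without looking at any selected answers, which matches the last case described above.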

owl:hasValue

The third kind of restriction in OWL is called owl:hasValue. As in the other two restrictions, it acts on a particular property as specified by owl:onProperty. It is used to produce a restriction whose description is of the form “All individuals that have the value A for the property P” and looks as follows:

[a owl:Restriction;

owl:onProperty P;

owl:hasValue A]

Formally, the hasValue restriction is just a special case of the someValuesFrom restriction, in which the class C happens to be a singleton set {A}.

Although it is “just” a special case, owl:hasValue has been identified in the OWL standard in its own right because it is a very common and useful modeling form. It effectively turns specific instance descriptions into class descriptions. For example, “The set of all planets orbiting the sun” and “The set of all baseball teams in Japan” are defined using hasValue restrictions.

EXAMPLE Priority Questions

Suppose that in our questionnaire, we assign priority levels to our questions. First we define a class of priority levels and particular individuals that define the priorities in the questionnaire:

q:PriorityLevel a owl:Class.

q:High a q:PriorityLevel.

q:Medium a q:PriorityLevel.

q:Low a q:PriorityLevel.

Then we define a property that we will use to specify the priority level of a question:

q:hasPriority

rdfs:range q:PriorityLevel.

We have defined the range of q:hasPriority but not its domain. After all, we might want to set priorities for any number of different sorts of things, not just questions.

We can use owl:hasValue to define the class of high-priority items:

q:HighPriorityItem owl:equivalentClass

[a owl:Restriction;

owl:onProperty q:hasPriority;

owl:hasValue q:High].

These triples are shown graphically in Figure 9-8. Note that where before we defined subclasses and superclasses of a restriction class, here we use owl:equivalentClass to specify that these classes are the same. So we have created a named class (q:HighPriorityItem) that is the same as the unnamed restriction class, and we can use this named class if we want to make other assertions or to further restrict the class.

We can describe Medium and Low priority questions in the same manner:

q:MediumPriorityItem owl:equivalentClass

[a owl:Restriction;

owl:onProperty q:hasPriority;

owl:hasValue q:Medium].

q:LowPriorityItem owl:equivalentClass

[a owl:Restriction;

owl:onProperty q:hasPriority;

owl:hasValue q:Low].

FIGURE 9-8 Definition of a HighPriorityItem as anything that has value High for the hasPriority property.

If we assert the priority level of a question, such as the following:

d:WhatProblem q:hasPriority q:High.

d:InternetSymptom q:hasPriority q:Low.

then we can infer the membership of these questions in their respective classes:

d:WhatProblem a q:HighPriorityItem.

d:InternetSymptom a q:LowPriorityItem.

We can also use owl:hasValue to work “the other way around.” Suppose we assert that d:TVsymptom is in the class HighPriorityItem:

d:TVsymptom a q:HighPriorityItem.

Then by the semantics of owl:equivalentClass, we can infer that d:TVsymptom is a member of the restriction class and must be bound by its stipulations. Thus, we can infer that

d:TVsymptom q:hasPriority q:High.

Notice that there is no stipulation in this definition to say that a HighPriorityItem must be a question; after all, we might set priorities for things other than questions. The only way we know that d:TVsymptom is a q:Question is that we already asserted that fact. In the next chapter, we will see how to use set operations to make definitions that combine restrictions with other classes.
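Both directions of the owl:hasValue inference can be sketched together in Python. This is an illustration over tuples with the three priority classes hard-coded in a hypothetical table PRIORITY_CLASSES; it is not a general reasoner.

```python
# Hypothetical lookup table standing in for the three equivalentClass definitions.
PRIORITY_CLASSES = {
    "q:High": "q:HighPriorityItem",
    "q:Medium": "q:MediumPriorityItem",
    "q:Low": "q:LowPriorityItem",
}

def infer_priority(triples):
    """Apply the hasValue restrictions in both directions."""
    class_to_value = {cls: value for value, cls in PRIORITY_CLASSES.items()}
    facts = set(triples)
    for s, p, o in triples:
        # Value asserted: infer class membership
        if p == "q:hasPriority" and o in PRIORITY_CLASSES:
            facts.add((s, "rdf:type", PRIORITY_CLASSES[o]))
        # Class membership asserted: infer the value
        if p == "rdf:type" and o in class_to_value:
            facts.add((s, "q:hasPriority", class_to_value[o]))
    return facts

asserted = {
    ("d:WhatProblem", "q:hasPriority", "q:High"),
    ("d:InternetSymptom", "q:hasPriority", "q:Low"),
    ("d:TVsymptom", "rdf:type", "q:HighPriorityItem"),
}
facts = infer_priority(asserted)
```

Note that nothing here infers that d:TVsymptom is a q:Question, in line with the remark above.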

CHALLENGE PROBLEMS

As we saw in the previous examples, the class constructors in OWL can be combined in a wide variety of powerful ways. In this section, we present a series of challenges that can be addressed using these OWL constructs. Often the application of the construct is quite simple; however, we have chosen these challenge problems because of their relevance to modeling problems that we have seen in real modeling projects.

Challenge: Local Restriction of Ranges

We have already seen how rdfs:domain and rdfs:range can be used to classify data according to how it is used. But in more elaborate modeling situations, a finer granularity of domain and range inferences is needed. Consider the following example of describing a vegetarian diet:

:Person a owl:Class.

:Food a owl:Class.

:eats rdfs:domain :Person.

:eats rdfs:range :Food.

From these triples and the following instance data

:Maverick :eats :Steak.

we can conclude two things:

:Maverick a :Person.

:Steak a :Food.

The former is implied by the domain information, and the latter by the range information.

Suppose we want to define a variety of diets in more detail. What would this mean? First, let’s suppose that we have a particular kind of person, called a Vegetarian, and the kind of food that a Vegetarian eats, which we will call simply VegetarianFood, as subclasses of Person and Food, respectively:

:Vegetarian a owl:Class;

rdfs:subClassOf :Person.

:VegetarianFood a owl:Class;

rdfs:subClassOf :Food.

Suppose further that we say

:Jen a :Vegetarian;

:eats :Marzipan.

We would like to be able to infer that

:Marzipan a :VegetarianFood.

but not make the corresponding inference for Maverick’s steak until someone asserts that he, too, is a vegetarian.

Challenge 23 It is tempting to represent this with more domain and range statements—thus:

:eats rdfs:domain :Vegetarian.

:eats rdfs:range :VegetarianFood.

But given the meaning of rdfs:domain and rdfs:range, we can draw inferences from these triples that we do not intend. In particular, we can infer

:Maverick a :Vegetarian.

:Steak a :VegetarianFood.

which would come as a surprise both to Maverick and the vegetarians of the world.

How can the relationship between vegetarians and vegetarian food be correctly modeled with the use of the owl:Restriction?

SOLUTION

We can define the set of things that only eat VegetarianFood using a restriction, owl:allValuesFrom; we can then assert that any Vegetarian satisfies this condition using rdfs:subClassOf. Together, it looks like this:

:Vegetarian rdfs:subClassOf

[a owl:Restriction;

owl:onProperty :eats;

owl:allValuesFrom :VegetarianFood].

Let’s see how it works. Since

:Jen a :Vegetarian.

we can conclude that

:Jen a [a owl:Restriction;

owl:onProperty :eats;

owl:allValuesFrom :VegetarianFood].

Combined with the fact that

:Jen :eats :Marzipan.

we can conclude that

:Marzipan a :VegetarianFood.

as desired. How does Maverick fare now? We won’t say that he is a Vegetarian but only, as we have stated already, that he is a Person. That’s where the inference ends; there is no stated relationship between Maverick and Vegetarian, so there is nothing on which to base an inference. Maverick’s steak remains simply a Food, not a VegetarianFood.

The entire model and inferences are shown in Figure 9-9.
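The contrast between the tempting global domain and range statements and the local restriction can be sketched in Python. Each function hard-codes one of the two models over plain tuples; the function names are hypothetical, and this is an illustration rather than a general reasoner.

```python
def with_global_domain_and_range(triples):
    """Tempting model: :eats rdfs:domain :Vegetarian; rdfs:range :VegetarianFood."""
    facts = set(triples)
    for s, p, o in triples:
        if p == ":eats":
            facts.add((s, "rdf:type", ":Vegetarian"))
            facts.add((o, "rdf:type", ":VegetarianFood"))
    return facts

def with_local_restriction(triples):
    """Solution: :Vegetarian subClassOf (:eats allValuesFrom :VegetarianFood)."""
    facts = set(triples)
    for s, p, o in triples:
        # The range applies only when the eater is known to be a Vegetarian
        if p == ":eats" and (s, "rdf:type", ":Vegetarian") in triples:
            facts.add((o, "rdf:type", ":VegetarianFood"))
    return facts

data = {
    (":Maverick", ":eats", ":Steak"),
    (":Jen", "rdf:type", ":Vegetarian"),
    (":Jen", ":eats", ":Marzipan"),
}
global_facts = with_global_domain_and_range(data)
local_facts = with_local_restriction(data)
```

Under the global statements the steak is classified as :VegetarianFood; under the restriction only the marzipan is.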

Challenge: Filtering Data Based on Explicit Type

We have seen how tabular data can be used in RDF by considering each row to be an individual, the column names as properties, and the values in the table as values. We saw sample data in Table 3-10, which we repeat as Table 9-1. Some sample triples from this data are shown in Table 9-2.

Each row from the original table appears in Table 9-2 as an individual in the RDF version. Each of these individuals has the same type, namely mfg:Product, from the name of the table. This data includes only a limited number of possible values for the “Product_Line” field, and they are known in advance (e.g., “Paper machine,” “Feedback line,” “Safety Valve,” etc.).

FIGURE 9-9 Definition of a Vegetarian as a restriction on what the person eats.

A more elaborate way to import this information would be to still have one individual per row in the original table but to have rows with different types depending on the value of the Product Line column. For example, the following triples (among others) would be imported:

mfg:Product1 rdf:type ns:Paper_Machine.

mfg:Product4 rdf:type ns:Feedback_Line.

mfg:Product7 rdf:type ns:Monitor.

mfg:Product9 rdf:type ns:Safety_Valve.

This is a common situation when actually importing information from a table. It is quite common for type information to appear as a particular column in the table. If we use a single method for importing tables, all the rows become individuals of the same type. A software-intensive solution would be to write a more elaborate import mechanism that allows a user to specify which column should specify the type. A model-based solution would use a model in OWL and an inference engine to solve the same problem.

Challenge 24 Build a model in OWL so we can infer the type information for each individual, based on the value in the “Product Line” field but using just the simple imported triples described in Chapter 3.

SOLUTION

Since the classes of which the rows will be members (i.e., the product lines) are already known, we first define those classes:

ns:Paper_Machine rdf:type owl:Class.

ns:Feedback_Line rdf:type owl:Class.

ns:Active_Sensor rdf:type owl:Class.

ns:Monitor rdf:type owl:Class.

ns:Safety_Valve rdf:type owl:Class.

Each of these classes must include just those individuals with the appropriate value for the property mfg:Product_Product_Line. The class constructor that achieves this uses an owl:hasValue restriction, as follows:

ns:Paper_Machine owl:equivalentClass

[a owl:Restriction;