Chapter 11: Adding Semantics to Decision Tables: A New Approach to Data and Knowledge Engineering? (2/6)

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

304 ◾ Yan Tang, and Robert Meersman, and Jan Vanthienen

produce an accurate or nonfuzzy decision output. e imprecision inherent in

decision making is not noticed.” An FDT includes fuzziness in the condition and/or

action/decision segments. For example, in Table11.3, the condition entries for short,

about average, long, a little and the action stubs (possible blood deciency and possible

Qi deciency) are described with fuzzy words. e condition and/or action congura-

tion in an FDT is a value in [0, 1]. is approach brings fuzzy logic to decision tables.

11.3 Semantic Decision Tables

e semantic decision table (SDT) concept (Tang and Meersman 2007, Tang and

Meersman 2008, Tang et al. 2008) is introduced to deal with certain problems in

designing decision tables within a community (decision group):

1. Ambiguity in the information representation of the condition stubs and

action stubs. For example, Angelica X in Table11.1 is ambiguous because

Table11.2 Example of an SODT

Condition

Symptoms BD, QD, IN BD, QD BD, IN QD, IN BD QD IN

Decision

Donkey-hide

gelatin

Yes Yes Yes

Angelica X Yes Yes Yes Yes Yes Yes

Eel Yes Yes Yes Yes

BD = blood deﬁciency; QD = Q

deﬁciency; IN = indigestion.

Table11.3 Example of a Fuzzy Decision Table

Condition

Headache No A little

Sleep Good

About

average Bad Good

About

average Bad

Decision

Possible blood deﬁciency * *

Possible Qi deﬁciency * *

Adding Semantics to Decision Tables ◾ 305

Angelica can refer to many species such as Angelica atropurpurea and

Angelica arguta.

2. Conceptual duplication that occurs between conditions, actions, and decision

rules. For example, in Table11.1, the indigestion symptom may be included

among the symptoms of the Qi denition.

3. Uncertainty in the condition entries, e.g., possible blood deciency in

Table11.3 contains a certain level of uncertainty.

4. Diculties in managing large tables (also known as the structural scalability

problem).

5. Specication of hidden or meta decision rules. A hidden decision rule in

Table11.1 might be, “Donkey-hide gelatin is not easily digested; therefore, the

doctor should not prescribe it for a patient who has indigestion problems.”

We use domain ontologies to store semantics from an SDT. With OE technologies,

we can easily disambiguate the decision items, check the conceptual duplications

in a decision table, and specify the uncertainties of the condition entries. Just as an

ideal ontology is extensible and scalable, an SDT is easy and ready to be extended.

While constructing SDTs, hidden and implicit (meta) decision rules are specied

and stored. Since the SDTs are formal commitments based on agreement within a

decision group, they are easily shared.

An SDT is modeled in a three-layer format: (1) decision binary fact types called

SDT lexons; (2) a commitment layer containing constraints and axioms of the fact

types; and (3) decision tasks or applications. e three-layer format was based on the

principles of developing ontology-grounded methods and applications (DOGMA,

Meersman 2001, Spyns et al. 2002) and has served as the main research topic at the

VUB STARLab for 10 years.

An SDT lexon is a quintuple like (γ, disease, has, is of, symptom), in which γ is

the context identier and points to the documents in which disease and symptom are

dened; has and is of represent two roles. A lexon is a fact type.

An SDT commitment corresponds to an explicit instance of an intentional

interpretation for a decision task. Semantic decision rules (including ontological

constraints cited in some OE papers) are stored at this level. We primarily consider

uniqueness, mandatory nature, subset, equality, exclusion, subtype, occurrence,

and frequency when using SDT for data management.

An SDT is the result of annotating a decision table with ontologies. During the anno-

tation process, the decision makers must specify all the hidden rules, for example, one with

a mandatory constraint stating that “EACH disease has AT LEAST ONE symptom.”

is rule is written in semantic decision rule language (SDRule-L, Tang and Meersman

2009) as an SDT commitment: P1 = [disease, has, is of, symptom]: UNIQ (P1).

With these “extra” rules, we can benet from the reasoning advantages brought

forward by OE.

Among the many other interesting use cases of SDTs, one is to embed an SDT in

a process and separate decision rules from the process to improve system tractability

306 ◾ Yan Tang, and Robert Meersman, and Jan Vanthienen

(Tang and Meersman 2007). An important feasibility provided by SDTs is visual-

izing the results and decision rules in the form of decision tables when the decisions

are taken in every decision process. SDTs also increase exibility at the system

management level because the knowledge engineers can create dierent algorithms

and decision rules based on their needs.

Another interesting use is applying SDTs to visualize decision tables at dierent lev-

els that contain instance data at the lowest level and (meta) decision rules at higher levels

(Tang et al. 2008). e idea is to illustrate the meta decision rules of a decision table.

11.3.1 SDTs for Supporting Ontology-Based Data Matching

In the EC Prolix project,* an ontology-based data matching framework was required

to enhance a matching engine in the domains of human resource management and

e-learning. Note that ontology-based data matching is not the same problem as ontol-

ogy matching. e goal of the latter is to resolve semantic inconsistency while inte-

grating more than two ontologies. e goal of the former (the scope of our work) is

to nd the similarities between two data sets, each of which corresponds to one part

of the ontology. ere is only one ontology in that problem.

By considering this ontology as a connected network, the issue of ontology-

based data matching (ODM) can then be viewed as nding connections between

two subnetworks (Korn et al. 1996, Venkateswaran et al. 2006) and the related

work of nding and measuring such connections. Venkateswaran et al. (2006)

focus on nding data objects or web pages belonging to the same (or similar) con-

texts. Korn et al. (1996) used a two-phase method to discover data based on feature

distance. Barrett et al. (2008) illustrate an algorithm of nding the shortest path

between two nodes in a labeled and weighted network.

Our work does not focus on how to draw a boundary between searching

spaces like that of Venkateswaran et al. (2006). We calculate similarity scores

between two subnetworks (or nodes) based on the shortest path between the sub-

networks. We use SDTs to grant semantics to the labeled arcs and weights of the

ontology graph to provide an extra layer of conceptual analysis to a network. In

the next subsection, we illustrate an example of semantic matching in e-learning

and human resource management.

11.3.2 Human Resource Management Use Case

A human resource management (HRM) department uses text descriptions of com-

pany values (such as trustworthy, straightforward, and heart) to evaluate its employ-

ees. e results are recorded on evaluation forms. e training department uses

competency notations of skills and abilities to categorize utilization of learning

courses and materials. Each department uses its own supporting tools and terms.

http://www.prolixproject.org/

Adding Semantics to Decision Tables ◾ 307

To achieve better collaboration between departments and enhance the interopera-

bility of the applications across departments, we developed an HRM ontology with

which we annotate the company values and learning components. Each company

value or learning material corresponds to one subgraph in the same ontology.

Figure11.1 illustrates the use case by considering an ontology as a graph and

an annotation result set as a subgraph. e subgraph indicated by straightforward

& heart is the union of the annotation sets of straightforward and heart. e course

ITIl1 subgraph is the annotation set of ITIL1. Note that in our problem settings,

every item (company values and learning) is annotated with the domain ontology.

If a knowledge engineer cannot nd a proper dened concept, then he or she must

dene the new concept by executing the ontology creation methods listed earlier.

We now illustrate our approach based on this use case.

11.3.3 SDTs and GRASIM for Semantic Matching

Our matching strategy consists of the following steps: (1) label the arcs on an ontol-

ogy graph; (2) choose an algorithm for calculating the shortest paths; and (3) con-

vert the values of the shortest paths into a similarity score. SDTs are used to propose

weights to end users for labeling the arcs in step (1). e default weights are calcu-

lated based on the decision rules stored in the SDTs.

After an engineer generates default weights based on the SDTs, he can update

the weights if he is not satised with them. We also use the semantics in these SDTs

to restrict the boundaries of the weights modied by the engineers.

Table11.4 shows an SDT with which our semantic matching engine automati-

cally assigns weights on the ontology graph. Figure 11.2 shows an example for

applying the decision rule specied by column 2 of Table11.4. l

=〈γ, CV, describe,

is described by, Network engineer〉, l

=〈γ, Network engineer, is a supertype of, engi-

neer〉, l

∈

L ⊏ Ω, l

∈ Ω and l

∉L. According to the decision rule, the weight range

is [0, 50]. e engine can choose three dierent ways (pessimistic, optimistic, and

moderate) to assign the exact values as the weights.

If it takes the pessimistic action, the largest permitted values will be assigned

(see Table11.5). If it takes the optimistic action, the smallest permitted values

will be assigned (see Table11.6). e moderate action will lead to a set of aver-

ages from the two situations above. As noted, a user can update the weights if

he is unsatised with them. Table11.7 shows an SDT containing user-specied

weights. Note that not all the conditions from Table11.4 are used in Table11.7.

Users only need to specify the weights based on the conditions they consider

necessary to specify.

After the weights are assigned, we may choose any shortest path algorithm to

nd the shortest path for one node from one annotation subgraph (suppose it is G

)

to another node in the other annotation subgraph (suppose it is G

). It is executed

in step (2). In step (3), we convert the shortest path values into a similarity score

that must fall in the range of [0, 1].

308 ◾ Yan Tang, and Robert Meersman, and Jan Vanthienen

Straightforward & Heart

person

Interact with

context

Respect

/is respected by

Empathize

/is empathized with by

emotion

manage

/is managed by

communication

clarity

Simplicity

Has characteristic of

/is char. of

Has characteristic of

/is char. of

Course ITIL1

employee

Practice

service

function

role

Process model

Characteristic of process

Is listed

/by list

describe

/is described by

deﬁne

/is deﬁned by

distinguish

/is distinguished by

deﬁne

/is deﬁned by

deﬁne

is deﬁned by

describe

/is described by

Deal with

/is dealt with by

describe

/is described by

Figure 11.1 Company values (straightforward and heart) from the HRM department and learning course (ITIL1) from training/e-

learning department correspond to two sub-graphs of HRM ontology.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 11: Adding Semantics to Decision Tables: A New Approach to Data and Knowledge Engineering? (2/6)

Create new playlist

Sign In

Sign Up

Table of Contents for
Chapter 11: Adding Semantics to Decision Tables: A New Approach to Data and Knowledge Engineering? (2/6)