Semantic Web-Module1
Semantic Web-Module1
Module 1
CONTENTS ( SYLLABUS)
The Future of the Internet:
Introduction
The Syntactic Web
The Semantic Web
How the Semantic Web Will Work.
Classifying Ontologies
Web Ontologies
Web Ontology Description Languages
2
Ontology - Categories and Intelligence.
THE FUTURE OF THE INTERNET
Introduction
The Syntactic Web
The Semantic Web
How the Semantic Web Will Work.
3
WHERE WE ARE TODAY: THE SYNTACTIC WEB
[Goble 03]
IMPOSSIBLE(?) VIA
THE SYNTACTIC WEB…
WWW2002
The eleventh international world wide web conference
Sheraton waikiki hotel
Honolulu, hawaii, USA
7-11 may 2002
1 location 5 days learn interact
Registered participants coming from
australia, canada, chile denmark, france, germany, ghana, hong kong,
india, ireland, italy, japan, malta, new zealand, the netherlands, norway,
singapore, switzerland, the united kingdom, the united states, vietnam,
zaire
Register now
On the 7th May Honolulu will provide the backdrop of the eleventh
international world wide web conference. This prestigious event …
Speakers confirmed
Tim berners-lee
8
WHAT INFORMATION CAN A MACHINE SEE…
WWW2002
The eleventh international world wide web conference
Sheraton waikiki hotel
Honolulu, hawaii, USA
7-11 may 2002
1 location 5 days learn interact
Registered participants coming from
australia, canada, chile denmark, france, germany, ghana, hong kong, india,
ireland, italy, japan, malta, new zealand, the netherlands, norway,
singapore, switzerland, the united kingdom, the united states, vietnam,
zaire
Register now
On the 7th May Honolulu
9
SEMANTIC WEB
Semantic Web will overcome such the problems by
making the Web not only human-understandable but
also machine-understandable.
Semantic Web is a vision of information that is
understandable by computers, so that they can
perform more of the tedious works involved in
finding, sharing, and combining information on the
Web.
10
Tim
Berners-Lee expressed the vision of the
Semantic Web as follows:
I have a dream for the Web [in which computers] become
capable of analyzing all the data on the Web – the content, links,
and transactions between people and computers. A ‘Semantic
Web’, which should make this possible, has yet to emerge, but
when it does, the day-to-day mechanisms of trade, bureaucracy,
and our daily lives will be handled by machines talking to
machines. The ‘intelligent agents’ people have touted for ages
will finally materialize.
11
SEMANTIC WEB
It derives from W3C1 director Sir Tim Berners-Lee's vision
of the Web as a universal medium for data, information, and
knowledge exchange.
12
1. W3C (World Wide Web Consortium) is a forum for information, commerce, communication,
and collective understanding. W3C was founded in Oct. 1994.
SEMANTIC WEB (CONT’D)
Statements are built with syntax rules
The syntax of a language defines the rules for building the language
statements. But how can syntax become semantic?
13
SEMANTIC WEB (CONT’D)
Semantic Web (SW), proposed by W3C, is one of the most
promising and accepted approaches to make the Web
content becomes more machine-readable so that intelligent
agents can retrieve and process information readily [Dong
and Dan 05].
14
SEMANTIC WEB: SOME DEFINITIONS
The Semantic Web is not a separate Web but an extension of the
current one, in which information is given well-defined meaning,
better enabling computers and people to work in cooperation
(Berners-Lee, Hendlers, J. & Lassila, O., 2001)
“The Semantic Web is a vision: the idea of having data on the Web
defined and linked in a way that it can be used by machines not just
for display purposes, but for automation, integration and reuse of
data across various applications (W3C, 2003)
“Soon it will be possible to access the Web resources by content
rather than just by keywords (Anutariya et al, 2001)
15
SEMANTIC WEB (SW): DEFINITIONS
16
A SEMANTIC WEB — FIRST STEPS
17
WHY IS THERE A NEED FOR THE SEMANTIC WEB?
Knowledge Management
Withthe large number of documents made available online by
organizations, several document management systems have
entered the market. However, these systems have severe
weaknesses [Fensel et al. 03]:
Searching information: Existing keyword-based search retrieves
irrelevant information that uses keyword in a context other than the one
in which the searcher is interested.
Extracting information: Human browsing and reading is currently
mapping.
19
WHY IS THERE A NEED FOR THE SEMANTIC WEB?
(CONT’D)
Web Commerce
Very early on in B2C (Business to Customer) development, shopbots
were developed that visit several stores, extract product information,
and present to the customer an instant market overview.
Their functionality is provided via wrappers written for each online
store. Such wrappers use a keyword search together with
assumptions on regularities in the presentation format of stores’ Web
sites and text extraction heuristics, to find information about the
requested product and return it to the customer.
20
WHY IS THERE A NEED FOR THE SEMANTIC WEB?
(CONT’D)
22
Themes related to the Semantic Web [Breitman et al. 07]
• Metadata
Concepts • Ontologies
• Ontology Languages
• Web Services
Semantic
Web
Applications Technologies
• Methodologies for
• Software agents
Ontology development
• Semantic desktop • Tools for Ontology
• Geospatial semantic web 23
development
• Ontology sources
HOW WILL THE SEMANTIC WEB WORK?
Metadata
Metadata is “data about data”
They serve to index Web pages and Web sites in
the Semantic Web
Allowing other computers to acknowledge what
a Web page is about
24
HOW WILL THE SEMANTIC WEB WORK?
Ontologies
In computer science, ontologies were adopted in
AI to facilitate knowledge sharing and reuse
[Fensel 01]
Becoming widespread in areas:
Intelligent information integration
Cooperative information systems
E-commerce
25
How Will the Semantic Web Work?
Ontologies
Ontology Languages
Designed to define ontologies
They are sometimes called:
Lightweight ontology languages
Web-based ontology languages
27
HOW WILL THE SEMANTIC WEB WORK?
Web Services
Web services will be greatly improved if
semantics is added to the present Web resources
Computer will be able to:
Make doctor appointments
Synchronize with our agenda
28
TOOLS FOR ONTOLOGY DEVELOPMENT
• Metadata
Concepts • Ontologies
• Ontology Languages
• Web Services
Semantic
Web
Applications Technologies
• Methodologies for
• Software agents
Ontology development
• Semantic desktop • Tools for Ontology
• Geospatial semantic web 30
development
• Ontology sources
HOW WILL THE SEMANTIC WEB WORK?
Applications of Semantic Web
Personal Agent in Semantic Web
Responsible for capturing user preferences, searching for
information on available resources, etc. to provide answer
that meet a user’s query
Semantic desktop application
Ontology applications in Art
Cataloguing online cultural heritage or online museums
The Hermitage Museum Web site
www.hermitagemuseum.org
31
ONTOLOGY IN COMPUTER SCIENCE
32
ONTOLOGY IN COMPUTER SCIENCE
Natural
languages are not suitable for building models in
computer science, because they are too ambiguous.
33
ONTOLOGY IN COMPUTER SCIENCE
Formal Languages (FL : FOL)
Have strict mathematical definitions
Defined as a (possibly infinite) set of strings.
words.
Grammar
A finite set of rules that specifies a language
FL have an official grammar, specified in manuals or books.
Artificial Intelligence- A Modern Approach, 2nd Edition, by Russell, S., and Norvig,
P., 1995, Prentice Hall, pp. 791-792
ONTOLOGY IN COMPUTER SCIENCE
Formal Languages
A finite set of signs (alphabet) and a finite set of production rules (grammar)
produces an infinite set of expressions or sentences that define the
language.
Producing a syntactic correct language does not mean that one has
captured the meaning and sense of the sentences of a given language.
Ontology
Ontology means to bridge the “semantic gap” existing between the actual
syntactic representation of information (symbol) and its conceptualization
(concept).
“explicit” means that “the type of concepts used and the constraints on their
use are explicitly defined”;
“formal” refers to the fact that “it should be machine readable”;
“shared” refers to the fact that the knowledge represented in an ontology
are agreed upon and accepted by a group”;
“conceptualization” refers to an abstract model that consists the relevant
concepts and the relationships that exists in a certain situation
36
ONTOLOGY
39
ONTOLOGY
Cook?
You mean
chef
information about how to cook something,
or simply a place, person, business or some other entity with "cook" in its name.
42
MOTIVATION (2)
Inability to use the abundant information resources on the web
The WEB has tremendous collection of useful information however getting information
from the web is difficult.
Search engines are restricted to simple keyword based techniques. Interpretation of
information contained in web documents is left to the human user.
43
MOTIVATION (3)
Database-style queries are effective
45
BENEFITS OF ONTOLOGY
46
APPLICATION AREAS OF ONTOLOGIES
Information Retrieval
As a tool for intelligent search through inference mechanism instead
of keyword matching
Easy retrievability of information without using complicated Boolean
logic
Cross Language Information Retrieval
Improve recall by query expansion through the synonymy relations
Improve precision through Word Sense Disambiguation
(identification of the relevant meaning of a word in a given context
among all its possible meanings)
Information Integration
Seamless integration of information from different websites and
databases
Natural Language Processing
Better machine translation
Queries using natural language
47
APPLICATION AREAS OF ONTOLOGIES
Digital Libraries
Building dynamical catalogues from machine readable meta data
Automatic indexing and annotation of web pages or documents with
meaning
To give context based organisation (semantic clustering) of
information resources
Site organization and navigational support
Knowledge Engineering and Management
As a knowledge management tools for selective semantic access
(meaning oriented access)
Guided discovery of knowledge
48
THESAURI AND ONTOLOGY SIMILARITIES
Both serve the same purpose, namely to provide a shared conceptualisation about a
specific part of the world to different users in order to facilitate an efficient
communication of complex knowledge.
Both disciplines are based on concept systems representing highly complex knowledge
independent of any language.
Both are concerned about covering a broad range of terminology used in a particular
domain, and in understanding the relationships among these terms.
Both utilize a hierarchical organization to group terms into categories and subcategories.
(Source:http://www.xmluk.org/slides/magic-
circle_2002/wilson/XML_UK_SW_Thes/all.htm) 51
ONTOLOGY VS TAXONOMY
Taxonomy
Is a classification of terms in form of a hierarchy using typically a father-son
relationship (i.e. Type of)
Example The taxonomy of the leaving beings
Kingdom: Animalia
Filo: Chordata
Subfilo: Vertebrata
……
52
ONTOLOGY AS TAXONOMY
Taxonomy is a classification system where each node has only one parent
– simple ontology
Living Beings
Animals Plants
Vertebrates Invertebrates
53
STRUCTURE OF AN ONTOLOGY
54
A SIMPLE ONTOLOGY: ANIMALS
55
ONTOLOGY OF PEOPLE AND THEIR
ROLES
Employee Contractor
advises
funds
56
ONTOLOGY EXAMPLES
– Ontoclean
– Sumo
57
CLASSIFYING ONTOLOGIES
58
[Deborah McGuinness, Stanford]
CLASSIFYING ONTOLOGY BASED ON
SEMANTIC SPECTRUM
General
Formal Frames
Thesauri Logical
Is-a (properties)
constraints
Catalog/
ID
Disjointness,
Informal Formal Inverse, partof
Terms/
Is-a instance
glossary Value
restrictions
59
CLASSIFYING ONTOLOGY BASED ON DEGREE OF
CONCEPTUALIZATION
Top-level ontologies
describes very general notions which are independent of a particular problem or domain
are applicable across domains and includes vocabulary related to things, events, time, space,
etc
Domain ontologies
knowledge represented in this kind of ontologies is specific to a particular domain such as
forestry, fishery, etc.
They provide vocabularies about concepts in a domain and their relationships or about the
theories governing the domain.
60
CLASSIFYING ONTOLOGY BASED ON GENERALITY
[Guarino, 98]
Describe very general concepts like space, time, event, which are independent of a
particular problem or domain. It seems reasonable to have unified top-level
ontologies for large communities of users.
Describe the
vocabulary related Describe the
to a generic vocabulary
domain by related to a
specializing the generic task or
concepts introduced activity by
in the top-level specializing the
ontology. top-level
ontologies.
Heavy-weight Ontology
cardinality constraints
Taxonomy of relations
Axioms (restrictions)
Metaclasses
Type constraints on relations
63
Expressiveness
Inference systems
WEB ONTOLOGIES
Web Ontology Description Languages
64
ARCHITECTURE OF SEMANTIC WEB[Berners Lee
65
XML (EXTENSIBLE MARKUP
LANGUAGE)
Standard markup language to represent the user-defined
markup language
Simple, but flexible text-format defined from SGML
Advantages
Data representation
Structured & independent
Data sharing and interoperability
Hierarchical
Disadvantages
Lack of representation of relationship between objects
Lack of representation of data meaning
Lack of inheritance of meaning
66
RDF (RESOURCE DESCRIPTION FRAMEWORK)
Markup language based on XML syntax
Developed to representation the multiple, various resources
dispersed in the distributed web environment
Used as a basis for the other markup language
Disadvantages
Weak in the representation of semantic of data
67
RDF SCHEMA
RDF is a data model for objects and relations between
them
RDF Schema (RDFS) is a vocabulary description
language based on XML and logic programming.
Describes properties and classes of RDF resources
Provides semantics for generalization hierarchies of
properties and classes
68
OIL (ONTOLOGY INFERENCE LAYER)
Based on Frame-based System, Description Logic and
Web Languages
Advantages
Hierarchical extensions
Effective inference mechanism based on the Description
Logic
Well-defined semantics
Disadvantages
Impossible to define the default-value
Impossible to provide the meta-class
69
DAML (DARPA AGENT MARKUP
LANGUAGE)
Based on XML and RDF
Disadvantages
Can’t exclude the RDF and XML
Can’t be formal language
Less extensible compared with OIL
70
DAML+OIL
71
OWL (WEB ONTOLOGY LANGUAGE)
72
OWL (WEB ONTOLOGY LANGUAGE)
Relationships between classes
equivalentClass
subClassOf
Intersection, union, complement, disjunction
Properties of properties
Domain, Range
Cardinality
Transitive, Symmetric
allValuesFrom, someValuesFrom
Functional, InverseFunctional
74
EXPRESSIVENESS OF LANGUAGES
XOL
RDFS
SHOE
OML
OIL
DAML+OIL
Heavyweight
75