Sei sulla pagina 1di 5

Meeting the challenge of various layers of Semantic Web Architecture

1
Navneet Bhardwaj, 2Sanjay Kumar Malik but it represents a set of technologies that will work
1
B.Tech (IT) (8th) Sem, University School of Information equally well on internal corporate intranets. The
Technology, GGS Indraprastha University, Delhi use of semantic web technology has overcome the
2
Lecturer, University School of Information Technology, limitations of current information technology
GGS Indraprastha University, Delhi architectures. The three main problems in
1
navneet.it@gmail.com, 2sdmalik@hotmail.com
discussion are:
Overload of Information [3]. It is the
ABSTRACT
type of imperfection when the information
prescribed are above the normal level i.e.
The World Wide Web (WWW) has
rapid rate of growth in amount of
dramatically changed the availability of
information available.
electronically accessible information. The web
currently contains around 3 billion static Hardwired Systems [3]. These systems
documents, which are accessed by over 500 are also called as Stovepipe systems in
million users internationally. At the same time, which all the databases and syntaxes are
this enormous amount of data has made it designed to work together and thus
increasingly difficult to find, access, present and cannot be exposed to the systems and
maintain relevant information. This is because other organization.
information content is presented primarily in Poor aggregated content. It is putting
natural language. Thus, a wide gap has emerged together information from desperate
between the information available for tools sources in a number of areas.
aimed at addressing these problems and the A common development in semantic web
information maintained in human – readable language is important for semantic web
form. Tim Berners Lee has a two-part vision for envisioned in several languages. The
the future of the web. The first part is to make semantic web architecture [16] presented
the web a more collaborative medium. The by Tim Berners Lee shows which
second part is to make the web understandable languages of increasing power are layered
and thus processable by machines. Semantic on top of the other. Unfortunately the
web has transformed the prospects using old relationships between the adjacent layers
web language to new web language like XML, are not specified, either with respect to
RDF, RDF/XML, XMLS and web techniques, syntax and semantics.
which are used in information retrieval of the
web. This paper presents a brief analysis of 2. SEMANTIC WEB
semantic web technology emphasizing the Semantic refers to “meaning”. The semantic web is
challenges of various layers of semantic web an extension of the current web in which
architecture. information is given well-defined meaning, better
enabling computers and people to work in
Keywords: Semantic Web, information overload, cooperation. The Semantic Web is the abstract
ontologies, RDF, URI, XML, Logic, proof, trust. representation of data on the World Wide Web,
based on the RDF standards and other standards to
1. INTRODUCTION be defined. It is being developed by the World
Wide Web Consortium (W3C), in collaboration
In a short span of time the WWW has with a large number of researchers and industrial
revolutionized information, which is transferred partners.
between computer applications. That information is
not necessary for humans to explore its syntax and 3. FOUR STAGE PROCESS OF SEMANTIC
make parsers for flow of information transfer when WEB TECHNOLOGY
modern technological languages like XHTML,
XML and other other related languages are used Briefly presenting, the semantic web technology
[1]. The semantic web is an open dynamic network works as four-stage process, the process steps are
of independent information providers all having given below [3]:
different views of the world, different levels of 3.1 Pre-XML text and databases: In this first
knowledge and different intentions [2]. The stage of process, the data should be accurate and
semantic web is not just for the World Wide Web correct with respect to an application.

1
3.2 Single Domain of XML documents: In this Figure above shows the 9 layers architecture of
second stage the data passes the application semantic web in which the lowest layer starts from
independence in a specified domain. In this stage the bottom of the layer, followed by the highest
the data is intelligent enough to move between layer, which is at the topmost position. The various
applications in a single domain. These standards layers and its challenges are described below.
are adopted by several health care or real estate
industries. 4.1 UNICODE

3.3 Mixed vocabularies of Taxonomies and Unicode is the basic universal number for every
Data: In general taxonomy is the hierarchical character, which works in multiple platforms. It is
classification of data, in the form of tree. In the the basic notation, which is supported by top
third stage, data can be composed from multiple multinational companies like Motorola, IBM, Intel,
domains and accurately classified in a hierarchical etc. Unicode allows a single software, text or single
taxonomy. In fact, the classification can be used for character to be transported to other parts without
discovery of data. Simple relationships between corruption and re-engineering [4].
categories in the taxonomy can be used to relate
and thus combine data. Thus, data is now smart 4.2 URI (Uniform Resource Identifier)
enough to be easily discovered and sensibly
combined with other data. The URI is termed as Uniform Resource Identifier,
which is a basic syntax for strings that is used to
3.4 Ontologies and rules. The last stage explains identify a resource. URI is the generic term of
when new data can be inferred from existing data addresses and names of objects or resources in the
by following logical rules. In essence, data is now WWW. A resource is any physical or abstract
intelligent enough to be described with concrete things in which each item has an identifier. The
relationships and sophisticated formalisms where URI consists of two types: First is Uniform
logical calculations can be made on this “semantic Resource locators (URL) which identifies a
algebra”. This allows the combination and resource and how it can be accessed, and the
recombination of data at a more atomic level and second part is Uniform Resource Names (URN)
very fine-grained analysis of data. Thus in this that is used to create a universal and persistent
stage the data doesn’t represent in the form of name about a resource in its namespace. This
small part but it resembles into large structure namespace dictates the syntax of URN identifier
containing lots of information [3]. [3]. URN is also implemented in a specific syntax
and associated protocols. For example, the URI
As a consequence semantic web technology has syntax consists of only simple characters such as
been widely accepted in the information “mailto”, “ftp” etc.
technology branch with many research project and
industrial applications arisen from it. Now, we 4.3 XML (Extensible Markup Language)
briefly present the several layers of semantic web
architecture. The Extensible Markup Language (XML) is a
W3C-recommended general-purpose markup
4. SEMANTIC WEB ARCHITECTURE language that supports a wide variety of
applications. XML languages can be designed by
anyone and may be processed by conforming
software. XML is also designed to be reasonably
human-legible, and to this end, terseness was not
considered essential in its structure. XML is
evolved from simplified subset of Standard
Generalized Markup Language (SGML). Its main
task is to facilitate the sharing of data across
different information systems, particularly systems
connected via the Internet. XML is the simplest
way to send the document across the web to its
specific format. It allows users to edit or modify it
and again transfer it. These document formats can
Figure: Semantic web architecture include mark up also to enhance the meaning of the
http://www.w3.org/2000/Talks/1206-xml2k- document. XML is the primary and core
tbl/slide10-0.html

2
foundation of the semantic web will be built on the checked for things like legal element and attributes
top of XML. Scientifically, XML is built upon names, correct number of children, and required
Unicode characters and URI’s. The Unicode attributes. All XML documents should be checked
characters allow XML to be characterized using for validity before they are transferred to another
International characters [3]. partner or system [3]. There are different types of
XML Schema Languages [9]:
There are 10 design goals of extensible markup
language [8]: -
Document Definition Markup Language
(DDML)
1. XML should be easily and directly used from the
Document Schema Definition Languages
Internet.
(DSDL)
Document Structure Description (DSD)
2.XML should support wide variety of applications Document Type Definition (DTD)
and shall be capable in multiple platforms.
Namespace Routing Language (NRL)
RELAX NG and its predecessors RELAX
3. XML should work and cooperate with SGML. and TREX
SGML
4. XML should be easy to program and it shall be Schema for Object-Oriented XML (SOX)
re-programmable. Schematron
XML-Data Reduced (XDR)
5. Optional features in XML should be minimum; XML Schema (W3C) (WXS or XSD)
therefore it shall be ideally zero.
4.5 XML Namespaces
6. XML documents shall be human readable, and it
should be clear to understand. An XML namespace is the W3C recommendation
for providing uniquely named elements and all of
7. XML documents shall take less time to prepare its attributes in an XML instance. An instance of an
it. XML contains element or attribute names from
more than one vocabulary. If each vocabulary is
8. The design of XML shall be formal and concise. given a namespace then the uncertainity or what is
unexpected between identically named elements or
attributes can be resolved. All the elements which
9. XML documents shall be easy to create.
are within a namespace must be in unique
component [10].
10. Terseness in XML shall be of minimum
importance. 4.6 RDF and RDF Schema

4.4 XML Schema The RDF and RDF Schema layer is located above
the XML layer, which provides more functions and
XML Schema is a document definition language capabilities than in XML and XML schema.
that enables you to develop XML documents into a The Resource Description Framework is an XML
specific vocabulary and a specific hierarchical based language that is used to describe resources.
structure. The things you want to define in your Such a resource is identified via a uniform resource
language are element types, attribute types, and the locator (URL). As compared to XML documents
composition of both into composite types (called that attach the metadata of the document, RDF
complex types). XML Schema is different to a captures the metadata of the externals of the
database schema, which defines the column names document such as author, creation, date, etc. [3].
and data types in database tables. XML Schema RDF is the machine processable language unlike
has been approved by W3C consortium in the year XML which is human processable, and it is not
2001. XMLS allows the validation of instances to understandable by humans i.e. it know what the
ensure the accuracy of field values and document machine is doing in the way it does. So that it can
structure at the time of creation. The accuracy of store smart information back on the Web [6]. RDF
fields is checked against the type of the field; for model is also called as triple, because it contains
example, a quantity typed as an integer or money three parts viz. Subject, Predicate and Object. In
typed as a decimal. The structure of a document is Subject there is a none of phrase that is the source

3
of action. In Predicate, which is a part of triple edit up languages led to the development of new web
the subject and includes the verb phrase. And in the ontology languages such as OIL, DAML–ONT and
object, it is a noun phrase that is the source of DAML+OIL, OWL.
action by the verb [3]. OWL(Web Ontology Language): OWL is intended
In Semantic Web applications, and in most popular to be used when the information contained in
applications of RDF like RSS and FOAF (Friend of documents needs to be processed by applications,
a Friend resources which are represented by the as opposed to situations where the content only
URI denotes the actual and accurate data on the needs to be presented to humans. OWL can be used
World Wide Web. But RDF, in general, is not to explicitly represent the meaning of terms in
limited to the description of Internet-based vocabularies and the relationships between those
resources. In fact, the URI that names a resource terms. This representation of terms and their
does not have to be dereference able at all. For interrelationships is called ontology. OWL has
example, a URI that begins with "http:" and is used more facilities for expressing meaning and
as the subject of an RDF statement does not semantics than XML, RDF, and RDF-S, and thus
necessarily have to represent a resource that is OWL goes beyond these languages in its ability to
accessible via HTTP [11]. represent machine interpretable content on the
Web. OWL is a revision of the DAML+OIL web
RDFS or RDF Schema is an knowledge ontology language. OWL has been designed to
representation language, providing basic elements meet the requirements of RDF, RDFS, XML
for the description of ontologies, otherwise called Schema [13].
RDF vocabularies, intended to structure RDF
resources. The data model of RDF schema allows 4.8 Logic Layer
creating classes of data. A class is defined as group
of things with common characteristics. An object Logic layer in the semantic web is the universal
in the RDF schema is the instance of the class. The language of monotonic logic. In the logic layer of
first version was released by W3C in 1998, and the semantic web architecture any rule can export the
final version was released in 2004. Classes and code but cannot import it. Any system in the logic
subclasses of RDF Schema are described below layer can validate proofs [14].
[12]: This layer functions on the basic principle of first
order predicate logic [6], so the information is
rdfs:class : it is used to declare the resource of the displayed accurately on the web.
class
rdfs:subClassOf : it is used to declare the 4.9 Proof
attributes and hierarchies of the classes.
In this layer, the ultimate goal of semantic web is
rdfs:domain of an property declares the class of
to create a much smarter content which could be
the subject in a triple using this property as
predicate. understood by the machines. When the content is
rdfs:range of an RDF property that declares class understood by machine, some assertions may come
out of the content and new pieces of information
or data type of the object in triple using this
will be produced [15]. Unfortunately this layer has
property as predicate.
been not investigated enough and this lack of
investigation is not yet sufficiently considered to be
4.7 Ontology Vocabulary
a crucial problem.
The main layer of semantic web architecture is
Ontology vocabulary, which typically consists of 4.10 Trust
hierarchical distribution of important concepts in a
This is the top most layer of the semantic web
domain, along with descriptions of the properties
of each concept. Ontologies play a pivotal role in architecture in which the trustworthiness of
the semantic web by providing a source of shared information should be subjectively evaluated by
each information consumers. The trust does not
and precisely defined terms that can be used in
exclude information providers which have not been
metadata. The recognition of the key role in
rated or do not publish trust relevant information in
ontologies are likely to play in the future of the
a specific way. The trust layer in semantic web
web that has led to extension of web mark up
languages like XML Schema, RDF and RDF architecture is analyzed into four parts: The
Schema. The recognition of the limitations in mark Information integration layer handles
aggregation of information from different sources

4
and adds provenance metadata in the information.
The Repository Layer stores the aggregate [7] http://www.w3.org/TR/WD-xml-961114.html
information. The Query and Trust evaluation
layer handles the actual trust decisions using query [8] http://www.w3.org/TR/xml11/#sec-origin-goals
specific trust policies. The Application and
explanation layer on which the retrieved [9] http://en.wikipedia.org/wiki/XML_schema
information is used which an application context
which provides functionality to browse through [10] http://www.w3.org/TR/REC-xml-names/
explanations why data should be trusted [2].
[11] Resource Description Framework
4.11 Digital Signature http://en.wikipedia.org/wiki/Resource_Description
_Framework
Digital signatures run horizontal to the RDF family
up through the proof layer and support the notion [12] http://www.w3.org/TR/rdf-schema/
of trust. Developments in the area of digital
signatures are progressing, and could eventually [13] http://www.w3.org/TR/owl-features/
help validate the integrity of metadata that an agent
will use for reasoning and task completion. The [14] http://www.w3.org/2002/Talks/04-
main purpose of the digital signature is to digitally sweb/slide20-0.html
sign the document. Also encryption methods are
used to run on digital signed documents to prevent [15] A Min Tjoa, Amin, Ferial, Roland Wagner
against unauthorized access. “Semantic web Challenges and new requirements”,
16th International Workshop on database and expert
5. CONCLUSION systems (DEXA ’05), 1529-4188/05

A lot of work is being done in all layers but not [16] http://www.w3.org/2000/Talks/1206-xml2k-
being much done in Proof and Logic. tbl/slide10-0.html
This paper gives an idea of semantic web and an
analysis of various layers of semantic web
architecture which may assist to meet the various
challenges of layers of semantic web architecture.

6. REFERENCES

[1] Ian Horrocks, Peter Schneider “Three Theses of


Representation in the Semantic web”, WWW 2003,
May 20-24 2003, Hungary, ACM 1-58113-680-
3/03/0005

[2] Christian Bizer, Radoslaw Oldakowski


“Using Context and Content based trust policies on
the Semantic Web”, WWW 2004, May 17 – 22
2004, USA, ACM 1-58113-912-8/04/0005

[3] Michael Daconta, Leo Obrst, Kevin T. Smith


“The Semantic web: the guide to future of XML,
web services and knowledge management

[4] Unicode Information


http://www.unicode.org/standard/WhatIsUnicode.h
tml

[5] http://www.w3c.tut.fi/talks/2003/0331umedia-
on/slide6-0.html

[6] http://blogspace.com/rdf/SwartzHendler

Potrebbero piacerti anche