Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Organizing Information
Objective: finding a way through the overwhelming volume of material.
Approach: organize information into patterns with related items
brought together
Information collections used by many people are organized in a way
that correspond to the needs of most users, e.g.
the navigation on the intranet or on a homepage
the arrangement of books in libraries or bookshops
support users in searching
Information collections for individual use are also organized
the order of paper files reflects the way you normally use them
file management on a computer groups items according to their
shared characteristics, e.g. the nature of the item (software,
document, database etc) the project reference
Prof. Dr. Knut Hinkelmann
6 Classification Schemes
Classification
Classification is an organization means arranging information items
into classes - dividing the universe of information into manageable
and logical portions.
A class or category is a group of concepts that have something in
common. This shared property gives the class its identity.
Classifications may be designed for various purposes like
scientific classification
classification for information indexing and retrieval
A class may be further divided into smaller classes (or subclasses),
and so on, until no further subdivision is feasible. So classification is
likely to be hierarchic.
Source: UDConline
(http://www.udconline.net/)
6 Classification Schemes
books in a library or
directory on a computer
internet directory
online
Prof. Dr. Knut Hinkelmann
6 Classification Schemes
Flat Organisation:
categories arranged in a
hierarchical structure
no structure between
categories
6 Classification Schemes
Classification Schemes
The classification scheme can be
decided locally
represent a consensus
The greater the quantity or complexity of items, the more helpful it is
to follow a ready-made classification scheme, which represents a
consensus as to a helpful order of classes
Classification schemes may be either:
special, i.e. limited to a specific domain of interest; or
general, i.e. aiming to cover all subjects equally
('the universe of information').
Three most widely used general classification schemes are :
Dewey Decimal Classification (DDC)
Universal Decimal Classification (UDC)
Library of Congress Classification (LCC)
Prof. Dr. Knut Hinkelmann
6 Classification Schemes
GENERALITIES
GENERALITIES
PHILOSOPHY.
PHILOSOPHY. PSYCHOLOGY
PSYCHOLOGY
RELIGION.
RELIGION. THEOLOGY
THEOLOGY
SOCIAL
SOCIAL SCIENCES
SCIENCES
VACANT
VACANT
NATURAL
NATURAL SCIENCES
SCIENCES
TECHNOLOGY
TECHNOLOGY
THE
THE ARTS
ARTS
LANGUAGE.
LANGUAGE. LINGUISTICS.
LINGUISTICS.
LITERATURE
LITERATURE
99 GEOGRAPHY.
GEOGRAPHY. BIOGRAPHY.
BIOGRAPHY.HISTORY
HISTORY
Prof. Dr. Knut Hinkelmann
Computer
Computer science
science and
and technology
technology
Artificial
Artificialintelligence
intelligence
Artificial
Artificialintelligence
intelligence application
application
systems
systems
004.891
004.891 Expert
Expert systems
systems
004.891.2
004.891.2 Consultation
Consultation expert
expert systems
systems
http://www.udcc.org/
6 Classification Schemes
Applications of UDC
libraries
shelf arrangement
information retrieval
(classified catalogues)
collection management
(acquisition, circulation
statistics, weeding)
information services
selective dissemination of
information (user's profile
description)
Internet
subject gateways (information
presentation and navigation)
metadata (information
discovery)
As a source for building
knowledge domain maps
(ontologies), other indexing
languages (thesauri) and various
kinds of taxonomies and special
classifications
6 Classification Schemes
Notation
Most classification schemes, including UDC, have a notation - a code
that symbolizes the subject of each class and its place in the
sequence.
A simple list of named classes, which would file alphabetically, would
not fulfil the purpose of keeping related things together, and separated
from unrelated things.
This can be done by using a notation which has an inherent order,
such as numerals, alphabetic notation or a mixture (alphanumeric).
Notation with variable length can also express the position in the
hierarchy, with each extra character representing a lower level; this is
called expressive notation. Arabic numerals arranged as decimal
fractions are ideal for this purpose.
Decimal fractions also have the advantage of being infinitely
extensible, so it is always possible to introduce further subdivisions
without altering the ordinal value of the rest of the sequence. Such
notation is said to be hospitable.
Prof. Dr. Knut Hinkelmann
6 Classification Schemes
languages
legal aspects
management
measurement
performance
reliability
security
standardization
theory
verification
6 Classification Schemes
10
Classification codes
A.
A.
B.
B.
C.
C.
D.
D.
E.
E.
F.
F.
G.
G.
H.
H.
I.I.
J.
H.
J.
K.
K.
General
GeneralLiterature
Literature
Hardware
Hardware
Computer
ComputerSystems
SystemsOrganisation
Organisation
Software
Software
Data
Data
Theory
Theoryof
ofComputation
Computation
Mathematics
Mathematicsof
ofComputing
Computing
Information
Systems
Information Systems
Computer
ComputerMethodologies
Methodologies
Computer
Information
Systems
ComputerApplications
Applications
Computing
Milieux
Computing Milieux
H.0
H.0 General
General
H.1
H.1 Models
Modelsand
andPrinciples
Principles
H.2
Database
H.2 DatabaseManagement
Management
H.3
H.3 Information
InformationStorage
Storageand
andRetrieval
Retrieval
H.4
H.4 Information
InformationSystems
SystemsApplications
Applications
H.5
H.5 Information
InformationInterfaces
Interfaces&&Presentation
Presentation
H.m
H.mMiscellaneous
Miscellaneous
H.3
H.3Information
InformationStorage
Storageand
andRetrieval
Retrieval
H.3.0
General
H.3.0 General
H.3.1
H.3.1 Content
ContentAnalysis
Analysisand
andIndexing
Indexing
H.3.2
H.3.2 Information
InformationStorage
Storage
H.3.3
H.3.3 Information
InformationSearch
Searchand
andRetrieval
Retrieval
H.3.4
H.3.4 System
Systemand
andSoftware
Software
H.3.5
H.3.5 Online
OnlineInformation
InformationSystems
Systems
H.3.6
H.3.6 Library
LibraryAutomation
Automation
H.3.7
H.3.7 Digital
DigitalLibraries
Libraries
H.3.m
H.3.m Miscellaneous
Miscellaneous
11
6 Classification Schemes
H.3.3
H.3.3
Information
InformationStorage
Storageand
and
Retrieval
Retrieval
Clustering
Clustering
Information
Informationfiltering
filtering
Query
Queryformulation
formulation
Relevance
Relevancefeedback
feedback
Retrieval
Retrievalmodels
models
Search
Searchprocesses
processes
Selection
Selectionprocesses
processes
I.2.8
I.2.8Problem
ProblemSolving,
Solving,Control
ControlMethods,
Methods,and
andSearch
Search
Backtracking
Backtracking
Control
ControlTheory
Theory
Dynamic
DynamicProgramming
Programming
Graph
Graphand
andtree
treesearch
searchstrategies
strategies
Heuristic
Heuristicmethods
methods
Plan
Planexecution,
execution,formation,
formation,and
andgeneration
generation
Scheduling
Scheduling
Prof. Dr. Knut Hinkelmann
6 Classification Schemes
12
6 Classification Schemes
13
6 Classification Schemes
14
Dokument
Dokument
[Aspect:
[Aspect: Type]
Type]
report
report
correspondence
correspondence
invoice
invoice
presentation
presentation
[Aspect:
[Aspect: kind]
kind]
text
text
image
image
video
video
[Aspect:
[Aspect: Format]
Format]
ASCII
ASCII
XML
XML
doc
doc
ppt
ppt
txt
txt
Prof. Dr. Knut Hinkelmann
Monodimensional
Classification according to one
aspect
Polydimensional
Classification according to
different (independent) aspects
Documents may have different
classes
Each dimension corresponds to
one aspect/attribute
6 Classification Schemes
Products:
Europe
Europe
Switzerland
Switzerland
EU
EU
Eastern
EasternEurope
Europe
North
NorthAmerica
America
USA
USA
Canada
Canada
South
America
South America
Asia
Asia
China
China
India
India
Japan
Japan
Africa
Africa
health
healthinsurance
insurance
in-patient
in-patient
out-patient
out-patient
dental
dental
short-term
short-termdisability
disability
life
lifeinsurance
insurance
endowment
endowment
term
term
accident
accident
property
propertyinsurance
insurance
hausehold
hausehold
liability
liability
yy car
car
yy private
private
6 Classification Schemes
16
Index:
Document
Document type:
type:
report
report
Document
Document format:
format: MS
MS Word
Word
Product:
Product:
Life
Life -- annuity
annuity
Market:
Market:
Europa
Europa -- Switzerland
Switzerland
6 Classification Schemes
17