Sei sulla pagina 1di 3

h a n g e Pro h a n g e Pro

XC d XC d
F- F-

uc

uc
PD

PD
!

!
W

W
t

t
O

O
N

N
y

y
bu

bu
to

to
ww

ww
om

om
k

k
lic

lic
C

C
.c

.c
w

w
tr re tr re
.

.
ac ac
k e r- s o ft w a k e r- s o ft w a
Technology Data Mining Tutorial 03 - Data Mining Architecture

C Search...
H
A
P Home Articles
T
E
R
S
03 - Data Mining Architecture
TOC
O 02 - Data Mining - Real World Scenario 04 - Data Mining Processes
T
H
E Introduction
R
Data mining is a very important process where potentially useful and previously unknown
S information is extracted from large volumes of data. There are a number of components
U involved in the data mining process. These components constitute the architecture of a data
B mining system.
J Data Mining Architecture
E
The major components of any data mining system are data source, data warehouse server,
C
data mining engine, pattern evaluation module, graphical user interface and knowledge
T
base.
S
h a n g e Pro h a n g e Pro
XC d XC d
F- F-

uc

uc
PD

PD
!

!
W

W
t

t
O

O
N

N
y

y
bu

bu
to

to
ww

ww
om

om
k

k
lic

lic
C

C
.c

.c
w

w
tr re tr re
.

.
ac ac
k e r- s o ft w a
a) Data Sources
Technology Data Mining Tutorial 03 - Data Mining Architecture
k e r- s o ft w a

Database, data warehouse, World Wide Web (WWW), text files and other documents are
C the actual sources of data. You need large volumes of historical data for data mining to be
H successful. Organizations usually store data in databases or data warehouses. Data
A warehouses may contain one or more databases, text files, spreadsheets or other kinds of
P information repositories. Sometimes, data may reside even in plain text files or
T spreadsheets. World Wide Web or the Internet is another big source of data.
E
R Different Processes
S
The data needs to be cleaned, integrated and selected before passing it to the database or
data warehouse server. As the data is from different sources and in different formats, it
O cannot be used directly for the data mining process because the data might not be
T complete and reliable. So, first data needs to be cleaned and integrated. Again, more data
H than required will be collected from different data sources and only the data of interest
E needs to be selected and passed to the server. These processes are not as simple as we
R think. A number of techniques may be performed on the data as part of cleaning, integration
and selection.
S
b) Database or Data Warehouse Server
U
B The database or data warehouse server contains the actual data that is ready to be
J processed. Hence, the server is responsible for retrieving the relevant data based on the
E data mining request of the user.
C
T c) Data Mining Engine
S
The data mining engine is the core component of any data mining system. It consists of a
number of modules for performing data mining tasks including association, classification,
characterization, clustering, prediction, time-series analysis etc.

d) Pattern Evaluation Modules


The pattern evaluation module is mainly responsible for the measure of interestingness of
the pattern by using a threshold value. It interacts with the data mining engine to focus the
search towards interesting patterns.

e) Graphical User Interface


The graphical user interface module communicates between the user and the data mining
system. This module helps the user use the system easily and efficiently without knowing
the real complexity behind the process. When the user specifies a query or a task, this
module interacts with the data mining system and displays the result in an easily
understandable manner.

f) Knowledge Base
The knowledge base is helpful in the whole data mining process. It might be useful for
guiding the search or evaluating the interestingness of the result patterns. The knowledge
base might even contain user beliefs and data from user experiences that can be useful in
the process of data mining. The data mining engine might get inputs from the knowledge
h a n g e Pro h a n g e Pro
XC d XC d
F- F-

uc

uc
PD

PD
!

!
W

W
t

t
O

O
N

N
y

y
bu

bu
to

to
ww

ww
om

om
k

k
lic

lic
C

C
.c

.c
w

w
tr re tr re
.

.
ac ac
k e r- s o ft w a k e r- s o ft w a
Technology Data Mining Tutorial 03 - Data Mining Architecture
with the knowledge base on a regular basis to get inputs and also to update it.

C Summary
H
Each and every component of data mining system has its own role and importance in
A
completing data mining efficiently. These different modules need to interact correctly with
P
each other in order to complete the complex process of data mining successfully.
T
E
R
S

Find Flights to Islamabad from Fa… Find Flights to Dubai from Lahore
O
LEARN MORE LEARN MORE
T
H
E TOC
R 02 - Data Mining - Real World Scenario 04 - Data Mining Processes

S
U
B
J
E
C
T
S

Like us on Facebook

Like Page

Be the first of your friends to like this

Potrebbero piacerti anche