Sei sulla pagina 1di 3

NITK Surathkal

Department of Computer Science & Engineering


Course Plan
Name of the Course: Data Course No: CO461 No. of Credits (L-T-P): 3(3-0-0)
Warehousing and Data Mining
Year : 2017 Course Type: Program Specific Academic Session: ODD
Semester: VII Elective(PSE)
Section:-

Prerequisites (if any):None

Name and Contact Details of Course Instructor:


Dr.M.Venkatesan, venkisakthi@nitk.edu.in

Evaluation Scheme: Project - 30%, Mid Sem - 30%, Final Exam - 40%.

Course Objectives:

1. Understand the concept of data mining functionalities.


2. Understand the concepts of data warehouse and its architecture
3. To know the importance of data Pre-processing methods.
4. Study the classification and prediction algorithms
5. Understand the concept of clustering and its real time applications
6. Learn the various data mining techniques and related tools to handle real-time data sets.

Course (Learning) Outcomes (COs):

CO1 Able to know the basic concepts of data warehouse and OLAP operations
CO2 Handle the real-time data sets using various data pre-processing techniques
CO3 Apply predictive and descriptive modeling to analyze complex data
CO4 Able to use data mining techniques and tools to solve real time social problems.

Mapping of COs with POs:

(Strength of correlation: S-Strong, M-Medium, W-Weak)


PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12
CO1 S S S M S M M W W S S S
CO2 S S S S S M W W W M S S
CO3 S S S S S S M M M S M S
CO4 S S S S S M S M S M S S
1. Teaching Learning Interaction:

L-T-P
Module Title Content
hours

M1 Introduction to Overview of Data Mining, Type of Data, Data Mining 6-0-0


Data Mining Classification, KDD Process, Data Mining Functionalities-Real
Time Case Studies.

M2 Data Pre- Need of data pre-processing, Data Cleaning, Missing Value 7-1-0
processing analysis, Handling noisy data using binning methods, Data
Integration and its issues, Data redundancy ,Data normalization
and Data Reduction.

M3 Association Support, Confidence, Apriori Algorithm, FP-growth algorithm- 8-2-0


Rule Mining Eclat method.

M4 Classification Classification Model-Bayesian, Decision Tree,SVM,KNN, 10-2-0


and Prediction Prediction, Regression, Linear, Nonlinear, Multiple Linear and
Logistic Regression

M5 Clustering and Clustering issues, Types of data, Similarity measures, Clustering 9-2-0
Outlier analysis methods, Partition based clustering-K means, Hierarchical
Clustering, Outlier analysis-distance based outlier and density
based outlier techniques.

M6 Data OLTP vs OLAP, Characteristics of Data warehouse, Multi- 8-2-0


Warehouse dimensional data model, Star schema, Snow flake schema,
Concept hierarchy, data warehouse architecture, OLAP
operations, OLAP Models-ROLAP,MOLAP,HOLAP.

Topics beyond syllabus/Advanced Topics (if any): Deep Learning

Gaps in the Syllabus (if any)

2. List of Text Books & Reference Books, On-line Course Resources:

1. Raph Kimball, "Data Warehouse Toolkit", John Wiley and Sons Publications.

2. Jiawei Han, Micheline Kamber, Jian Pei, Data Mining Concepts and Techniques, Third
Edition, Morgan Kaufmann Publishers.

3.Michael. J. Berry, Gordon Linoff, "Data Mining Techniques: Marketing, Sales, Customer
support", John Wiley and Sons
NPTEL Courses (http://www.nptel.ac.in):

Introduction to Machine Learning- Prof-Balaraman Ravindran-IIT Madras

Introduction to Data Analytics -Prof Nandan Sudarsanam and ProfBalaraman Ravindran IIT Madras

Coursera course:

Data Mining- https://www.coursera.org/specializations/data-mining

3. Suggested list of Assignments / home works /problems/ ANY OTHER :

a. Design Multi-dimensional data model using Star Schema and Snowflake schema
b. Explain various OLAP Operations.
c. State the current research issues in data mining.
d. Collect data from Water pollution department and standardise the data using data
normalization techniques
e. Apply frequent pattern mining to find the frequent items from the big bazar transaction data
f. Develop Classification model to classify students based upon their performance.
g. Design and develop a prediction model for Mangalore rainfall forecasting..

4. Laboratory Instructions (if any) : Implementation of Data Mining Techniques using


RapidMiner Tool

5. Assessment Pattern (Use Blooms Taxonomy to design rubrics for evaluating student
performance)

Level Knowledge Assessment


No. Level Evaluation Component (%)
Project (30%) Mid Sem (30%) Final Exam
(40%)
K1 Remember 0% 10% 10% 7
K2 Understand 10% 20% 15% 15
K3 Apply 30% 25% 25% 26.5
K4 Analyse 20% 20% 25% 22
K5 Evaluate 20% 15% 15% 16.5
K6 Create 20% 10% 10% 13
100%

Name and Signature of Course Instructor: M.Venkatesan

HOD Signature:

Potrebbero piacerti anche