Sei sulla pagina 1di 12

`

R.V.S. COLLEGE OF ENGINEERING AND TECHNOLOGY


COIMBATORE – 641 402
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
ACADEMIC YEAR 2019-2020
LESSON PLAN

LT PC
3 00 3
Subject Name: DATA ANALYTICS
Subject Code : IT6006 CLASS / SEM: IV CSE/ VII
Staff In-Charge: BALAKIRUBA J

GENERAL OBJECTIVES:

The student will be able to:


• Be exposed to big data
• Learn the different ways of Data Analysis
• Be familiar with data streams
• Learn the mining and clustering
• Be familiar with the visualisation
UNIT I - INTRODUCTION TO BIG DATA

Unit wise objectives :

• To understand the concept of Big Data and its challenges

• To learn about evolution of analytic scalability ,analytical process and tools.

• To study about the statistical concepts.


Hours Teaching
Objective(s) of Resources
S.No Topic Require Methodology/
the Topic referred
d Techniques*

Introduction to Big
To introduce
Data Platform –
1. the concept of 1 Black Board
Challenges of R1
Big Data
conventional systems
To know the
Web data,Evolution evolution of
2. 1 Black Board R1
of Analytic scalability analytic
scalability

To study the
- Analytic process and
3. processes and 1 Black Board R1
tools used for
tools
analysis

To compare the
differences
4. Analysis vs reporting between 1 Black Board R1
Analysis and
Reporting
To study the
Modern data analytic modern tools
5. 1 Black Board T1
tools used for
analytics
To explain
Statistical concepts: about the
6. Sampling probability and 1 Black Board T1
distributions sampling
techniques

To study about
resampling, statistical Discussion/
7. the statistical 1 T1
inference Black Board
inference

To study the
concept of
8. prediction error 1 Black Board T1
prediction error
in analytics
Study on
conventional
9. Tutorial 1 Black Board T1
and modern
analytical tools

To study the
Content Beyond conventional
syllabus database and
10. 1 Black Board T1,R1
{Conventional compare with
Database System} modern
database

Total Hours 10

Learning Outcomes:

On learning this unit, the student should be able to:


• Recollect the concepts of Big Data
• Understand the difference between conventional and modern analytic tools.

Real – world applicability of the topics under this unit:


• Probability
• Hypothesis testing

Bridging with other subjects/ Applicability in learning other topics / subjects:


• Relational Database system
• SQL
Scope for extra learning / Assignments / Activities:
• Study on database management systems
UNIT- II
DATA ANALYSIS
Unitwise objectives :
• To understand the concept of data analysis.
• To study about the Bayesian networks.
• To study on Support vector and kernel methods.
• To introduce the concept of Neural Networks.
• To learn about the fuzzy logic system.

Hours Teaching
Objective(s) of Resources
S.No Topic Require Methodology/
the Topic referred
d Techniques *
1. To predict the
value
Regression modeling, of a variable
1
Multivariate analysis from the values T1
of other Black Board
variables

2. Bayesian modeling, To study the


concept of
Black Board T1
Bayesian 1
modeling
3. inference and To study the
Bayesian networks, concept of
T1
Bayesian 1 Black Board
networks

4. Support vector and To know the


kernel methods, support vector
and kernel Black Board T1,R1
1
method
concepts
5. To analyse the
Analysis of time
time series Discussion/
series: linear systems T1,R3
using linear 1 Black Board
analysis,
system analysis
6. nonlinear dynamics To study on non http://
linear www.tutorialsp
dynamics. oint.com/
1 Black Board
design_pattern/
proxy_pattern.h
tml

To understand
7. Rule induction the concept of
1 T1
rule induction. Black Board

Neural networks: To introduce


8. learning and the topic on
1 T1
generalization, neural networks Black Board

9. competitive learning To study the


competitive
1 Black Board T1
learning in
neural networks
10. principal component To understand
analysis and neural the principal
networks component
1 T1
analysis Black Board
algorithm.

11 Fuzzy logic: To introduce


extracting fuzzy the concept of
T1
models from data fuzzy logic.
1 Black Board

To learn the
fuzzy decision trees,
stochastic
12 Stochastic 1 Black Board T1
search
search methods.
methods

To study on
neural
13 Tutorial 1 Black Board T1
networks and
fuzzy logic.
Content Beyond To understand
14 syllabus the topic on 1 Black Board T1
{ Normalization} normalization

Total Hours 14
Learning Outcomes:

On learning this unit, the student should be able to:

Explain the neural network concept



Learned fuzzy logic techniques


Explain the Bayesian modeling technique.
Real – world applicability of the topics under this unit:

• Face recognition system using PCA algorithm

Bridging with other subjects/ Applicability in learning other topics / subjects:

• Artificial Intelligence

Scope for extra learning / Assignments / Activities:

Study on neural networks and fuzzy logic system.

UNIT- III
MINING DATA STREAMS

Unit wise objectives :

• To explain the concept data streams


• To introduce the mining technologies

Hours Teaching
Objective(s) of Resources
S.No Topic Require Methodology/
the Topic referred
d Techniques *

1. Introduction to
Streams Concepts To introduce
the stream
T1
concepts . Black Board

2. Stream To understand
data model and the architecture
architecture Black Board T1
of stream data
model
3. Stream Computing, To introduce
Sampling data in a the concept of 1 Black Board T1,R2
stream sampling data
4. Filtering streams To study the
concept of 1 Black Board T1
filtering
5. Counting distinct To study the
elements in a stream technique to
count distinct 1 Black Board T1,R1
elements in a
stream

6. Estimating moments To study about PPT / Craig Larman,


1
estimation Black Board T1
7 Counting oneness in a To study about T1, Prof.
window counting Dr.S.Srinath
oneness in a NPTEL Video/ Computer
1
window. Black Board Science and
Engineering
IIT-Bombay

8 Decaying window - To study the


Realtime Analytics concept of
1 Black Board T1
Platform(RTAP) decaying
applications windows
9 Case studies Case study on
mining 1 Black Board T1
databases

10 real time sentiment To study on


analysis, stock sentiment 1 Black Board T1
market predictions. analysis
11 Tutorial To study on
1 Black Board T1
stream mining

12 Content Beyond To study the


syllabus algorithms on Discussion/
1 T1
{ Introduction to data data mining Black Board
mining}
12
Total Hours
Learning Outcomes:

On learning this unit, the student should be able to:

• Learn the data mining concepts


• Explain filtering concepts and sentiment analysis.

Real – world applicability of the topics under this unit:

• Searching for data-google

Bridging with other subjects/ Applicability in learning other topics / subjects:

• Data mining
Scope for extra learning / Assignments / Activities:

• Study on stream concepts and prediction techniques.

UNIT- IV
FREQUENT ITEMSETS AND CLUSTERING

Unit wise objectives :

• To study the concept of mining and related algorithms


• To understand the concept of clustering.

Hours Teaching
Objective(s) Resources
S.No Topic Require Methodology/
of the Topic referred
d Techniques *

1. Mining Frequent itemsets To study


about
mining 1 T1,R2
Black Board
frequent
itemsets
2. Market based mode To
understand
the concept 1 Black Board T1
of market
based mode

3. Apriori Algorithm – To know the


Handling large data sets concept of
1 Black Board T1
in Main memory apriori
algorithm
4. Limited Pass algorithm To study the
limited pass 1 Black Board T1
algorithm
5. Counting frequent To study the
itemsets in a stream process of
counting
1 Black Board T1,R1
frequent
itemsets in a
stream
6. Clustering To introduce
Techniques ,Hierarchical the topic of 1 Black Board T1,R1
clustering

7. K- Means – Clustering To
high dimensional data understand
the concepts 1 Black Board T1
K-Means
algorithm
8. CLIQUE and PROCLUS To explain
the topic on
CLIQUE 1 Black Board T1
and
PROCLUS

9. Frequent pattern based To study on


clustering methods – different
1 Black Board T1
Clustering in non- clustering
euclidean space method.
10. Clustering for streams To
and Parallelism. understand
the
clustering 1 Black Board T1
for streams
and
parallelism

11. Tutorial To study on


1 Black Board T1
clustering
12. Content beyond syllabus To introduce
Discussion/
{Clustering algorithm} clustering in 1 T1
Black Board
data mining

Total Hours 12
Learning Outcomes:

On learning this unit, the student should be able to:

• Learn mining frequent itemsets


• Understand different clustering methods

Real – world applicability of the topics under this unit:
• Clustering in networks

Bridging with other subjects/ Applicability in learning other topics / subjects:

• Data mining clustering


Scope for extra learning / Assignments / Activities:

• Study on mining item sets

UNIT- V
FRAMEWORKS AND VISUALIZATION

Unit wise objectives :

• To understand the MapReduce technique and different Big Data tools


• To study the visualization techniques.
• To know about Hadoop Distributed File System.

Hours Teaching
Objective(s) of Resources
S.No Topic Require Methodology/
the Topic referred
d Techniques *
To study about
1. the MapReduce
1 Black Board T1
programming
MapReduce model

2. To understand T1
the Hadoop 1 Black Board
Hadoop, Hive, MapR technology
3. To study the T1
concept of 1 Black Board
Sharding sharding

4. To explain the T1
NoSQL 1 Black Board
NoSQL Databases databases
5. To study the Discussion/ T1
1
S3 concept of S3 Black Board
6 To know about T1
the Hadoop
Hadoop 1 Black Board
Distributed File
Distributed file systems System
7 To understand T1
the
1 Black Board
visualization of
Visualizations data.

8. To understand Black Board T1


the concept of
visual data 1
Visual data analysis analysis
techniques technique.
9. Ti study about Black Board T1
interaction 1
interaction techniques technique.

10. To understand Black Board T1


Systems and the applications 1
applications and systems
Study on
11. Tutorial Hadoop File 1 Black Board T1
system.

12 Content beyond To learn the Discussion/


1 T2
syllabus {SQL} SQL. Black Board

Total Hours 12
On learning this unit, the student should be able :

• understand the MapReduce technique.


• Explain the Hadoop File System

Real – world applicability of the topics under this unit:

• Google
• SQL Databases

Bridging with other subjects/ Applicability in learning other topics / subjects

• Prerequisite for subjects like Relational Databases

Scope for extra learning / Assignments / Activities:

• Study on Hadoop File System

Total Hours:60 Lecture Hours:55 Tutorial:5

TEXT BOOKS:

1. Michael Berthold, David J. Hand, Intelligent Data Analysis, Springer, 2007.

2. Anand Rajaraman and Jeffrey David Ullman, Mining of Massive Datasets,Cambridge
University Press, 2012.


REFERENCES:

1. Bill Franks, Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams
with advanced analystics, John Wiley & sons, 2012.

2. Glenn J. Myatt, Making Sense of Data, John Wiley & Sons, 2007 Pete Warden, Big Data
Glossary, O‟ Reilly, 2011.

3. Jiawei Han, Micheline Kamber “Data Mining Concepts and Techniques”, Second Edition,
Elsevier, Reprinted 2008.

Staff In-charge HOD

Potrebbero piacerti anche