L T P C
3 0 0 3
Subject Name: DATA ANALYTICS
Subject Code: IT6006 Class / Sem: IV CSE / VII
Staff In-Charge: BALAKIRUBA J
GENERAL OBJECTIVES:
S.No | Topic | Objective(s) of the Topic | Hours Required | Teaching Methodology/Techniques | Resources referred *
1. | Introduction to Big Data Platform – Challenges of conventional systems | To introduce the concept of Big Data | 1 | Black Board | R1
2. | Web data, Evolution of Analytic scalability | To know the evolution of analytic scalability | 1 | Black Board | R1
3. | Analytic processes and tools | To study the process and tools used for analysis | 1 | Black Board | R1
4. | Analysis vs reporting | To compare the differences between Analysis and Reporting | 1 | Black Board | R1
5. | Modern data analytic tools | To study the modern tools used for analytics | 1 | Black Board | T1
6. | Statistical concepts: Sampling distributions | To explain about the probability and sampling techniques | 1 | Black Board | T1
7. | Resampling, statistical inference | To study about the statistical inference | 1 | Discussion/ Black Board | T1
8. | Prediction error | To study the concept of prediction error in analytics | 1 | Black Board | T1
9. | Tutorial | Study on conventional and modern analytical tools | 1 | Black Board | T1
10. | Content Beyond syllabus {Conventional Database System} | To study the conventional database and compare with modern database | 1 | Black Board | T1,R1
Total Hours 10
Learning Outcomes:
S.No | Topic | Objective(s) of the Topic | Hours Required | Teaching Methodology/Techniques | Resources referred *
1. | Regression modeling, Multivariate analysis | To predict the value of a variable from the values of other variables | 1 | Black Board | T1
7. | Rule induction | To understand the concept of rule induction. | 1 | Black Board | T1
12. | Fuzzy decision trees, Stochastic search methods. | To learn the stochastic search methods | 1 | Black Board | T1
13. | Tutorial | To study on neural networks and fuzzy logic. | 1 | Black Board | T1
14. | Content Beyond syllabus {Normalization} | To understand the topic on normalization | 1 | Black Board | T1
Total Hours 14
Learning Outcomes:
• Artificial Intelligence
UNIT- III
MINING DATA STREAMS
S.No | Topic | Objective(s) of the Topic | Hours Required | Teaching Methodology/Techniques | Resources referred *
1. | Introduction to Streams Concepts | To introduce the stream concepts. | | Black Board | T1
2. | Stream data model and architecture | To understand the architecture of stream data model | | Black Board | T1
3. | Stream Computing, Sampling data in a stream | To introduce the concept of sampling data | 1 | Black Board | T1,R2
4. | Filtering streams | To study the concept of filtering | 1 | Black Board | T1
5. | Counting distinct elements in a stream | To study the technique to count distinct elements in a stream | 1 | Black Board | T1,R1
• Data mining
Scope for extra learning / Assignments / Activities:
UNIT- IV
FREQUENT ITEMSETS AND CLUSTERING
S.No | Topic | Objective(s) of the Topic | Hours Required | Teaching Methodology/Techniques | Resources referred *
7. | K-Means – Clustering high dimensional data | To understand the concepts of the K-Means algorithm | 1 | Black Board | T1
8. | CLIQUE and PROCLUS | To explain the topic on CLIQUE and PROCLUS | 1 | Black Board | T1
Total Hours 12
Learning Outcomes:
UNIT- V
FRAMEWORKS AND VISUALIZATION
S.No | Topic | Objective(s) of the Topic | Hours Required | Teaching Methodology/Techniques | Resources referred *
1. | MapReduce | To study about the MapReduce programming model | 1 | Black Board | T1
2. | Hadoop, Hive, MapR | To understand the Hadoop technology | 1 | Black Board | T1
3. | Sharding | To study the concept of sharding | 1 | Black Board | T1
4. | NoSQL Databases | To explain the NoSQL databases | 1 | Black Board | T1
5. | S3 | To study the concept of S3 | 1 | Discussion/ Black Board | T1
6. | Hadoop Distributed file systems | To know about the Hadoop Distributed File System | 1 | Black Board | T1
7. | Visualizations | To understand the visualization of data. | 1 | Black Board | T1
Total Hours 12
On learning this unit, the student should be able to:
• Google
• SQL Databases
TEXT BOOKS:
1. Michael Berthold, David J. Hand, Intelligent Data Analysis, Springer, 2007.
2. Anand Rajaraman and Jeffrey David Ullman, Mining of Massive Datasets, Cambridge University Press, 2012.
REFERENCES:
1. Bill Franks, Taming the Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics, John Wiley & Sons, 2012.
2. Glenn J. Myatt, Making Sense of Data, John Wiley & Sons, 2007.
3. Pete Warden, Big Data Glossary, O'Reilly, 2011.
4. Jiawei Han, Micheline Kamber, Data Mining Concepts and Techniques, Second Edition, Elsevier, Reprinted 2008.