Sei sulla pagina 1di 5

Module 1: Introduction to Data Warehouse and Dimensional modelling (15-20 marks) Asked(times)

1. What is DWH? 12
Architecture of DWH? Need for DWH?
2. Explain Metadata and its types? 12
Role of Metadata in DWH?
3. Datawarehouse vs DataMart’s 5
4. What is Dimensional Modelling? 3
5. Characteristics of DWH? 3
6. Top down vs Bottom Up 3
7. DWH design Strategies? 2
8. Updates to Dimension Tables. 2
9. Trends in DWH? 2
10 Various schemas used in DWH. 2
11 Characteristics of data present in DWH? 1
12 [DWH deployment and maintenance] 1
13 STAR schema 1

Module 2: ETL Process and OLAP (15-20 marks) Asked(times)


1. Explain ETL process? 12
ETL Cycle?
2. OLAP vs OLTP 6
3. OLAP models and their Architecture? 4
4. OLAP operations (slice, rollup, dice, drilldown, pivot) 4
5. Indexing OLAP data? 1
6. Data cubes 1
Module 3: Introduction to Data Mining, Data Exploration and Pre-processing (20-25 marks) Asked(times)
1. What is Datamining? 6
Applications of DM? (Application of DM in Financial Analysis)
Architecture of DM system?
2. KDD Process. 12
3. Steps in Data Pre-processing. 6
4. Visualization techniques in Datamining? 4
5. Issues in Datamining? 3
6. Attribute oriented induction 3
7. Types of attributes. 2

Module 5: Mining Frequent Patterns and Association Rules (20 marks) Asked(times)
1. Explain Multilevel and Multidimensional association rules with example? 5
2. Market Based Analysis? Explain terms? 4
Support
Confidence
Iceberg Queries
3. Explain Apriori algorithm? 3
4. Discuss Association Rule Mining? 2
5. Advanced Association Rules? 1
6. Generalized Association Rules? 1
Module 4: Classification, Prediction and Clustering (30 marks) Asked(times)
1. Explain K-Means Clustering? 5
2. What is Clustering? 4
Explain anyone Clustering Algo?
Examples of clustering?
Clustering Techniques?
Explain how supermart can use Clustering techniques?
Hierarchical Clustering?
Partitioning Methods in clustering?
3. Explain what Classification is and Different Algo? 4
Issues in Classification?
[How is Classification Accuracy determined?]
[Metrics of evaluating classifier performance]
4. Major Factors related to performance of Decision Tree based classification? 3
[ID 3 Algo? Pros and Cons?]
5. Explain Naïve Bayes classification? 3
Why it is called Naïve Bayes?
Outline major ideas of naïve Bayes classification techniques?
6. Define Linear, non-linear and multiple regression? 1
7. Different ways of finding distance between clusters? 1
Module 6: Spatial and Web Mining (10-15 marks) Asked
1 What is Web mining? 2
Any one algo?
2 Web personalization 1
3 Web usage mining 3
4 Crawlers 1
5 Explain spatial and temporal Data Mining 1

Define following Terms Asked


1 Factless Fact Tables 3
2 Snowflake schema 3
10 Aggregate Fact tables 1
5 Dimension Tables 1
7 Fact Constellation 2
4 Classification 1
6 Supervised Learning 1
8 FP tree 4
9 Concept Hierarchy 1
11 Snapshot and Transaction table 1
3 Web structure mining 3
OLD Syllabus
HITS
Page Rank
DMQL
DBSCAN
Key Restructuring
Outlier Mining
Operational vs Decision support systems