Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
ADMIN
INTODUCTION
What is Big Data?
What is Hadoop?
Need of Hadoop
Challenges with Big Data
o i.Storage
o ii.Processing
Comparison with Other Technologies
Hadoop Echo System components
MAP REDUCE
Map Reduce Architecture
Processing Daemons of Hadoop
Job Tracker (Roles and Responsibilities)
Task Tracker(Roles and Responsibilities)
Input split
Input split vs Block size
Data Types in Map Reduce
Map Reduce Programming Model
PIG
Introduction to pig
Pig Latin Script
Pig Console / Grunt Shell
Execting Pig Latin Script
Pig Relations, Bags, Tuples, Fields
Data Types
Nulls
Constants
Expressions
Schemas
Parameter Substitution
Arithmetic Operators
Comparison Operators
Null Operators
Boolean Operators
Sign Operators
Flatten Operators
HIVE
Introduction
Hive Architecture
Hive Metastore
Hive Query Launguage
Difference between HQL and SQL
Hive Built in Functions
Hive UDF (user defined functions)
Hive UDAF (user defined Aggregated functions)
Hive UDTF (user defined table Generated functions)
Hive Serde?
Hive & Hbase Integration
Hive Working with unstructured data
Hive Working With Xml Data
Hive Working With Json Data
NOSQL
What is “Not only SQL”
NOSQL Advantages
What is problem with RDBMS for Large
Data Scaling Systems
Types of NOSQL & Purposes
Key Value Store
Columer Store
HBASE
Introduction to big table
What is NOSQL and colummer store Database
HBASE Introduction
Hbase use cases
Hbase basics
Column families
Scans
Hbase Architecture
Thrift
Map Reduce Integration
Map Reduce Over Hbase
Hbase data Modeling
Hbase Schema design
Hbase CRUD operators
Hive & Hbase interagation
Hbase storage handles
FLUME
Introduction to FLUME
What is the streaming File
FLUME Architecture
FLUME Nodes & FLUME Manager
FLUME Local & Physical Node
FLUME Agents & FLUME Collector
KAFKA
Introduction to KAFKA
KAFKA Architecture
Kafka components
BROKER
Topics
Producers
OOZIE
Introduction to OOZIE
OOZIE as a seheduler
OOZIE as a Workflow designer
Seheduling jobs (OOZIE CODE)
Defining Dependences between jobs
(OOZIE Code Examples)
Conditionally controlling jobs
(OOZIE Code Examples)
Defining parallel jobs (OOZIE Code Examples)
YARN
Introduction
YARN Architecture
o Resource Manager
o Application Master
o Node Manager
MR vs. YARN
IMPALA
What is Impala?
Impala for query processing
HIVE vs Impala
Usecases with impala
MONGODB
Introduction to MongoDB
Features of MongoDB
MongoDB Basic operations