Hyderabad
Hadoop course content
COURSE DETAILS IN A NUTSHELL
1. Detailed explanation of the concepts of the HDFS & MapReduce frameworks
2. What the Hadoop 2.X architecture is & how to set up a Hadoop cluster
3. How to write complex MapReduce programs
4. Detailed explanation of how to load data using tools like Sqoop, Flume & Solr
5. How to perform data analysis using tools like Pig, Hive & YARN
6. How to implement & integrate HBase & MapReduce
7. How to execute advanced usage and indexing
8. How to schedule jobs using Oozie
9. Best practices for overall Hadoop development
10. RTAs on data analytics
11. What Spark is, a brief overview of its ecosystem & how to work with RDDs using Spark

Programming languages: Java & Scala
Frameworks: Hadoop Distributed File System (HDFS), MapReduce & Spark
Loading tools: Sqoop & Flume
Analytical tools: Pig, Hive and YARN
Scheduling tools: Oozie
REQUIREMENTS:
11. VMware: basics, Hadoop installations, backups
12. SQL basics: introduction to SQL, MySQL essentials, database fundamentals
PROJECTS:
2. Project2: Multi-node cluster setup
Topics: This project gives you the opportunity to work on a real-world Hadoop multi-node cluster setup in a distributed environment.
Running a Hadoop multi-node setup using a 4-node cluster
Deploying a MapReduce job on the Hadoop cluster
You will get a complete demonstration of working with the various Hadoop cluster master and slave nodes, installing Java as a prerequisite for running Hadoop, installing Hadoop, and mapping the nodes in the Hadoop cluster.
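To make the idea of a MapReduce job more concrete, the following is a minimal sketch of the map, shuffle, and reduce phases for a word count, written in plain Java. It deliberately does not use the Hadoop API, so it runs on a single machine without a cluster; the class name WordCountSketch and its method names are illustrative assumptions, not part of the course material.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java illustration of the MapReduce model; it does not use the Hadoop API.
class WordCountSketch {

    // Map phase: emit a (word, 1) pair for every word in one input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String word : line.toLowerCase().split("\\W+")) {
            if (!word.isEmpty()) {
                pairs.add(Map.entry(word, 1));
            }
        }
        return pairs;
    }

    // Shuffle phase: group all emitted values by key, with keys kept sorted.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: sum the values collected for one key.
    static int reduce(List<Integer> values) {
        int sum = 0;
        for (int v : values) {
            sum += v;
        }
        return sum;
    }

    // Run the three phases over a small in-memory input.
    static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        Map<String, Integer> counts = new TreeMap<>();
        shuffle(pairs).forEach((word, ones) -> counts.put(word, reduce(ones)));
        return counts;
    }

    public static void main(String[] args) {
        System.out.println(wordCount(List.of("Hadoop stores data", "Hadoop processes data")));
    }
}
```

On a real cluster the map and reduce steps would run on different nodes and the framework would handle the shuffle; the sketch only mirrors the data flow.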
3. Project3: Social media analytics
Topics: This project gives you the opportunity to work on social media analytics.
Streaming Twitter data
Storing the data in Hadoop
Processing social media data
Sentiment analysis on Twitter data
Storing the final result in a table
Connecting a BI tool
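As a rough illustration of the sentiment analysis step, here is a small lexicon-based scorer in plain Java. It is a sketch under stated assumptions: the word list, the class name SentimentSketch, and the three labels are hypothetical and chosen only for the example. A real project would read the tweets from Hadoop and use a much larger lexicon or a trained model.

```java
import java.util.Map;

// Toy lexicon-based sentiment scorer; the word list is an illustrative assumption.
class SentimentSketch {

    // Hypothetical lexicon: +1 for positive words, -1 for negative words.
    static final Map<String, Integer> LEXICON = Map.of(
            "good", 1, "great", 1, "love", 1,
            "bad", -1, "poor", -1, "hate", -1);

    // Sum the lexicon scores of the words in one tweet; unknown words score 0.
    static int score(String tweet) {
        int total = 0;
        for (String word : tweet.toLowerCase().split("\\W+")) {
            total += LEXICON.getOrDefault(word, 0);
        }
        return total;
    }

    // Turn the numeric score into a label that could be stored in a result table.
    static String label(String tweet) {
        int s = score(tweet);
        return s > 0 ? "positive" : s < 0 ? "negative" : "neutral";
    }

    public static void main(String[] args) {
        String[] tweets = {"I love Hadoop, great tool", "bad cluster, poor uptime", "just a tweet"};
        for (String tweet : tweets) {
            System.out.println(label(tweet) + "\t" + tweet);
        }
    }
}
```

The tab-separated label and tweet printed by main stand in for the rows that would be written to the final result table.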