PROSPECTUS
HADOOP ADMINISTRATION
UNIVERSITY OF SKILLS
Our training methods and quality of service set us apart from our competitors,
which makes ISM a unique place to fine-tune your skills.
1. Ability to store and process huge amounts of any kind of data, quickly. With data
volumes and varieties constantly increasing, especially from social media and the
Internet of Things (IoT), that's a key consideration.
2. Computing power. Hadoop's distributed computing model processes big data fast.
The more computing nodes you use, the more processing power you have.
3. Fault tolerance. Data and application processing are protected against hardware
failure. If a node goes down, jobs are automatically redirected to other nodes to make
sure the distributed computing does not fail. Multiple copies of all data are stored
automatically.
4. Flexibility. Unlike traditional relational databases, you don't have to preprocess data
before storing it. You can store as much data as you want and decide how to use it
later. That includes unstructured data like text, images and videos.
5. Low cost. The open-source framework is free and uses commodity hardware to
store large quantities of data.
6. Scalability. You can easily grow your system to handle more data simply by adding
nodes. Little administration is required.
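The fault-tolerance point above can be illustrated with a small toy model. This is only a sketch under stated assumptions, not Hadoop code: the node names, block names, and the round-robin placement are made up for illustration (real HDFS uses rack-aware placement), and the replication factor of 3 matches the HDFS default.

```python
# Toy model of replicated block storage. NOT the Hadoop API.
# Assumption: round-robin placement; real HDFS placement is rack-aware.

REPLICATION = 3  # HDFS default replication factor


def place_blocks(blocks, nodes, replication=REPLICATION):
    """Assign each block to `replication` distinct nodes, round-robin."""
    placement = {}
    for i, block in enumerate(blocks):
        placement[block] = {
            nodes[(i + k) % len(nodes)] for k in range(replication)
        }
    return placement


def readable_blocks(placement, failed):
    """Return the blocks that still have a replica on a live node."""
    return {b for b, holders in placement.items() if holders - failed}


# Hypothetical four-node cluster holding four blocks.
nodes = ["node1", "node2", "node3", "node4"]
blocks = ["blk_0", "blk_1", "blk_2", "blk_3"]
placement = place_blocks(blocks, nodes)

# One node fails: every block is still readable from another replica.
survivors = readable_blocks(placement, failed={"node2"})
print(sorted(survivors))
```

In this model a block becomes unreadable only when all three nodes holding its copies fail at the same time, which is why losing a single machine does not lose data.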
PLACEMENT RECORDS
We are proud to have helped most of our students build their
careers.
We conduct 25 interviews per month and place 40 students per month,
which is far ahead of any of our competitors.
We have a client base across India and abroad. We work with MNCs
and MSI, supply all our clients with trained manpower, and ensure
through quality training that our clients are satisfied with the
manpower supplied.
We provide 100% genuine placement assistance and guidance to help
you begin an innovative career. We promise to arrange
interviews until you get a job.
We have placed 5000+ students so far.
COURSE OUTLINE
1. Introduction to Big Data
What is Big Data?
Big Data Facts
The Three Vs of Big Data
2. Understanding Hadoop
What is Hadoop?
Why learn Hadoop?
Relational Databases Vs. Hadoop
Motivation for Hadoop
6 Key Hadoop Data Types
3. The Hadoop Distributed File system (HDFS)
What is HDFS?
HDFS components
Understanding Block Storage
The Name Node
Data Node Failures
HDFS Commands
HDFS File Permissions
4. The MapReduce Framework
Overview of MapReduce
Understanding MapReduce
The Map Phase
The Reduce Phase
WordCount in MapReduce
Running MapReduce Job
5. Planning Your Hadoop Cluster
Single Node Cluster Configuration
Multi-Node Cluster Configuration
6. Cluster Maintenance
Checking HDFS Status
Breaking the Cluster
Copying Data Between Clusters
Adding And Removing Cluster Nodes
Rebalancing the cluster
Name Node Metadata Backup
Cluster Upgrading
7. Installing and Managing Hadoop Ecosystem Projects
Sqoop
Flume
Hive
Pig
HBase
Oozie
8. Managing and Scheduling Jobs
Managing Jobs
The FIFO Scheduler
The Fair Scheduler
How to stop and start jobs running on the cluster
9. Cluster Monitoring, Troubleshooting, and Optimizing
General System conditions to Monitor
Name Node and Job Tracker Web UIs
View and Manage Hadoop's Log Files
Ganglia Monitoring Tool
Common cluster issues and their resolutions
Benchmark your cluster's performance
10. Populating HDFS from External Sources
How to use Sqoop to import data from RDBMSs to HDFS
How to gather logs from multiple systems using Flume
Features of Hive, HBase and Pig
How to populate HDFS from external Sources
COURSE FEE
Course Fee : Rs. 8,000/-
Duration : 48 hrs
COURSE INCLUDES
1. Two e-learning courses
2. Soft copy of all software used in the course
3. Access to E-library during course period
ELIGIBILITY
Basics of Linux
ISO 9001:2008
learn@ismuniv.com www.ismuniv.com