Hadoop Administration

Hadoop Administration Training Provided by Revanth Technologies Training Institute in Hyderabad

Beginner 0(0 Ratings) 0 Students enrolled
Created by Revanth Technologies Training Institute staff Last updated Sat, 09-Apr-2022 English


Hadoop Administration free videos and free material uploaded by Revanth Technologies Training Institute staff .

Syllabus / What will i learn?

Understanding Big Data and Hadoop

Learning Objectives: In this module, you will understand what is Big Data and Apache Hadoop. You will also learn how Hadoop solves the Big Data problems, Hadoop Cluster Architecture, its core components & ecosystem, Hadoop data loading & reading mechanism and role of a Hadoop Cluster Administrator.

Introduction to big data

limitations of existing solutions

Hadoop architecture

Hadoop components and ecosystem

data loading & reading from HDFS

replication rules

rack awareness theory

Hadoop cluster administrator: Roles and responsibilities

Hadoop Architecture and Cluster setup

Learning Objectives: In this module, you will understand different Hadoop components; understand the working of HDFS, Hadoop cluster modes, configuration files, and more. You will also understand the Hadoop 2.0 cluster setup and configuration, setting up Hadoop Clients using Hadoop 2.0 and resolve problems simulated from a real-time environment.

Hadoop server roles and their usage

Hadoop installation and initial configuration

Deploying Hadoop in a pseudo-distributed mode

Deploying a multi-node Hadoop cluster

Installing Hadoop Clients

Understanding the working of HDFS and resolving simulated problems

Hadoop Cluster Administration and Understanding MapReduce

Learning Objectives: In this module you will understand the working of the secondary namenode, working with Hadoop distributed cluster, enabling rack awareness, maintenance mode of Hadoop cluster, adding or removing nodes to your cluster in an ad-hoc and recommended way, understand the MapReduce programming model in the context of Hadoop administrator and schedules.

Understanding secondary namenode

Working with Hadoop distributed cluster

Decommissioning or commissioning of nodes

Understanding MapReduce

Understanding schedulers and enabling them

Backup, Recovery and Maintenance

Learning Objectives: In this module, you will understand the day to day cluster administration tasks, balancing data in a cluster, protecting data by enabling trash, attempting a manual failover, creating backup within or across clusters, safeguarding your metadata and doing metadata recovery or manual failover of NameNode recovery, learn how to restrict the usage of HDFS in terms of count and volume of data, and more.

Key Hadoop Admin Commands

Trash

Import Check Point

Distcp, data backup, and recovery

Enabling trash

Namespace count quota or space quota

Manual failover or metadata recovery

Cluster planning and management

Learning Objectives: In this module, you will gather insights around cluster planning and management; learn about the various aspects one needs to remember while planning a setup of a new cluster, capacity sizing, understanding recommendations and comparing different distributions of Hadoop, understanding workload and usage patterns and some examples from the world of big data.

Planning a Hadoop 2.0 cluster

Cluster sizing, hardware

Network and software considerations

Popular Hadoop distributions

Workload and usage patterns

Industry recommendations

Hadoop 2.0 and features

Learning Objectives: In this module, you will learn more about the new features of Hadoop 2.0, HDFS High Availability, YARN framework and job execution flow, MRv2, federation, limitations of Hadoop 1.x and setting up Hadoop 2.0 Cluster setup in pseudo-distributed and distributed mode.

Limitations of Hadoop 1.x

Features of Hadoop 2.0

YARN framework

MRv2

Hadoop high availability and federation

YARN ecosystem and Hadoop 2.0 Cluster setup

Setting up Hadoop 2.X with highly availability and upgrading Hadoop

Learning Objectives: In this module, you will learn to setup Hadoop 2 with high availability, upgrading from v1 to v2, importing data from RDBMS into HDFS, understand why Oozie, Hive, and HBase are used and working on the components.

Configuring Hadoop 2 with high availability

upgrading to Hadoop 2

working with Sqoop

understanding Oozie

working with Hive

working with HBase

Project: Cloudera manager and Cluster setup, Overview on Kerberos

Learning Objectives: In this module, you will learn about Cloudera manager to setup Cluster, optimizations of Hadoop/Hbase/Hive performance parameters and understand the basics on Kerberos. You will learn to setup Pig to use in local/distributed mode to perform data analytics.

Cloudera manager and cluster setup

Hive administration

HBase architecture

HBase setup, Hadoop/Hive/HBase performance optimization

Pig setup and working with a grunt, why Kerberos and how it helps

 



Curriculum for this course
0 Lessons 00:00:00 Hours
+ View more
Description
You need online training / explanation for this course?

1 to 1 Online Training contact instructor for demo :


+ View more

Other related courses
About the instructor
  • 0 Reviews
  • 1 Students
  • 160 Courses
Student feedback
0
Average rating
  • 0%
  • 0%
  • 0%
  • 0%
  • 0%
Reviews

Material price :

₹ 0
Buy now

1:1 Online Training Fee: 10000 /-
Contact instructor for demo :