Hadoop Administration - mindscripts.com


Upload: vuongnguyet

Post on 08-Mar-2018



8805674210 / 9764560238

Course Duration For Hadoop Administration Course :

8 Weekends (Weekend batches)

Eligibility For Hadoop Administration Course :

BSc, BCS, BCA, BE, B.Tech, MSc, MCS, MCA, M.Tech
Fundamental knowledge of Java and Linux environment shall be preferred

Objective For Hadoop Administration Course :

Understand the fundamental concepts of Hadoop
Understand HDFS features
Provide Insights About the Roles of a Data Scientist
Ability to Analyze Big Data
Make predictions using machine learning
Learn to turn hypotheses and data into actionable predictions

Hadoop Administration

www.mindscripts.com

Syllabus for Hadoop Administration

Hadoop Administration

Introduction to Big Data and Hadoop
Types of Data
Characteristics of Big Data
Business Benefits of Big Data Technology
Hadoop and Traditional RDBMS
Hadoop Core Services

Hadoop Distributed File System

Introduction to Hadoop Distributed File System
Goals of HDFS
HDFS Architecture
Design of HDFS
Hadoop Storage Mechanism
Measures of Capacity Execution
HDFS Storage Architecture
Heterogeneous Storage
HDFS Commands

Hadoop Installation and Configuration

Ubuntu Server-Introduction
Hadoop and Multi-Node Installation
Create a Clone of Hadoop Virtual Machine
Perform Clustering of the Hadoop Environment

The MapReduce Framework

Understanding MapReduce
The Map and Reduce Phase
WordCount in MapReduce
Running MapReduce Job


Planning Hadoop Cluster

Architecture of Hadoop Cluster
Workflow of Hadoop Cluster
HDFS Writes
Preparing for HDFS Writes
Pipelined HDFS Write
NameNode Functionality
Replicating Missing Replicas
HDFS Reads
Factors for Planning Hadoop Cluster
Single-Node and Multi-Node Cluster Configuration
HDFS Block Replication and Rack Awareness
Topology and Components of Hadoop Cluster

YARN

Introduction to YARN
Need for YARN
YARN Architecture
YARN Installation and Configuration

Installing and Managing Hadoop Ecosystem

Sqoop
Flume
Hive
Pig
HBase
Oozie

Extending Hadoop

Simplifying information access
Enabling SQL-like querying with Hive
Installing Pig to create MapReduce jobs
Imposing a tabular view on HDFS with HBase
Configuring Oozie to schedule workflows

Cluster Maintenance

Checking HDFS Status
Breaking the cluster
Copying Data Between Clusters
Adding and Removing Cluster Nodes
Rebalancing the cluster
NameNode Metadata Backup
Cluster Upgrading

Cluster Monitoring, Troubleshooting and Optimizing

Hadoop Configuration Overview
Types of Configuration Files

General System Conditions to Monitor
NameNode and JobTracker Web UIs
View and Manage Hadoop's Log Files
Ganglia Monitoring Tool
Common cluster issues and their resolutions

Hadoop Environment Setup
Include and Exclude Configuration Files

Hadoop Cluster and MapReduce Configuration Parameters with Values

Advanced Cluster Configuration Features

Managing and Scheduling Jobs

Managing Jobs
The FIFO and Fair Schedulers
How to stop and start jobs running on the cluster