Hadoop Administration - mindscripts.com
8805674210 / 9764560238
Course Duration For Hadoop Administration Course :
  8 weekends (weekend batches)
Eligibility For Hadoop Administration Course :
  BSc, BCS, BCA, BE, B.Tech, MSc, MCS, MCA, M.Tech
  Fundamental knowledge of Java and the Linux environment is preferred
Objective For Hadoop Administration Course :
  Understand the fundamental concepts of Hadoop
  Understand HDFS features
  Provide insights about the roles of a Data Scientist
  Ability to analyze Big Data
  Make predictions using machine learning
  Learn to turn hypotheses and data into actionable predictions
Hadoop Administration
www.mindscripts.com
Syllabus for Hadoop Administration
Hadoop Administration
  Introduction to Big Data and Hadoop
  Types of Data
  Characteristics of Big Data
  Business Benefits of Big Data Technology
  Hadoop and Traditional RDBMS
  Hadoop Core Services
Hadoop Distributed File System
  Introduction to Hadoop Distributed File System
  Goals of HDFS
  HDFS Architecture
  Design of HDFS
  Hadoop Storage Mechanism
  Measures of Capacity Execution
  HDFS Storage Architecture
  Heterogeneous Storage
  HDFS Commands
The MapReduce Framework
  Understanding MapReduce
  The Map and Reduce Phase
  WordCount in MapReduce
  Running MapReduce Job
Hadoop Installation and Configuration
  Ubuntu Server - Introduction
  Hadoop and Multi-Node Installation
  Create a Clone of Hadoop Virtual Machine
  Perform Clustering of the Hadoop Environment
Planning Hadoop Cluster
  Architecture of Hadoop Cluster
  Workflow of Hadoop Cluster
  HDFS Writes
  Preparing for HDFS Writes
  Pipelined HDFS Write
  NameNode Functionality
  Replicating Missing Replicas
  HDFS Reads
  Factors for Planning Hadoop Cluster
  Single-Node and Multi-Node Cluster Configuration
  HDFS Block Replication and Rack Awareness
  Topology and Components of Hadoop Cluster
YARN
  Introduction to YARN
  Need for YARN
  YARN Architecture
  YARN Installation and Configuration
Installing and Managing Hadoop Ecosystem
  Sqoop
  Flume
  Hive
  Pig
  HBase
  Oozie
Extending Hadoop
  Simplifying information access
  Enabling SQL-like querying with Hive
  Installing Pig to create MapReduce jobs
  Imposing a tabular view on HDFS with HBase
  Configuring Oozie to schedule workflows
Cluster Maintenance
  Checking HDFS Status
  Breaking the Cluster
  Copying Data Between Clusters
  Adding and Removing Cluster Nodes
  Rebalancing the Cluster
  NameNode Metadata Backup
  Cluster Upgrading
Cluster Monitoring, Troubleshooting and Optimizing
  General System Conditions to Monitor
  NameNode and JobTracker Web UIs
  View and Manage Hadoop's Log Files
  Ganglia Monitoring Tool
  Common Cluster Issues and Their Resolutions
Advanced Cluster Configuration Features
  Hadoop Configuration Overview
  Types of Configuration Files
  Hadoop Environment Setup
  Include and Exclude Configuration Files
  Hadoop Cluster and MapReduce Configuration
  Parameters with Values
Managing and Scheduling Jobs
  Managing Jobs
  The FIFO and Fair Schedulers
  How to Stop and Start Jobs Running on the Cluster