hadoop and mapreduce certification

9
Certified Hadoop and Mapreduce Professional VS-1051

Upload: vskills

Post on 26-Jan-2015

106 views

Category:

Education


2 download

DESCRIPTION

Vskills certification for Hadoop and Mapreduce assesses the candidate for skills on Hadoop and Mapreduce platform for big data applications. The certification tests the candidates on various areas in Hadoop and Mapreduce which includes knowledge of Hadoop, Mapreduce, their configuration and administration, cluster installation and configuration, using pig, zookeeper and Hbase. http://www.vskills.in/certification/Certified-Hadoop-and-Mapreduce-Professional

TRANSCRIPT

Page 1: Hadoop and Mapreduce Certification

Certified Hadoop and

Mapreduce Professional

VS-1051

Page 2: Hadoop and Mapreduce Certification

Certified Hadoop and Mapreduce Professional

www.vskills.in

Certified Certified Certified Certified Hadoop and Mapreduce Hadoop and Mapreduce Hadoop and Mapreduce Hadoop and Mapreduce

Professional Professional Professional Professional

Certification CodeCertification CodeCertification CodeCertification Code VS-1051

Vskills certification for Hadoop and Mapreduce assesses the candidate for skills on

Hadoop and Mapreduce platform for big data applications. The certification tests the

candidates on various areas in Hadoop and Mapreduce which includes knowledge of

Hadoop, Mapreduce, their configuration and administration, cluster installation and

configuration, using pig, zookeeper and Hbase.

Knowledge of Knowledge of Knowledge of Knowledge of JavaJavaJavaJava programming programming programming programming languagelanguagelanguagelanguage is pre is pre is pre is pre----requisite for certification.requisite for certification.requisite for certification.requisite for certification.

Why should one take this certification?Why should one take this certification?Why should one take this certification?Why should one take this certification?

This Course is intended for professionals and graduates wanting to excel in their chosen

areas. It is also well suited for those who are already working and would like to take

certification for further career progression.

Earning Vskills Hadoop and Mapreduce Professional Certification can help candidate

differentiate in today's competitive job market, broaden their employment opportunities by

displaying their advanced skills, and result in higher earning potential.

Who will benefit from taking this certification?Who will benefit from taking this certification?Who will benefit from taking this certification?Who will benefit from taking this certification?

Job seekers looking to find employment in various IT companies or freelance, students

generally wanting to improve their skill set and make their CV stronger and existing

employees looking for a better role can prove their employers the value of their skills

through this certification.

Test DetailsTest DetailsTest DetailsTest Details

• Duration:Duration:Duration:Duration: 60 minutes

• No. of questions:No. of questions:No. of questions:No. of questions: 50

• Maximum marks:Maximum marks:Maximum marks:Maximum marks: 50, Passing marks: 25 (50%)

There is no negative marking in this module.

Fee StructureFee StructureFee StructureFee Structure

Rs. 4,000/- (Includes all taxes)

Page 3: Hadoop and Mapreduce Certification

Certified Hadoop and Mapreduce Professional

www.vskills.in

Companies that hireCompanies that hireCompanies that hireCompanies that hire Vskills Vskills Vskills Vskills Certified Certified Certified Certified Hadoop and Mapreduce ProfessionalHadoop and Mapreduce ProfessionalHadoop and Mapreduce ProfessionalHadoop and Mapreduce Professional

Vskills Certified Hadoop and Mapreduce Professional finds employment in big or small

cloud based software development and IT infrastructure companies. There is a shortage of

skilled professionals in this field and companies are in a constant look out of people well

acquainted with the work culture and the processes involved.

Page 4: Hadoop and Mapreduce Certification

Certified Hadoop and Mapreduce Professional

www.vskills.in

Table of Contents

1.1.1.1. IntroductiIntroductiIntroductiIntroductionononon 1.1 History of Hadoop Project 1.2 Need and requirement for Hadoop 1.3 Components of Hadoop project 2.2.2.2. HDFSHDFSHDFSHDFS 2.1 Basics (Blocks, Namenodes and Datanodes) 2.2 Interfaces and Data read and write process 2.3 HAR files and distcp 2.4 Command Line Interface 2.5 SequenceFile and MapFile, Checksumming, codecs and Writables 3.3.3.3. MapReduceMapReduceMapReduceMapReduce 3.1 Basics and Configuration API 3.2 Combiner functions and streaming 3.3 Counters, sorting, joins and side data 3.4 Input formats (Text, binary, database, multiple) 3.5 Output formats (Text, binary, database, multiple) 3.6 Submission and initialization of job and task 3.7 JobTracker and TaskTracker classes 3.8 Scheduling, Shuffle and sort 3.9 Environment and side effects 3.10 Configuration API 3.11 Debugging and Optimizing 4.4.4.4. ClusterClusterClusterCluster 4.1 Installation 4.2 Configuration 4.3 Testing and benchmarking 5.5.5.5. AdministrationAdministrationAdministrationAdministration 5.1 dfsadmin, fsck and balancer 5.2 log4j logging, log levels, stack trace and metrics 5.3 Backup and filesystem checks 5.4 Add and removal of nodes 6.6.6.6. PigPigPigPig 6.1 Installation, Local and hadoop mode 6.2 Grunt, script and embedded execution 6.3 Pig Latin

Page 5: Hadoop and Mapreduce Certification

Certified Hadoop and Mapreduce Professional

www.vskills.in

6.4 UDF and data processing operator 7.7.7.7. HbaseHbaseHbaseHbase 7.1 Need and evolution 7.2 Installation 7.3 Clients 8.8.8.8. ZookeeperZookeeperZookeeperZookeeper 8.1 Installation 8.2 Group membership and management 8.3 Znodes 8.4 API, triggers and ACL 8.5 States, consistency and sessions 8.6 Implementation

Page 6: Hadoop and Mapreduce Certification

Certified Hadoop and Mapreduce Professional

www.vskills.in

Course OutlineCourse OutlineCourse OutlineCourse Outline

IntroductionIntroductionIntroductionIntroduction � Illustrating the history and need of hadoop project � Describing the various components of hadoop project HDFSHDFSHDFSHDFS � Explaining the Hadoop Distributed Filesystem (Blocks, Namenodes and Datanodes) � Describing the various interfaces and data read and write process of the HDFS � Applying hadoop archives files and distcp for data storage and transfer � Exploring the various options in the command line interface � Implementing checksumming, SequenceFile and MapFile, codecs and Writables MapReduceMapReduceMapReduceMapReduce � Describing the basics and configuration API of mapreduce � Utilizing the combiner functions and streaming in mapreduce � Illustrating the application of counters, sorting, joins and side data � Accessing data from various input formats (Text, binary, database, multiple) � Explaining generation of various output formats (Text, binary, database, multiple) � Illustrating steps of submission and initialization of job and task � Detailing the JobTracker and TaskTracker classes � Applying scheduling, shuffle and sort � Configuring the environment and impact of side effects � Elucidating the process of debugging and optimizing in mapreduce ClusterClusterClusterCluster � Explaining the steps of installation of cluster � Illustrating the various options of configuration � Describing the process of testing and benchmarking AdministrationAdministrationAdministrationAdministration � Explaining the usage of dfsadmin, fsck and balancer � Monitoring and logging by log4j, log levels, stack trace and metrics � Describing the process of backup and filesystem checks � Illustrating the steps for managing nodes of a cluster PigPigPigPig � Explaining the need, installation and the local and hadoop mode � Describing the various executions like grunt, script and embedded � Illustrating the syntax and semantics of pig latin � Detailing the user defined functions and data processing operator

Page 7: Hadoop and Mapreduce Certification

Certified Hadoop and Mapreduce Professional

www.vskills.in

HbaseHbaseHbaseHbase � Explaining the need and evolution of hbase � Illustrating the steps for installation � Describing the various clients for connecting to hbase ZookeeperZookeeperZookeeperZookeeper � Describing the installation process � Explaining the concepts of group membership and management � Detailing the Znodes and their types � Various operations in zookeeper by API, triggers and ACL � Implementing consistency, states and sessions in zookeper � Illustrating the steps of implementation of zookeeper

Page 8: Hadoop and Mapreduce Certification

Certified Hadoop and Mapreduce Professional

www.vskills.in

Sample QuestionsSample QuestionsSample QuestionsSample Questions

1. 1. 1. 1. The size of The size of The size of The size of blblblbloooockckckck in HDFS is in HDFS is in HDFS is in HDFS is _____________._____________._____________._____________.

A. 512 bytes

B. 64 MB

C. 1024 KB

D. None of the above

2222. . . . The The The The switch given to “hadoop fs” command for detailed help isswitch given to “hadoop fs” command for detailed help isswitch given to “hadoop fs” command for detailed help isswitch given to “hadoop fs” command for detailed help is _____________. _____________. _____________. _____________.

A. -show

B. -help

C. -?

D. None of the above

3333. . . . RPC meansRPC meansRPC meansRPC means ______________. ______________. ______________. ______________.

A. Remote processing call

B. Remote process call

C. Remote procedure call

D. None of the above

4444. . . . Which method of Which method of Which method of Which method of the FileSystem object the FileSystem object the FileSystem object the FileSystem object is used for reading a file in HDFSis used for reading a file in HDFSis used for reading a file in HDFSis used for reading a file in HDFS????

A. open()

B. access()

C. select()

D. None of the above

5555. . . . How many states does Writable interface definesHow many states does Writable interface definesHow many states does Writable interface definesHow many states does Writable interface defines????

A. Two

B. Four

C. Three

D. None of the above

Answers: 1 (B), 2 (B), 3 (C), 4 (A), 5 (A)

Page 9: Hadoop and Mapreduce Certification