Free Course

Deploying a Hadoop Cluster

Analyze Data with Hadoop and MapReduce

Nanodegree Program

Data Analyst

byKaggle

Accelerate your career with the credential that fast-tracks you to job success.

About this Course

Learn how to tackle big data problems with your own Hadoop clusters! In this course, you’ll deploy Hadoop clusters in the cloud and use them to gain insights from large datasets.

Course Cost
Free
Timeline
Approx. 3 weeks
Skill Level
intermediate
Included in Product

Rich Learning Content

Interactive Quizzes

Taught by Industry Pros

Self-Paced Learning

Student Support Community

Join the Path to Greatness

This course is your first step towards a new career with the Data Analyst Program.

Free Course

Deploying a Hadoop Cluster

Enhance your skill set and boost your hirability through innovative, independent learning.

Icon steps
 
 

Course Leads

Mat Leonard

Mat Leonard

Instructor

Prerequisites and Requirements

This course is intended for students with some experience with Hadoop and MapReduce, Python, and bash commands.

You’ll have to be able to work with HDFS and write MapReduce programs. You can learn about these in our Intro to Hadoop and MapReduce course.

The MapReduce programs in the course are written in Python. It is possible to use Java and other languages, but we suggest using Python, on the level of our Intro to Computer Science course.

You’ll also be using remote cloud machines, so you’ll need to know these bash commands:

  • ssh
  • scp
  • cat
  • head/tail

You’ll also need to be able to work in an editor such as vim or nano. You can learn about these in our Linux Command Line Basics course.

See the Technology Requirements for using Udacity.

Our Nanodegrees are packed with much more
Nanodegree Certification

Rich Learning Content

Interactive Quizzes

Self Paced Learning

Taught by Industry Professionals

1-1 Coaching and Mentorship

See Nanodegree
Free Courses

Rich Learning Content

Interactive Quizzes

Self Paced Learning

Taught by Industry Professionals

 

See Free Courses

Why Take This Course

Using massive datasets to guide decisions is becoming more and more important for modern businesses. Hadoop and MapReduce are fundamental tools for working with big data. By knowing how to deploy your own Hadoop clusters, you’ll be able to start exploring big data on your own.

What do I get?
Instructor videosLearn by doing exercisesTaught by industry professionals

Udacity on the go

Now you can achieve your goals on the move. Discover our offerings, personalised recommendations, classroom experience and so much more. Install Now!

Need Help with Enrollments?1800-121-6240
Mon - Fri, 10 am - 10 pm