HDFS Architecture
February 13, 2017Introduction to MapReduce
February 20, 2017Hadoop Starter Kit – Tutorial
In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS, MapReduce and a very good introduction to Apache Pig & Hive. More importantly you can try what you learn in a 3 node Cloudera CDH5 Hadoop cluster for FREE – yes 100% free.
This course is divided in to 4 sections.
Before you go on reading this post, please note that this post and all the links/posts below is from our free course named Hadoop Starter Kit. It is a free introductory course on Hadoop and it is 100% free. Click here to enroll to Hadoop Starter Kit. You will also get free access to our 3 node Hadoop cluster hosted on Amazon Web Services (AWS) – also free !
Introduction to Big Data
In the very first section, we will see what is Big Data and understand the problems and complexities that comes with storing and analyzing Big Data. We will also see how Hadoop provides a solution and address the complexities involved in Big Data.
Understanding Big Data problem
HDFS
In section 2 we will talk about Hadoop Distributed File System or HDFS which is one of the core components of Hadoop. We will start this section by seeing what is a file system and why we need a new file system like HDFS. You will then learn HDFS commands and try them in our training cluster. Click here to get your free access to the cluster. We will finish this section by learning about the HDFS architecture.
HDFS – Why another filesystem?
MapReduce
In section 3, we will learn about MapReduce. First we will get a good introduction to MapReduce and then we will go in depth to understand the phases involved in MapReduce. We will then write a MapReduce program in Java to calculate the maximum closing price of stock symbols from a stock dataset. We will go over the MapReduce program in detail.
Dissecting MapReduce components
Dissecting MapReduce program (Part 1)
Dissecting MapReduce program (Part 2)
We hope you are excited to start learning Hadoop. Just to remind you again, please note that this post and other links in this post are from our free course named Hadoop Starter Kit. It is a free introductory course on Hadoop and it is 100% free. Click here to enroll to Hadoop Starter Kit. You will also get free access to our 3 node Hadoop cluster hosted on Amazon Web Services (AWS) – also free !