{"id":1194,"date":"2017-02-16T10:00:48","date_gmt":"2017-02-16T16:00:48","guid":{"rendered":"http:\/\/hadoopinrealworld.com\/?p=1194"},"modified":"2023-02-19T07:32:43","modified_gmt":"2023-02-19T13:32:43","slug":"hadoop-starter-kit-tutorial","status":"publish","type":"post","link":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/","title":{"rendered":"Hadoop Starter Kit – Tutorial"},"content":{"rendered":"

Hadoop Starter Kit – Tutorial<\/h1>\n

In this Hadoop tutorial, a.k.a. the Hadoop Starter Kit, you will learn the core concepts of Hadoop, such as HDFS and MapReduce, and get a solid introduction to Apache Pig and Hive. More importantly, you can try what you learn on a 3-node Cloudera CDH5 Hadoop cluster, 100% free.<\/span><\/p>\n

This course is divided into 4 sections.<\/p>\n

Before you read on, please note that this post and all the links\/posts below are from our free introductory course on Hadoop, Hadoop Starter Kit. Click here to enroll in Hadoop Starter Kit.\u00a0<\/a>\u00a0You will also get free access to our 3-node Hadoop cluster hosted on Amazon Web Services (AWS).<\/span><\/p>\n

Introduction to Big Data<\/h2>\n

In the very first section, we will see what Big Data is and understand the problems and complexities that come with storing and analyzing it. We will also see how Hadoop addresses those complexities.<\/span><\/p>\n

What is Big Data?<\/a><\/p>\n

Understanding Big Data problem<\/a><\/p>\n

HDFS<\/h2>\n

In section 2, we will talk about the Hadoop Distributed File System (HDFS), one of the core components of Hadoop. We will start by looking at what a file system is and why we need a new file system like HDFS. You will then learn HDFS commands and try them on our training cluster. Click here to get free access to the cluster.\u00a0<\/span>We will finish this section by learning about the HDFS architecture.<\/span><\/p>\n

HDFS – Why another filesystem?<\/a><\/p>\n

Working with HDFS<\/a><\/p>\n

HDFS Architecture<\/a><\/p>\n

MapReduce<\/h2>\n

In section 3, we will learn about MapReduce. After an introduction, we will go in depth into the phases involved in MapReduce. We will then write a MapReduce program in Java that calculates the maximum closing price for each stock symbol in a stock dataset, and go over the program in detail.<\/span><\/p>\n

Introduction to MapReduce<\/a><\/p>\n

Dissecting MapReduce components<\/a><\/p>\n

Dissecting MapReduce program (Part 1)<\/a><\/p>\n

Dissecting MapReduce program (Part 2)<\/a><\/p>\n

We hope you are excited to start learning Hadoop. As a reminder, this post and the other links in it are from our free introductory course on Hadoop, Hadoop Starter Kit. Click here to enroll in Hadoop Starter Kit.\u00a0<\/a>\u00a0You will also get free access to our 3-node Hadoop cluster hosted on Amazon Web Services (AWS).<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"

Hadoop Starter Kit – Tutorial In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS, MapReduce and [\u2026]<\/span><\/p>\n","protected":false},"author":3,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-1194","post","type-post","status-publish","format-standard","hentry","category-hadoop"],"yoast_head":"\nHadoop Starter Kit - Tutorial - Big Data In Real World<\/title>\n<meta name=\"description\" content=\"In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS & MapReduce.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Hadoop Starter Kit - Tutorial - Big Data In Real World\" \/>\n<meta property=\"og:description\" content=\"In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS & MapReduce.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/\" \/>\n<meta property=\"og:site_name\" content=\"Big Data In Real World\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/bigdatainrealworld\" \/>\n<meta property=\"article:published_time\" content=\"2017-02-16T16:00:48+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-02-19T13:32:43+00:00\" \/>\n<meta name=\"author\" content=\"Big Data In Real World\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" 
content=\"Big Data In Real World\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/\"},\"author\":{\"name\":\"Big Data In Real World\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#\/schema\/person\/24cab2292ef49c73053440c86515ef67\"},\"headline\":\"Hadoop Starter Kit – Tutorial\",\"datePublished\":\"2017-02-16T16:00:48+00:00\",\"dateModified\":\"2023-02-19T13:32:43+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/\"},\"wordCount\":427,\"publisher\":{\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#organization\"},\"articleSection\":[\"Hadoop\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/\",\"url\":\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/\",\"name\":\"Hadoop Starter Kit - Tutorial - Big Data In Real World\",\"isPartOf\":{\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#website\"},\"datePublished\":\"2017-02-16T16:00:48+00:00\",\"dateModified\":\"2023-02-19T13:32:43+00:00\",\"description\":\"In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS & 
MapReduce.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.bigdatainrealworld.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Hadoop Starter Kit – Tutorial\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#website\",\"url\":\"https:\/\/www.bigdatainrealworld.com\/\",\"name\":\"Big Data In Real World\",\"description\":\"Learn Big Data from experts!\",\"publisher\":{\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.bigdatainrealworld.com\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#organization\",\"name\":\"Big Data In Real World\",\"url\":\"https:\/\/www.bigdatainrealworld.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.bigdatainrealworld.com\/wp-content\/uploads\/2023\/02\/black.png\",\"contentUrl\":\"https:\/\/www.bigdatainrealworld.com\/wp-content\/uploads\/2023\/02\/black.png\",\"width\":500,\"height\":500,\"caption\":\"Big Data In Real 
World\"},\"image\":{\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/bigdatainrealworld\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#\/schema\/person\/24cab2292ef49c73053440c86515ef67\",\"name\":\"Big Data In Real World\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.bigdatainrealworld.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/d332bc24fe9b3182f0a22135f163ac4e?s=96&d=retro&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/d332bc24fe9b3182f0a22135f163ac4e?s=96&d=retro&r=g\",\"caption\":\"Big Data In Real World\"},\"description\":\"We are a group of Big Data engineers who are passionate about Big Data and related Big Data technologies. We have designed, developed, deployed and maintained Big Data applications ranging from batch to real time streaming big data platforms. We have seen a wide range of real world big data problems, implemented some innovative and complex (or simple, depending on how you look at it) solutions.\",\"sameAs\":[\"https:\/\/www.bigdatainrealworld.com\/\"],\"url\":\"https:\/\/www.bigdatainrealworld.com\/author\/bigdatainrealworld\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Hadoop Starter Kit - Tutorial - Big Data In Real World","description":"In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS & MapReduce.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/","og_locale":"en_US","og_type":"article","og_title":"Hadoop Starter Kit - Tutorial - Big Data In Real World","og_description":"In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS & MapReduce.","og_url":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/","og_site_name":"Big Data In Real World","article_publisher":"https:\/\/www.facebook.com\/bigdatainrealworld","article_published_time":"2017-02-16T16:00:48+00:00","article_modified_time":"2023-02-19T13:32:43+00:00","author":"Big Data In Real World","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Big Data In Real World","Est. 
reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/#article","isPartOf":{"@id":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/"},"author":{"name":"Big Data In Real World","@id":"https:\/\/www.bigdatainrealworld.com\/#\/schema\/person\/24cab2292ef49c73053440c86515ef67"},"headline":"Hadoop Starter Kit – Tutorial","datePublished":"2017-02-16T16:00:48+00:00","dateModified":"2023-02-19T13:32:43+00:00","mainEntityOfPage":{"@id":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/"},"wordCount":427,"publisher":{"@id":"https:\/\/www.bigdatainrealworld.com\/#organization"},"articleSection":["Hadoop"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/","url":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/","name":"Hadoop Starter Kit - Tutorial - Big Data In Real World","isPartOf":{"@id":"https:\/\/www.bigdatainrealworld.com\/#website"},"datePublished":"2017-02-16T16:00:48+00:00","dateModified":"2023-02-19T13:32:43+00:00","description":"In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS & MapReduce.","breadcrumb":{"@id":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.bigdatainrealworld.com\/hadoop-starter-kit-tutorial\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.bigdatainrealworld.com\/"},{"@type":"ListItem","position":2,"name":"Hadoop Starter Kit – Tutorial"}]},{"@type":"WebSite","@id":"https:\/\/www.bigdatainrealworld.com\/#website","url":"https:\/\/www.bigdatainrealworld.com\/","name":"Big Data In 
Real World","description":"Learn Big Data from experts!","publisher":{"@id":"https:\/\/www.bigdatainrealworld.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.bigdatainrealworld.com\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.bigdatainrealworld.com\/#organization","name":"Big Data In Real World","url":"https:\/\/www.bigdatainrealworld.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.bigdatainrealworld.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.bigdatainrealworld.com\/wp-content\/uploads\/2023\/02\/black.png","contentUrl":"https:\/\/www.bigdatainrealworld.com\/wp-content\/uploads\/2023\/02\/black.png","width":500,"height":500,"caption":"Big Data In Real World"},"image":{"@id":"https:\/\/www.bigdatainrealworld.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/bigdatainrealworld"]},{"@type":"Person","@id":"https:\/\/www.bigdatainrealworld.com\/#\/schema\/person\/24cab2292ef49c73053440c86515ef67","name":"Big Data In Real World","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.bigdatainrealworld.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/d332bc24fe9b3182f0a22135f163ac4e?s=96&d=retro&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/d332bc24fe9b3182f0a22135f163ac4e?s=96&d=retro&r=g","caption":"Big Data In Real World"},"description":"We are a group of Big Data engineers who are passionate about Big Data and related Big Data technologies. We have designed, developed, deployed and maintained Big Data applications ranging from batch to real time streaming big data platforms. 
We have seen a wide range of real world big data problems, implemented some innovative and complex (or simple, depending on how you look at it) solutions.","sameAs":["https:\/\/www.bigdatainrealworld.com\/"],"url":"https:\/\/www.bigdatainrealworld.com\/author\/bigdatainrealworld\/"}]}},"_links":{"self":[{"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/posts\/1194","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/comments?post=1194"}],"version-history":[{"count":2,"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/posts\/1194\/revisions"}],"predecessor-version":[{"id":1229,"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/posts\/1194\/revisions\/1229"}],"wp:attachment":[{"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/media?parent=1194"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/categories?post=1194"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bigdatainrealworld.com\/wp-json\/wp\/v2\/tags?post=1194"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}