In this post we will look at how to calculate resource allocation for Spark applications. Figuring out how to allocate resources for a Spark application requires […]
This is our second installment of our Big Data Interview Questions and Answers webinar. Click for the first one. It’s always fun to host one of these […]
We are happy and excited to announce, today we are launching a Slack workspace for Hadoop In Real World community. The purpose of the Slack workspace […]
We hosted a webinar on November 11th 2017 answering several Hadoop or Big Data interview questions that were asked in real interviews. Couple weeks before the […]
We hosted this webinar on Saturday, October 28th 2017. In this webinar we discussed the most common memory related issues with MapReduce jobs and how to […]
ZooKeeper is a coordination service for distributed applications. Well, what the heck is that right? Glad you asked. If you are looking to understand what is […]
We hosted a webinar Saturday, October 14th 2017 and we answered some great questions that was posted by Hadoop In Real World community and also from […]
Dissecting MapReduce Program (Part 2) In the last post we went over the driver program of a MapReduce program in detail. We will also see InputFormat, OutputFormat […]
Dissecting MapReduce Components In the last post we looked at different Phases of MapReduce. In this post we will take a real example and walk through […]
Hadoop Starter Kit – Tutorial In this Hadoop Tutorial a.k.a Hadoop Starter Kit you will learn about the core concepts of Hadoop like HDFS, MapReduce and […]