Hadoop or Big Data Interview Questions and Answers (Part 1)

Spark Execution Engine – Logical Plan to Physical Plan

November 8, 2017

Managed Table vs. External Table In Hive

January 24, 2018

Published by Big Data In Real World at November 15, 2017

Interesting & Challenging Questions

Explain your recent project, roles & responsibilities?
Explain MapReduce flow in detail
Assume a 10 GB dataset, how many mappers and reducers will be created?
How does data get transferred from Mapper to Reducer?
If you have more than one reducer, how does data gets to the correct reducer?
When to use Hive and when to use Spark?
Does Hive support ACID & CRUD?
When to use Partitions & when to use Buckets in Hive?
What is Map Join & SMB Join in Hive?
When to use columnar format and when to use row format?
What is the difference between RDD & DataFrame?
What is Oozie?
What is Flume?
What is Sentry?
What is Kafka?
How do you promote code to production?
Why odd number of nodes – 1, 3, 5.. ?
Is it possible to make a Mapper multithreaded?

We are certainly planning to host another session to cover more interview questions. So, if you have an interesting interview question that you would like us to answer, please email that question(s) to info@hadoopinrealworld.com

Here is the full recording of the webinar. Enjoy!

Big Data In Real World

We are a group of Big Data engineers who are passionate about Big Data and related Big Data technologies. We have designed, developed, deployed and maintained Big Data applications ranging from batch to real time streaming big data platforms. We have seen a wide range of real world big data problems, implemented some innovative and complex (or simple, depending on how you look at it) solutions.

Comments are closed.

Hadoop or Big Data Interview Questions and Answers (Part 1)

Spark Execution Engine – Logical Plan to Physical Plan

Managed Table vs. External Table In Hive

Spark Execution Engine – Logical Plan to Physical Plan

Managed Table vs. External Table In Hive

Interesting & Challenging Questions

Big Data In Real World

Related posts

Sunset: Hadoop Developer In Real World cluster

How to kill a running Spark application?

What is the default number of executors in Spark?