Spark Execution Engine – Logical Plan to Physical Plan
November 8, 2017Managed Table vs. External Table In Hive
January 24, 2018We hosted a webinar on November 11th 2017 answering several Hadoop or Big Data interview questions that were asked in real interviews. Couple weeks before the webinar we asked our wonderful Hadoop In Real World community to share interesting or challenging questions they were asked in real interviews. As a result we got several interesting and challenging questions from the community that were asked in real world interviews.
We had so much fun answering those questions in the webinar. The participants were super engaging and we even answered more questions that was asked live by the participants on the webinar.
We quite often hosts webinars like these, sign up below to get invitations to join one of our webinars.
Interesting & Challenging Questions
- Explain your recent project, roles & responsibilities?
- Explain MapReduce flow in detail
- Assume a 10 GB dataset, how many mappers and reducers will be created?
- How does data get transferred from Mapper to Reducer?
- If you have more than one reducer, how does data gets to the correct reducer?
- When to use Hive and when to use Spark?
- Does Hive support ACID & CRUD?
- When to use Partitions & when to use Buckets in Hive?
- What is Map Join & SMB Join in Hive?
- When to use columnar format and when to use row format?
- What is the difference between RDD & DataFrame?
- What is Oozie?
- What is Flume?
- What is Sentry?
- What is Kafka?
- How do you promote code to production?
- Why odd number of nodes – 1, 3, 5.. ?
- Is it possible to make a Mapper multithreaded?
We are certainly planning to host another session to cover more interview questions. So, if you have an interesting interview question that you would like us to answer, please email that question(s) to info@hadoopinrealworld.com
Here is the full recording of the webinar. Enjoy!