We hosted a webinar on Saturday, October 14th, 2017, and answered some great questions that were posted by the Hadoop In Real World community and by participants in the webinar. We would like to thank everyone who joined. Here are some of the questions we covered in the session.
Questions answered
- How to decide on the number of reducers?
- How to deal with records with different schemas in a dataset?
- What is Vectorized query execution?
- Is there a good use case of Spark for ETL type workloads?
- What is Apache Flink and how does it change the big data ecosystem?
- What are the differences between Flume, Kafka streaming and Spark streaming?
- How to size a cluster?
- What are the prerequisites for Hadoop Developer, Administrator and Tester roles?
We host webinars like this quite often. Sign up below to get invitations to join one of them.
Here is the full recording of the webinar. Enjoy!