In the Understanding Big Data Problem post, we saw that HDFS, the Hadoop Distributed File System, takes care of all the storage-related complexities in Hadoop. In this […]
How to use a multi-character delimiter in a Hive table? Sometimes your data is too complex to delimit the individual columns with a single character like […]
Change field termination value in Hive This blog post describes how to change the field termination value in Hive. Assume when you created the Hive table, […]
DataNode process killed due to Incompatible clusterIDs error This blog post will describe how to address Incompatible clusterIDs with DataNodes. 2013-04-11 16:26:15,720 FATAL org.apache.hadoop.hdfs.server.datanode.DataNode: Initialization failed […]
FSNamesystem initialization failed FSNamesystem initialization failed is a common error Hadoop users get, especially if they are trying to set up a Hadoop cluster for […]
Hadoop safemode recovery – taking too long! Any time the NameNode is started or restarted, it first goes into a maintenance state called Safe Mode. When NameNode is […]
Hadoop Administrator In Real World – Course Coverage We launched the Hadoop Developer In Real World course in Nov 2015 and got an excellent response from the […]
What employers expect from Hadoop Administrators? In this post we will discuss what employers expect from Hadoop Administrators. We also have a video version of this post, […]
Is Hadoop Administration right for me? When we first announced that we were working on a new Hadoop Administration course, we had several students and members […]
Changing The Output File Prefix Of Hadoop MapReduce Job Your Hadoop job can have multiple reducers and each reducer will create a file by default with […]
Hadoop Mapper and Reducer Output Mismatch Can you have different output key-value pair types for the Mapper and the Reducer in a MapReduce program? Short answer – […]
Apache Pig Tutorial – Executing Script with Parameters The goal of this tutorial is to learn Apache Pig concepts at a fast pace, so don’t expect lengthy posts. […]