Where does HDFS store files locally in Datanodes? - Big Data In Real World


HDFS splits files into blocks and stores those blocks on the local file system of each datanode. The exact location varies from cluster to cluster, depending on the configuration in hdfs-site.xml.

dfs.datanode.data.dir

The dfs.datanode.data.dir property in hdfs-site.xml sets the directory on each datanode's local file system where the blocks are stored.

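You can also specify multiple directories as a comma-delimited list, typically with each directory on a different device. How missing directories are handled depends on the Hadoop version: older releases document that directories that do not exist are ignored, while newer releases document that they are created if local file system permissions allow.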
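As a rough illustration of the comma-delimited form, the sketch below embeds a sample hdfs-site.xml fragment and reads the property back with Python's standard library. The paths /data/1/dfs/dn and /data/2/dfs/dn and the helper name datanode_data_dirs are made up for this example; they do not come from any particular cluster.

```python
import xml.etree.ElementTree as ET

# Hypothetical hdfs-site.xml fragment; the two paths are illustrative only.
SAMPLE_HDFS_SITE = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>dfs.datanode.data.dir</name>
    <value>file:///data/1/dfs/dn,file:///data/2/dfs/dn</value>
  </property>
</configuration>
"""


def datanode_data_dirs(xml_text):
    """Return the directories listed in dfs.datanode.data.dir, or [] if unset."""
    root = ET.fromstring(xml_text)
    for prop in root.findall("property"):
        if prop.findtext("name", default="").strip() == "dfs.datanode.data.dir":
            value = prop.findtext("value", default="")
            # Split the comma-delimited list and drop empty entries.
            return [d.strip() for d in value.split(",") if d.strip()]
    return []


for directory in datanode_data_dirs(SAMPLE_HDFS_SITE):
    print(directory)
```

On a real datanode you would point the same function at the contents of the cluster's own hdfs-site.xml, whose location depends on how Hadoop was installed.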

The default value for this property is file://${hadoop.tmp.dir}/dfs/data.
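To see what the default resolves to, the placeholders can be expanded by hand. The sketch below assumes hadoop.tmp.dir is left at its own default of /tmp/hadoop-${user.name} (from core-default.xml) and uses a made-up user name, hdfs. The expand function is a simplified stand-in for Hadoop's variable substitution, not its actual implementation.

```python
import re

# Assumed values: hadoop.tmp.dir as in core-default.xml; the user name is hypothetical.
PROPERTIES = {
    "user.name": "hdfs",
    "hadoop.tmp.dir": "/tmp/hadoop-${user.name}",
    "dfs.datanode.data.dir": "file://${hadoop.tmp.dir}/dfs/data",
}

PLACEHOLDER = re.compile(r"\$\{([^}]+)\}")


def expand(value, properties, max_depth=20):
    """Replace ${name} placeholders repeatedly until nothing changes (simplified)."""
    for _ in range(max_depth):
        # Placeholders with no matching property are left untouched.
        expanded = PLACEHOLDER.sub(
            lambda m: properties.get(m.group(1), m.group(0)), value
        )
        if expanded == value:
            break
        value = expanded
    return value


print(expand(PROPERTIES["dfs.datanode.data.dir"], PROPERTIES))
```

With these assumed values the script prints file:///tmp/hadoop-hdfs/dfs/data, so under the defaults the blocks would sit under /tmp on each datanode.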

