How to fix the LEADER_NOT_AVAILABLE error in Kafka?
March 22, 2021What is the difference between cache and persist in Spark?
March 26, 2021HDFS divides the files into blocks and stores the blocks locally in datanodes. The location varies from cluster to cluster based on the configuration in hdfs-site.xml
dfs.datanode.data.dir
dfs.datanode.data.dir
in hdfs-site.xml file dictates the directory location where the blocks will be stored in the local file system in the individual datanodes.
You can also specify multiple directories in a comma-delimited fashion typically on different devices. Directories that do not exist are ignored.
file://${hadoop.tmp.dir}/dfs/data
is the default value for this property.