Why does Hadoop report “Unhealthy Node local-dirs and log-dirs are bad”?

The most common cause of the “local-dirs are bad” error is disk utilization on the node exceeding YARN’s max-disk-utilization-per-disk-percentage default value of 90.0%. Either clean up the disk that the unhealthy node is running on, or increase the threshold in yarn-site.xml:

    <property>
      <name>yarn.nodemanager.disk-health-checker.max-disk-utilization-per-disk-percentage</name>
      <value>98.5</value>
    </property>

Avoid disabling the disk check, because your jobs may fail … Read more
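If it is not obvious which directories (and therefore which disks) the health checker is measuring, they are the NodeManager’s local and log directories. A minimal yarn-site.xml sketch of where those are configured (the paths below are hypothetical examples; check your own configuration for the real values):

    <!-- Hypothetical paths. The disks behind these directories are the ones
         compared against max-disk-utilization-per-disk-percentage. -->
    <property>
      <name>yarn.nodemanager.local-dirs</name>
      <value>/data1/yarn/local,/data2/yarn/local</value>
    </property>
    <property>
      <name>yarn.nodemanager.log-dirs</name>
      <value>/data1/yarn/logs,/data2/yarn/logs</value>
    </property>

Freeing space on the disks behind these paths, or raising the threshold as above, lets the node return to a healthy state after the next health-check interval.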

Difference between `yarn.scheduler.maximum-allocation-mb` and `yarn.nodemanager.resource.memory-mb`?

Consider a scenario where you are setting up a cluster in which each machine has 48 GB of RAM. Some of this RAM should be reserved for the operating system and other installed applications. yarn.nodemanager.resource.memory-mb: the amount of physical memory, in MB, that can be allocated for containers. It means the amount of memory YARN can utilize … Read more
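As a purely illustrative yarn-site.xml sketch for such a 48 GB machine (the specific values are assumptions, not taken from the answer above): reserve roughly 8 GB for the OS and other services, and cap any single container well below the node total.

    <property>
      <name>yarn.nodemanager.resource.memory-mb</name>
      <value>40960</value>  <!-- total memory YARN may hand out on this node -->
    </property>
    <property>
      <name>yarn.scheduler.maximum-allocation-mb</name>
      <value>8192</value>   <!-- largest single container request the scheduler will grant -->
    </property>

yarn.scheduler.maximum-allocation-mb limits an individual container request, so it should never be set higher than yarn.nodemanager.resource.memory-mb.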

How to limit the number of retries on Spark job failure?

There are two settings that control the number of retries (i.e. the maximum number of ApplicationMaster registration attempts with YARN before the entire Spark application is considered failed): spark.yarn.maxAppAttempts – Spark’s own setting. See MAX_APP_ATTEMPTS:

    private[spark] val MAX_APP_ATTEMPTS = ConfigBuilder("spark.yarn.maxAppAttempts")
      .doc("Maximum number of AM attempts before failing the app.")
      .intConf
      .createOptional

yarn.resourcemanager.am.max-attempts – YARN’s … Read more
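As a hedged sketch of applying the Spark-side setting from code (the application name is made up; spark.yarn.maxAppAttempts only matters when running on YARN):

    import org.apache.spark.sql.SparkSession

    // Cap the application at a single AM attempt so a failing job is not retried.
    // YARN's yarn.resourcemanager.am.max-attempts still acts as an upper bound.
    val spark = SparkSession.builder()
      .appName("no-retries-example")            // hypothetical name
      .config("spark.yarn.maxAppAttempts", "1")
      .getOrCreate()

In yarn-cluster mode the value has to be known at submission time, so it is usually passed on the command line instead, e.g. spark-submit --conf spark.yarn.maxAppAttempts=1.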

How to set amount of Spark executors?

In Spark 2.0+, use the SparkSession variable to set the number of executors dynamically (from within the program):

    spark.conf.set("spark.executor.instances", 4)
    spark.conf.set("spark.executor.cores", 4)

In the above case a maximum of 16 tasks will be executed at any given time. The other option is dynamic allocation of executors, as below:

    spark.conf.set("spark.dynamicAllocation.enabled", "true")
    spark.conf.set("spark.executor.cores", 4)
    spark.conf.set("spark.dynamicAllocation.minExecutors", "1")
    spark.conf.set("spark.dynamicAllocation.maxExecutors", "5")

This way you can let … Read more
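If it is useful, here is a hedged sketch of the same settings supplied when the SparkSession is first built rather than changed afterwards (the application name and values are illustrative; dynamic allocation on YARN typically also requires the external shuffle service, spark.shuffle.service.enabled=true, or shuffle tracking on newer Spark versions):

    import org.apache.spark.sql.SparkSession

    // Illustrative sketch: executor sizing fixed at session creation time.
    val spark = SparkSession.builder()
      .appName("executor-sizing-example")                 // hypothetical name
      .config("spark.dynamicAllocation.enabled", "true")
      .config("spark.executor.cores", "4")
      .config("spark.dynamicAllocation.minExecutors", "1")
      .config("spark.dynamicAllocation.maxExecutors", "5")
      .getOrCreate()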

Why does a JVM report more committed memory than the linux process resident set size?

I’m beginning to suspect that stack memory (unlike the JVM heap) is precommitted without becoming resident, and over time becomes resident only up to the high-water mark of actual stack usage. Yes, at least on Linux, mmap is lazy unless told otherwise. Anonymous pages are only backed by physical memory once they’re … Read more
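A rough, Linux-only sketch (not from the original answer) that makes this laziness visible: start threads that request large stacks but barely touch them, and watch the process RSS reported in /proc/self/status grow by far less than the stack space that was reserved.

    import scala.io.Source

    object StackResidencyDemo {
      // Read this process's resident set size, in kB, from /proc/self/status.
      private def rssKb(): Long =
        Source.fromFile("/proc/self/status").getLines()
          .find(_.startsWith("VmRSS:"))
          .map(_.trim.split("\\s+")(1).toLong)
          .getOrElse(-1L)

      def main(args: Array[String]): Unit = {
        println(s"RSS before: ${rssKb()} kB")
        // 100 threads x 16 MB requested stack ~= 1.6 GB of reserved stack space,
        // but each idle thread only ever touches a few pages of it.
        (1 to 100).foreach { i =>
          val t = new Thread(null, () => Thread.sleep(60000), s"idle-$i", 16L * 1024 * 1024)
          t.setDaemon(true)
          t.start()
        }
        Thread.sleep(2000) // give the threads time to start
        println(s"RSS after: ${rssKb()} kB (grows by far less than 1.6 GB)")
      }
    }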

What is the relation between ‘mapreduce.map.memory.mb’ and ‘mapred.map.child.java.opts’ in Apache Hadoop YARN?

mapreduce.map.memory.mb is the upper memory limit that Hadoop allows to be allocated to a mapper, in megabytes. The default is 512. If this limit is exceeded, Hadoop will kill the mapper with an error like this: Container[pid=container_1406552545451_0009_01_000002,containerID=container_234132_0001_01_000001] is running beyond physical memory limits. Current usage: 569.1 MB of 512 MB physical memory used; 970.1 MB … Read more
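Because mapreduce.map.memory.mb is the container limit enforced by YARN, while the java.opts setting controls the mapper JVM’s own heap, the two are usually set together with the heap kept below the container limit so there is headroom for non-heap memory. A hedged mapred-site.xml sketch (the values are illustrative; mapreduce.map.java.opts is the newer name for mapred.map.child.java.opts):

    <property>
      <name>mapreduce.map.memory.mb</name>
      <value>2048</value>       <!-- hard limit YARN enforces on the map container -->
    </property>
    <property>
      <name>mapreduce.map.java.opts</name>
      <value>-Xmx1638m</value>  <!-- mapper JVM heap, roughly 80% of the container limit -->
    </property>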