make -j 8 g++: internal compiler error: Killed (program cc1plus)

Try running (just after the failure) dmesg. Do you see a line like this? Out of memory: Kill process 23747 (cc1plus) score 15 or sacrifice child Killed process 23747, UID 2243, (cc1plus) total-vm:214456kB, anon-rss:178936kB, file-rss:5908kB Most likely that is your problem. Running make -j 8 runs lots of process which use more memory. The problem … Read more

Which cluster type should I choose for Spark?

Spark Standalone Manager : A simple cluster manager included with Spark that makes it easy to set up a cluster. By default, each application uses all the available nodes in the cluster. A few benefits of YARN over Standalone & Mesos: YARN allows you to dynamically share and centrally configure the same pool of cluster … Read more

Can apache spark run without hadoop?

Spark is an in-memory distributed computing engine. Hadoop is a framework for distributed storage (HDFS) and distributed processing (YARN). Spark can run with or without Hadoop components (HDFS/YARN) Distributed Storage: Since Spark does not have its own distributed storage system, it has to depend on one of these storage systems for distributed computing. S3 – … Read more