The solution addresses what happens when a Hadoop cluster suddenly becomes unavailable because the NameNode is terminated. It aims to increase performance, reduce delay, solve the automated failover problem, and improve the reliability of Hadoop.
I want to get a notification when MapReduce jobs complete. How do I do it? Heavy MapReduce jobs may run for several hours. There can be several jobs, and checking their status manually is a boring task. I don’t like this :). If we try to manage java pr… Source: Notification on Continue reading “How to setup notification on completion of Mapreduce jobs”
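The excerpt above is cut off before it describes its approach, so the following is only a sketch of one option that Hadoop itself provides, not necessarily the method of the linked post. Setting the `mapreduce.job.end-notification.url` property makes Hadoop request that URL when a job ends, substituting `$jobId` and `$jobStatus` if they appear in it. The host, port, path, and the `parse_notification` helper below are hypothetical names chosen for illustration.

```python
from urllib.parse import urlsplit, parse_qs

# Real Hadoop property name (see mapred-default.xml).
END_NOTIFICATION_PROPERTY = "mapreduce.job.end-notification.url"

# Hypothetical receiver URL; Hadoop replaces $jobId and $jobStatus itself.
NOTIFY_URL_TEMPLATE = (
    "http://notify.example.com:8080/done?jobId=$jobId&status=$jobStatus"
)


def parse_notification(request_path):
    """Extract (job_id, status) from the path of the callback request.

    Either value is None if the corresponding query parameter is missing.
    """
    query = parse_qs(urlsplit(request_path).query)
    job_id = query.get("jobId", [None])[0]
    status = query.get("status", [None])[0]
    return job_id, status
```

A small HTTP service (for example one built on Python's `http.server`) could call `parse_notification(self.path)` in its GET handler and then send an email or chat message with the result.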
If you see a message similar to “Log Aggregation not enabled” while checking the details of a YARN application, you can follow the steps below to enable it. This issue occurs … Source: Enabling Log Aggregation in YARN
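The steps themselves are not included in this excerpt. As a minimal sketch, log aggregation in YARN is controlled by the `yarn.log-aggregation-enable` property in `yarn-site.xml`, which defaults to `false`. The helper below (a hypothetical name, `log_aggregation_enabled`) checks whether a given `yarn-site.xml` document turns it on; the sample XML is illustrative, not taken from the linked post.

```python
import xml.etree.ElementTree as ET

PROPERTY_NAME = "yarn.log-aggregation-enable"


def log_aggregation_enabled(yarn_site_xml):
    """Return True if the yarn-site.xml text sets log aggregation to true."""
    root = ET.fromstring(yarn_site_xml)
    for prop in root.findall("property"):
        if prop.findtext("name", "").strip() == PROPERTY_NAME:
            return prop.findtext("value", "").strip().lower() == "true"
    # Property absent: Hadoop's default is false.
    return False


# Illustrative yarn-site.xml content with log aggregation switched on.
SAMPLE = """<configuration>
  <property>
    <name>yarn.log-aggregation-enable</name>
    <value>true</value>
  </property>
</configuration>"""
```

Calling `log_aggregation_enabled(open("/path/to/yarn-site.xml").read())` on a node's own configuration file (the path is a placeholder) shows whether the setting is already present.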
I want to learn Hadoop and Spark. How can I build a cluster for free? Create a free Hadoop/Spark cluster in 5 minutes using http://galacticexchange.io/
How do you calculate a Hadoop cluster growth plan based on storage? This calculation is for a small 3-node Hadoop cluster and assumes an average daily ingest rate of 10 GB per node.
Average daily ingest rate: 10 GB
Replication factor: 3 (copies of each block)
Daily raw consumption: 30 GB (ingest × replication)
Node raw storage: 600 GB
Continue reading “How to calculate hadoop cluster growth plan based on storage”
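The arithmetic in the excerpt above can be sketched as follows. The listing is cut off after the node raw storage figure, so the days-to-full estimate is an assumption added for illustration: it treats the 10 GB figure as the ingest used in the "ingest × replication" line, uses all raw disk on the 3 nodes, and reserves nothing for temporary or non-HDFS data. Function names are hypothetical.

```python
def daily_raw_consumption_gb(ingest_gb, replication):
    """Raw disk consumed per day: ingested data times the replication factor."""
    return ingest_gb * replication


def days_until_full(nodes, node_raw_gb, ingest_gb, replication):
    """Days until the cluster's raw storage is exhausted.

    Assumption (not from the source): no space is reserved for temp data,
    and the ingest rate stays flat.
    """
    total_raw_gb = nodes * node_raw_gb
    return total_raw_gb / daily_raw_consumption_gb(ingest_gb, replication)


# Figures from the excerpt: 10 GB ingest, replication 3, 3 nodes of 600 GB.
print(daily_raw_consumption_gb(10, 3))   # 30
print(days_until_full(3, 600, 10, 3))    # 60.0
```

A real growth plan would subtract a reserve from each node's raw storage before dividing, which shortens the estimate.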