This tutorial shows how to install and configure version 5 of Cloudera’s Distribution Including Apache Hadoop (CDH 5), and how to deploy it on an EC2 cluster. http://www.bogotobogo.com/Hadoop/BigData_hadoop_CDH5_Install.php
Tag Archives: Hadoop
Create multi-node Hadoop or Spark clusters, running in Docker containers
Spin up Hadoop and Spark clusters in minutes. With BlueData EPIC Lite, you’ll have your own personal sandbox to develop and test Big Data analytics. Create multi-node Hadoop or Spark clusters, running in Docker containers. Point to data in your local files or in HDFS and NFS storage. On your laptop. Within minutes. Download* EPIC ...
Cloudera Administration Handbook
A complete, hands-on guide to building and maintaining large Apache Hadoop clusters using Cloudera Manager and CDH5. http://www.rohitmenon.com/index.php/cloudera-administration-handbook/
Installing a virtual Hadoop cluster
Step-by-step instructions for installing a virtual Hadoop cluster with Vagrant and Cloudera Manager. http://dandydev.net/blog/installing-virtual-hadoop-cluster
Self-service BI with Pig, Impala and PowerBI
Source: http://baboonit.be/blog/self-service-bi-with-pig-impala-and-powerbi
Visualisations in PowerBI: Here are some charts that we generated with PowerBI. The nice thing is that you can drill down on any bar, which is ideal for exploring a dataset. You can also easily build an animated chart. In the following example, the delays per airport are shown on a scatter chart ...