– It is comparible with Hadoop – It provides ease of development – It is fast – It provides multiple language support – it has a unified stack
Tag Archives: Spark
The Spark Stack
• Spark SQL: This is Spark’s module for working with structured data, and it is designed to support workloads that combine familiar SQL database queries with more complicated, algorithm-based analytics. Spark SQL supports the open source Hive project, and its SQL-like HiveQL query syntax. Spark SQL also supports JDBC and ODBC connections, enabling a degree of integration withContinue reading “The Spark Stack”
Storage Options for Apache Spark
• MapR (file system and database) • Google Cloud • Amazon S3 • Apache Cassandra • Apache Hadoop (HDFS) • Apache HBase • Apache Hive • Berkeley’s Tachyon project
Spark Deployment Options
Running Spark on YARN Running Spark on Mesos Running Spark on EC2
Programming languages supported by Spark
Java • Python • Scala • SQL • R Spark Examples
You must be logged in to post a comment.