Alteryx and Databricks Drive Adoption of Apache Spark for Scalable Hadoop Analytics

Alteryx and Databricks are collaborating to make Apache Hadoop and Spark accessible for everyday analysts. These companies will become the primary committers to SparkR, a subset of the overall Spark framework. In addition, Alteryx and Databricks are announcing a technology and go-to-market partnership to accelerate the adoption of SparkR and SparkSQL, in order to help analysts get greater value from Spark as the leading open-source in-memory engine. 

Spark is an open source data analytics framework that enables fast and scalable in-memory analytic processing capabilities inside and independent of Hadoop. SparkR is an R package that enables the R programming language to run inside of the Spark framework in order to manipulate the data for analytics. The collaboration between Alteryx and Databricks will foster faster delivery of a market leading in-memory engine for R-based analytics within Hadoop that is available for the Spark community. Together, the companies say, they will work to bring the SparkR package to a 1.0 production version, utilizing a growing array of machine learning algorithms.

"We are focused on becoming the most complete option for data analysts across the Hadoop landscape. Our goal is to empower analysts to utilize data everywhere to make the best analytic decisions possible," said George Mathew, president and COO of Alteryx. "We believe the Apache Spark framework to be the primary method for our customers to achieve scalable, analytic freedom with their Hadoop investment. We're delighted to be driving the new analytic stack with Databricks."

Alteryx will be adopting the Apache Spark framework into a future release of the Alteryx Analytics platform to allow its customers to achieve faster, scalable analytics across all of their data. As an important foundation, Alteryx will support the ability to read and write directly to Hadoop HDFS in an upcoming release to the Alteryx Analytics platform.

Databricks, the company founded by the creators of Apache Spark‹the powerful open-source processing engine that provides blazingly fast and sophisticated analytics has also announced the launch of Databricks Cloud, a cloud platform built around Apache Spark. In addition to this launch, Databricks announced the close of $33 million in series B funding led by New Enterprise Associates (NEA) with follow-on investment from Andreessen Horowitz.

For more information, visit

More information is also available about Databricks.