Databricks Partners with RStudio to Boost Data Science Productivity

Databricks is partnering with RStudio, provider of a free and open-source integrated development environment for R, to increase the productivity of data science teams by integrating Databricks’ Unified Analytics Platform with RStudio Server.

The RStudio and Databricks integration removes the barriers that stop most R-based machine learning and artificial intelligence (AI) projects.

RStudio provides a way for data science teams to analyze data with R through open-source and enterprise-ready tools for the R computing environment. With the two solutions integrated, data scientists can use RStudio from within a Databricks implementation, and data science teams are better positioned to collaborate with data engineering and lines of business to accelerate AI initiatives.

“Unifying data with machine learning continues to be the biggest barrier when building machine learning models. Data science teams use so many technologies and systems to manage data and machine learning – working in silos and hindering the iterative process needed to achieve AI,” said Michael Hoff, senior vice president of business development and partners at Databricks. “Our technology integration with the RStudio Server eliminates the need for data teams to spend valuable time ramping up on new tools. Data scientists can leverage a familiar IDE, quickly access and prepare high quality data sets, and automatically run and execute R workloads at unprecedented scale.”

With the joint solution, data science teams can expect:

  • Increased productivity among data science teams. The integration lets data scientists use familiar tools and languages to run R jobs on Databricks’ Unified Analytics Platform directly from the RStudio IDE.
  • Simplified access to large data sets. Remove barriers to most R-based machine learning and AI projects by bringing data sets together in Databricks’ Unified Analytics Platform while coding in RStudio. Databricks provides scalable data processing to clean, blend, and join data sets in an optimized data format.
  • Distributed R computing at scale. Databricks supports R as a first-class language, offering unprecedented performance as well as the ability to auto-scale cloud-based clusters to handle the most demanding jobs, while keeping the total cost of ownership low.

“Databricks and RStudio share the same mission to make data science teams more productive,” said Tareef Kawaf, president of RStudio. “We’re confident they will appreciate having the combination of Apache Spark and the RStudio Server, or their own RStudio Server Pro, ready to go in the Databricks Unified Analytics Platform.”

For more information about this partnership, visit