Trifacta Works with Databricks to Deliver Automated Pipelines for Data Lakehouses

Trifacta is partnering with Databricks on a joint solution that natively integrates Trifacta’s interactive, visual data preparation capabilities into the Databricks Lakehouse Platform. The new solution accelerates the development and orchestration of data preparation pipelines by removing bottlenecks that can slow data prep for analytics and machine learning models while also tracking data lineage and ensuring sustainable data governance.

With the Trifacta and Databricks platforms integrated, a broader set of data workers can take ownership of the data preparation process and collaborate with data scientists and data engineers to better align data products and pipelines with business goals. The visual, low-code joint solution reduces the burden on engineering teams and shortens an organization’s time-to-insight from data analytics.

“Trifacta offers 1-click data preparation for customers to explore and transform diverse data at scale,” said Michael Hoff, SVP of business development, Databricks. “This helps drive more data into the lakehouse for advanced cloud analytics and machine learning.”

Any transformation executed in Trifacta is translated into runtime Spark code that executes via Databricks, and Databricks automatically scales processing based on the parameters of the transformation job. The result, the companies say, is faster, more reliable data pipelines that work as hard as analysts do.

“Data operations are more tightly synced than ever before; no longer are today’s organizations seeking to create siloed data marts or disjointed data teams. And the partnership between Trifacta and Databricks is a perfect example of that,” said Ash Vijayakanthan, vice president of alliances, Trifacta. “We’ve made it possible for business users to not only involve themselves in data preparation, but also in the creation and management of data pipelines, both of which were once considered highly technical work relegated to IT teams. We’re excited to see our joint customers yield far greater returns with the ability to accelerate the data preparation process on top of cutting-edge processing power.”

Trifacta supports, and Databricks is available on, all major cloud platforms (GCP, Azure, and AWS).

For more information, go to