Fivetran Offers Integration of Amazon S3 for the Automation of the Modern Data Lake

Fivetran, a global leader in automated data movement, is introducing support for Amazon Simple Storage Service (Amazon S3) with the Apache Iceberg data lake format, combining large-volume, cost-effective data storage with simple access and comprehensive compliance.

“We now automatically extract, cleanse, deduplicate, and make ready for analysis large volumes of semi-structured data to power data lakes in the same reliable and secure way our customers get their data into their cloud warehouses today,” said Fraser Harris, vice president of product at Fivetran. “Fivetran and AWS share a vision that without structure, governance, and accuracy of data in a data lake, organizations are unnecessarily increasing complexity and not realizing the full value of the data they store there. Fivetran’s mission is to make access to data as simple and reliable as electricity, and this new support brings that promise to the world of data lakes.”

Rather than manually ingesting data, cleansing it, preparing it for use, hashing and blocking sensitive fields, and only then querying it, modern organizations see great value in reducing data lake management effort through pipeline automation and governance.

Amazon S3 is an object storage service from Amazon Web Services (AWS) that offers industry-leading scalability, data availability, security, and performance.

Apache Iceberg is a widely supported open-source table format that brings atomic, consistent, isolated, and durable (ACID) transactions to data lakes.

Fivetran is an automated data movement platform that anonymizes personally identifiable information (PII) while cleansing, normalizing, and automatically loading data into the lake.
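
As a rough illustration of what normalizing, deduplicating, and hashing or blocking sensitive columns can involve before data lands in a lake, here is a minimal Python sketch. It is not Fivetran's implementation or configuration format: the column policy, the `prepare_rows` helper, the sample rows, and the salt are all hypothetical and chosen only for this example.

```python
import hashlib

# Hypothetical column policy for illustration; not Fivetran's configuration format.
HASHED_COLUMNS = {"email"}   # replaced by a salted SHA-256 digest
BLOCKED_COLUMNS = {"ssn"}    # dropped entirely before loading


def hash_value(value: str, salt: str) -> str:
    """Return a salted SHA-256 hex digest of a value."""
    return hashlib.sha256((salt + value).encode("utf-8")).hexdigest()


def prepare_rows(rows, key, salt):
    """Normalize rows, deduplicate by key, then hash or block sensitive columns."""
    latest = {}
    for row in rows:
        # Normalize: lowercase column names, strip whitespace from string values.
        clean = {
            k.strip().lower(): (v.strip() if isinstance(v, str) else v)
            for k, v in row.items()
        }
        # Deduplicate: a later row replaces an earlier one with the same key.
        latest[clean[key]] = clean

    prepared = []
    for clean in latest.values():
        out = {}
        for col, val in clean.items():
            if col in BLOCKED_COLUMNS:
                continue  # blocked columns never reach the destination
            if col in HASHED_COLUMNS and val is not None:
                val = hash_value(str(val), salt)
            out[col] = val
        prepared.append(out)
    return prepared


# Made-up sample data: the first two rows describe the same record.
rows = [
    {"ID": 1, "Email": " a@example.com ", "SSN": "123-45-6789"},
    {"id": 1, "email": "a@example.com", "ssn": "123-45-6789"},
    {"id": 2, "email": "b@example.com", "ssn": "987-65-4321"},
]
prepared = prepare_rows(rows, key="id", salt="demo-salt")
```

After this runs, `prepared` holds two rows, neither has an `ssn` column, and each `email` value is a 64-character hex digest. A salted hash of this kind keeps values joinable across tables while hiding the raw value; it is pseudonymization rather than full anonymization.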

With expansive storage capacity and support for multiple data formats, the data lake is a popular destination for teams doing analysis on massive data sets or running extensive data science projects that fuel their business.

Hundreds of thousands of data lakes run on top of Amazon S3 and, of the many enterprise teams that have already put them to work, a majority cite enhanced business agility, improved development of products and services, and better customer service and engagement as benefits of data lakes.

“We are delighted that the accessibility of Amazon S3 with Iceberg continues to grow,” said Greg Khairallah, director of analytics at AWS. “It’s an easy way for our customers to simplify data ingestion while providing customers the scalability of a data lake and the reliable data transformation of a data warehouse.”
