Cloudflare is building on its foundation with the launch of the Cloudflare Data Platform, a complete solution for ingesting, storing, and querying analytical data tables.
The platform is made up of three solutions:
- Cloudflare Pipelines receives events sent via Workers or HTTP, transforms them with SQL, and ingests them into Iceberg or as files on R2
- R2 Data Catalog manages the Iceberg metadata and now performs ongoing maintenance, including compaction, to improve query performance
- R2 SQL is Cloudflare’s in-house distributed SQL engine, designed to perform petabyte-scale queries over data in R2
Like all Cloudflare Developer Platform products, it runs on its global compute infrastructure and is built around open standards and interoperability, the company said.
That means that users can bring their own Iceberg query engine—whether that's PyIceberg, DuckDB, or Spark—connect with other platforms such as Databricks and Snowflake—and pay no egress fees to access data.
The Cloudflare Data Platform is available now to ingest events into R2 Data Catalog and query them via R2 SQL.
In the first half of 2026, Cloudflare will expand on the capabilities in all these products, including:
- Integration with Logpush, so users can transform, store, and query their logs directly within Cloudflare
- User-defined functions via Workers, and stateful processing support for streaming transformations
- Expanding the feature set of R2 SQL to cover aggregations and joins
Cloudflare's connectivity cloud protects entire corporate networks, helps customers build Internet-scale applications efficiently, accelerates any website or Internet application, wards off DDoS attacks, keeps hackers at bay, and can help users on their journey to Zero Trust.
For more information about this news, visit www.cloudflare.com.