Teradata to Advance Big Data Analytics with Support for Presto Open Source SQL Query Engine

To help users extract insights from data lakes, Teradata has made a multi-year commitment to contribute to Presto’s open source development. Based on a three-part roadmap, Teradata says its contributions will be 100% open source under the Apache license and will advance Presto’s code base, scalability, iterative querying, and ability to query multiple data repositories.

Presto complements Teradata QueryGrid and fits within the Teradata Unified Data Architecture vision. Presto lets users originate queries directly from their Hadoop platform, while Teradata QueryGrid allows queries to be initiated from the Teradata Database and the Teradata Aster Database, all through a common SQL protocol.

Presto runs on multiple Hadoop distributions and can also reach out from a Hadoop platform to query Cassandra, relational databases, or proprietary data stores. This flexibility lets Presto combine data from multiple sources in a single query, supporting analytics across the entire organization. According to Teradata, this cross-platform capability helps Presto users extract business value from data lakes of any size, from gigabytes to petabytes.
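As a rough illustration of what a single query across sources can look like, the sketch below builds a SQL statement that joins a table reached through a Hive catalog with one reached through a Cassandra catalog, using Presto's catalog.schema.table naming. The catalog, schema, table, and column names are hypothetical and do not come from Teradata's announcement; the code only assembles the query text and does not connect to a cluster.

```python
def qualified(catalog, schema, table):
    """Return a Presto-style fully qualified name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def build_federated_query():
    """Assemble one SQL statement that joins tables from two different catalogs."""
    # Hypothetical names, chosen only for illustration.
    orders = qualified("hive", "sales", "orders")
    profiles = qualified("cassandra", "crm", "profiles")
    return (
        "SELECT p.region, SUM(o.amount) AS total_amount\n"
        f"FROM {orders} AS o\n"
        f"JOIN {profiles} AS p ON o.customer_id = p.customer_id\n"
        "GROUP BY p.region"
    )


query = build_federated_query()
print(query)
```

Because each table name carries its catalog prefix, the query itself states which data store each table comes from.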

In addition to announcing support for Presto, Teradata made two other Hadoop-related announcements in connection with Hadoop Summit this week: a new release of RainStor that integrates into the Teradata UDA, and a new release of Teradata Loom that provides stronger authentication and security.

Developed and used by Facebook, Presto is a SQL query engine that supports big data analytics. According to Teradata, there is growing interest in Presto, which has already been adopted by Airbnb, Dropbox, Gree, Groupon, and Netflix.

According to Teradata, its support includes a first phase to enhance essential features that simplify adoption of Presto. The Phase 1 capabilities are available today for download at or on GitHub. The second phase, which will be completed by the end of 2015, is to integrate Presto with other key parts of the big data ecosystem, including standard Hadoop distribution management tools, interoperability with YARN, and connectors that extend Presto’s capabilities beyond the Hadoop Distributed File System (HDFS). The third phase, which will be available in 2016, will enable ODBC (Open Database Connectivity) and JDBC (Java Database Connectivity) to expand adoption within organizations and enhance integration with business intelligence tools.

In addition to its open source contributions, Teradata now offers commercial support through Think Big, a Teradata company acquired in 2014. Think Big will offer its expertise in three areas. With Presto Jumpstart, it will assist companies with piloting new functionality, and with Presto Development, Think Big consultants will help customers design, build, and deploy a Presto solution. In addition, the company will offer the Think Big Academy, a series of 2-day workshops to help customers understand the best uses and criteria for architectural decisions.