Cloudera Enterprise 5.7 Boosts Data Processing with Hive-on-Spark Support

Cloudera, provider of a data management and analytics platform built on Apache Hadoop and open source technologies, has announced the general availability of Cloudera Enterprise 5.7. According to the vendor, the new release offers an average 3x improvement for data processing with added support of Hive-on-Spark, and an average 2x improvement for business intelligence analytics with updates to Apache Impala (incubating). The new release also adds visibility into multi-tenant usage across these workloads for management efficiency and optimal resourcing.

The advancement of data engineering and ETL development with Hive-on-Spark marks a critical milestone in Hadoop evolution - further solidifying Spark’s status as the standard data processing engine in Hadoop, according to Charles Zedlewski, vice president, products at Cloudera.  With the 5.7 release, he observed, Cloudera customers can better enable a wide range of users across the platform, while also maintaining “fast performance, easy management, and compliance-ready security.”

Last year, Cloudera launched the One Platform Initiative as the roadmap to complete the transition from MapReduce to Spark and is leading development to better integrate Spark with Hadoop - ensuring it meets the enterprise requirements for larges-scale production workloads. The release of Hive-on-Spark in Cloudera 5.7 brings Spark one step closer as developers can now leverage the data processing capabilities of Spark, while continuing to use familiar Hive, and delivers a 3x performance improvement on average.

Cloudera, IBM, Intel, MapR, and others, and involved customers across a range of industries launched Hive-on-Spark, a community-driven initiative as part of an early access program for further development.

For further consistency, Cloudera has worked with more than 2,300 of its partners to ensure customers can continue to use the data integration and preparation tools with Hive-on-Spark, without disrupting the business. Partners including BMC, ClearStory Data, Elastic, NGDATA, Solix, Trillium Software, Zementis, and others are working with Cloudera to certify their technologies for a smooth transition.

For more information about Cloudera 5.7 - now available on - read the Release Notes.

Image courtesy of Shutterstock.


Subscribe to Big Data Quarterly E-Edition