Hortonworks Announces YARN-Based Architecture of Hortonworks Data Platform 2.0

Hortonworks has announced the availability of Hortonworks Data Platform 2.0, a commercial distribution built on the Hadoop 2 GA release from the Apache Software Foundation. Hortonworks Data Platform 2.0 delivers the YARN-based architecture of Hadoop 2 and includes the latest innovations from the broader Hadoop ecosystem in a single integrated and tested platform.

“The YARN component acts as the operating system for new types of data processing engines to plug in and take advantage of Hadoop’s availability,” said Shaun Connolly, vice president of corporate strategy, Hortonworks. “YARN’s goal is to enable a wide range of workloads to be able to run and co-exist within the same platform. That way users’ batch jobs, interactive jobs and real-time streaming jobs can be managed and co-exist within the same Hadoop cluster.”

Phase 2 of the Stinger Initiative is also included in Hortonworks Data Platform 2.0. The Stinger Initiative is designed to enhance the speed, scale and breadth of SQL semantics supported by the recently released Apache Hive 0.12. “A popular topic in the Hadoop landscape has been the ability to run SQL-style workloads within the Hadoop platform, and Apache Hive has been that component that enables people to issue standard SQL commands to the Hadoop platform and get the data back in a very similar way,” explained Connolly. This release signals progress on both performance and SQL compliance.

While the arrival of Hadoop 2 in the mainstream enterprise is a key element of Hortonworks Data Platform 2.0, HDP 2.0 also includes enhancements that improve the flexibility and integration of Hadoop, and it incorporates a wave of innovations from related components in the broader Hadoop platform and related Apache projects.

For more information on the Hortonworks Data Platform 2.0, visit the Hortonworks website at