Pentaho 6.1 Speeds Data Pipeline for Traditional and Big Data Environments

Bookmark and Share

Pentaho has made enhancements to its metadata injection feature set to reduce development time for data onboarding in both traditional and Hadoop environments. According to the vendor, Pentaho 6.1 accelerates the big data analytics pipeline by simplifying the onboarding process, while adding capabilities to drive more agile and automated big data analytics at scale.

The problem for companies today is that big data onboarding is more than just data loading or movement. It includes management of a changing array of data sources, as well as the need to capture of metadata. At the same time, processes must be repeatable at scale while control and governance are maintained. All these challenges are exacerbated in big data environments like Hadoop.

According to Pentaho, modern data onboarding is a repetitive, manual, and risk-prone process that creates a bottleneck in the data pipeline, and, up until now, organizations could only ease that bottleneck by deploying internal data engineering resources. Pentaho 6.1, the company says, provides IT professionals with a wider array of options for dynamically passing metadata to Pentaho Data Integration at run time to control complex transformation logic.

In addition, Pentaho has introduced a new self-service data onboarding blueprint, an architected process that enables business users and customers to onboard a variety of data themselves—without IT assistance—streamlining the data ingestion process.  At the foundation of this latest Pentaho blueprint is its metadata injection functionality.

Pentaho 6.1 also adds enhancements to its data integration and analytics platform to help enterprises scale their data pipelines to accommodate the greater volume, variety, and complexity of data. Improvements to Pentaho’s inline modeling capabilities help derive and share new business metrics among users, and new data service improvements allow users to produce virtual datasets across a wider array of data blends, and automatically model and publish analytic data without the need for a physical table.

Pentaho, a Hitachi Group Company, has more information about the new release available here.