Syncsort Expands Integration with Cloudera

Syncsort is expanding integration between DMX-h and Cloudera Navigator, enabling data governance practitioners to view detailed data lineage information on enterprise-wide data, as well as changes to that data both outside and within the Hadoop cluster.

“With the maturity of Hadoop as an enterprise data platform, organizations are using it to store and process significantly more data, and, in turn, more users and tools are accessing the data. The opportunity to drive greater insights is remarkable, but the volumes, diverse data sources and hybrid environments create a big governance challenge,” said Tendü Yogurtçu, CTO, Syncsort. “Cloudera recognized these challenges early on and developed Cloudera Navigator to address them. Syncsort’s DMX-h data integration seamlessly integrates with Cloudera Navigator to deliver detailed data lineage information regardless of whether the data movement and transformation process was run inside or outside of Hadoop, on-premise or in the cloud. The new solution helps enterprises integrate their entire data ecosystem with Cloudera Navigator, while meeting regulatory compliance requirements.”

The new integrated solution provides detailed data lineage information on:

  • Enterprise-Wide Data Coming from Outside the Hadoop Cluster: DMX-h now leverages field-level metadata to provide detailed data lineage information, captured on the fly, on everything that happens to the data as DMX-h consumes it from diverse data sources, transforms it and delivers it to the Cloudera Enterprise Data Hub.
  • Data Residing Inside the Hadoop Cluster: Cloudera Navigator tracks field-level metadata on data lineage in the cluster.
  • Changes to Data Inside the Hadoop Cluster: DMX-h is also used for data integration within the cluster. ETL jobs created in the DMX-h point-and-click interface can be run on MapReduce, Spark, or stand-alone Windows/Linux/Unix systems, on-premise or in the cloud. All data processing details are published to Cloudera Navigator.
  • Data Inside and Outside the Cluster in One Consolidated View: The integrated solution provides a consolidated end-to-end view of all the detailed data lineage information from Syncsort DMX-h and Cloudera Navigator in the Cloudera Navigator Dashboard. The solution connects Hadoop-based governance with all enterprise data for superior audit, data lineage, metadata management and policy enforcement capabilities, on-premise or in the cloud, including complex hybrid environments.