Informatica Corporation, an independent provider of data integration software and Cloudera, a provider of Apache Hadoop-based data management software and services, yesterday announced at Informatica World 2010 that the two companies are partnering to provide customers with the solutions needed to address the challenges associated with managing large-scale data, including structured, complex and social data. Together, Informatica and Cloudera say they intend to bring the productivity benefits of the Informatica Platform to the data-intensive distributed computing capability of Hadoop.
"During the past decade, data volumes, ranging from the unstructured formats of social media sites, to the structured formats within databases and applications have exploded," said Girish Pancha, executive vice president, Data Integration, Informatica. "It has become increasingly clear that organizations are struggling to make sense of and leverage the vast amount of both structured data as well as social data that is captured and processed. Now, our customers will be able to leverage their data of virtually any scale."
Informatica connectivity with Hadoop will leverage the Sqoop interfaces to provide a highly optimized and scalable connector from the Informatica Platform to the Hadoop Distributed File System. In addition the extension of the Informatica Platform to support hybrid deployment on Hadoop will enable data integration mappings defined within the Informatica development environment to be intelligently converted into a combination of MapReduce functions and User Defined Functions (UDFs) for execution on the data-intensive distributed computing environment of Hadoop.
For more details about Informatica, go here.
For more about Cloudera, go here.