Five Minute Briefing - Information Management
November 6, 2012

A concise weekly report with key product news, market research and insight for data management professionals and IT executives.

News Flashes

Open source software vendor Talend announced that it has added big data profiling for Apache Hadoop and support for NoSQL databases in the upcoming release of its integration platform, Talend v5.2. Data profiling, the process of evaluating the character and condition of data stored across the enterprise, is a critical step toward gaining control over organizational data, and is emerging as a big data best practice. "Profiling allows you to understand what you have in your Hadoop cluster and how this data can be used for your big data integration and management project," Yves de Montcheuil, Talend's vice president of marketing, tells 5 Minute Briefing.

Big data is here, offering both vast opportunities and vexing challenges for every organization it touches. For a number of years, it has been understood that to be of value, information needs to be readily available, as close to real time as possible, to users in any location. Now, with the onset of "big data," the task gets more daunting. "These are all increasing the demands on both transactional and analytics data systems," says Bernie Spang, director of database software and systems for IBM.

Cloudera, provider of Apache Hadoop-based software and services, announced the first big data management solution that allows batch and real-time operations on any type of data within one scalable system. Cloudera Enterprise Real-Time Query (RTQ), powered by Cloudera Impala, improves the economics and performance of large-scale enterprise data management, allowing organizations to process data at petabyte scale and interact with that data in real time, all on the same system.

Informatica Corporation has introduced Informatica PowerCenter Big Data Edition, a solution for organizations supplementing transactional data with social, mobile, cloud and machine data at high velocity, volume and variety. PowerCenter Big Data Edition works with emerging technologies like Hadoop as well as traditional data management infrastructure to leverage organizations' big data while reducing costs and risks.

Think About It

Big data is generating a lot of industry buzz, and it is a game-changing paradigm, but organizations still need to govern it the same way they govern traditional types of data, contends Sunil Soares, director of Information Governance at IBM.

The beauty of a truly wonderful database design is its ability to serve many masters. And good database designers are able to empathize with those who will use their designs. In business intelligence settings, three perspectives deserve consideration when composing designs.