Data Quality Articles
Talend unveiled a new Quick Start solution for deploying cloud data lakes on Amazon Web Services (AWS) platforms, allowing organizations to get data lakes up and running fast. Available for download immediately, the Quick Start automates the building of data lake environments by deploying Talend Big Data Integration components and AWS services such as Amazon EMR, Amazon Redshift, Amazon Simple Storage Service (Amazon S3), and Amazon Relational Database Service (Amazon RDS).
Posted December 01, 2017
There never has been a more interesting time to be involved in the data management field. Data not only has become "the new oil" but is also the catalyst that is powering organizations to new heights of success. The past year has seen the rise of powerful analytics and an embrace of new tools and platforms emerging to more effectively tap into the power that data offers. DBTA reached out to industry experts to document the most important trends shaping data management in 2018.
Posted December 01, 2017
The importance of data to today's modern world becomes more and more clear every day. Organizations are creating, storing, gathering, and managing more data than ever before. If you are reading this article, chances are, you will agree with this statement: "You are managing more data this year than you did last year … and your organization is planning to manage even more data next year."
Posted December 01, 2017
Ongroup's MVON# platform can now horizontally scale implementations of MultiValue. MVON# gives MV# developers the tools to write software in MV, transpile to C# and run it in the .NET Common Language Runtime.
Posted December 01, 2017
TimeXtender, a recognized global software company enabling self-service BI and analytics, is upgrading its Discovery Hub platform, adding a shared semantic layer that enables users to use the same language across platforms. The update also provides different perspectives—grouping together related data for area-specific purposes, for example, finance, sales, marketing, or cross-business analytics.
Posted November 30, 2017
Confluent, provider of the streaming platform based on Apache Kafka, is releasing Confluent Cloud, allowing users to manage large streaming environments. The fully managed service helps companies deploy their Customer 360, fleet management, fraud detection, and other real-time, large-scale initiatives.
Posted November 29, 2017
Melissa, a provider of global contact data quality and identity verification solutions, has expanded its partnership with Runner Technologies (Runner EDQ), a provider of integrated address verification solutions and Platinum level member of the Oracle Partner Network (OPN). Together,, the two firms are rolling out CLEAN_Address, an Oracle-validated, integrated address verification solution for PeopleSoft Enterprise, JD Edwards EnterpriseOne, and E-Business Suite platforms.
Posted November 14, 2017
IBM is releasing new offerings to its Watson Data Platform, introducing data cataloging and data refining capabilities designed to make it easier for developers and data scientists to analyze and prepare enterprise data for AI applications. By improving data visibility and helping to better enforce data security policies, users can now connect and share data across public and private cloud environments.
Posted November 08, 2017
Real-time data delivery represents the next frontier of intelligent enterprises, and there is great potential value in the ability to immediately sense and respond to opportunities and threats. At the same time, enterprises are encumbered by existing or legacy technologies and methodologies that may add latency to their data-delivery efforts. What will a real-time enterprise look like?
Posted November 01, 2017
The way MarkLogic CEO Gary Bloom sees it, interest in artificial intelligence is soaring: Everyone wants to talk about it and everyone wants to apply intelligence for better insights and better decisions. But there is just one problem.
Posted October 25, 2017
Semarchy has added new capabilities in its xDM platform, a master data management (MDM) that leverages smart algorithms and material design to simplify data stewardship, governance, and integration
Posted October 11, 2017
MapR Technologies is partnering with DataScience.com, an enterprise data science platform provider, to launch a joint solution that will power collaborative data science projects. With this partnership, DataScience.com customers can enjoy a truly collaborative workflow environment where their data science experiments can run directly on the MapR Platform without needing a separate compute cluster to access data.
Posted October 06, 2017
Data professionals and vendors converged at Strata Data in New York to trade tips and tricks for handling big data. Top of mind for most was the impact of machine learning and how it's continuing to evolve as the "next big thing."
Posted October 05, 2017
Dremio launched its data analytics platform in July and at Strata Data Conference in New York the company had the opportunity to showcase what the company can do. The company's mission is to cut out the need for traditional ETL, data warehouses, cubes, and aggregation tables, as well as the infrastructure in order to enable users to be independent and self-directed in their use of data, thereby accelerating time to insight.
Posted October 03, 2017
Zaloni is introducing a machine learning data matching engine that leverages the company's data lake solution, enabling enriched data views for multiple use cases across business sectors. Zaloni's data matching engine provides a new approach for creating an integrated, consistent view of data that is updated, efficiently maintained, and can drive customer-facing applications.
Posted October 02, 2017
According to a new survey from SAS, less than half (45%) of respondents have a structured plan in place for compliance with the EU's new General Data Protection Regulation (GDPR) and more than half (58%) indicate that their organizations are not fully aware of the consequences of noncompliance.
Posted October 02, 2017
BlueData, provider of a Big-Data-as-a-Service (BDaaS) software platform, is enhancing its BlueData EPIC platform and extending the solution to Google Cloud Platform (GCP) and Microsoft Azure. This release adds new innovations and options for running Hadoop, Spark, and other Big Data workloads on Docker containers -- delivering on the requirements from its rapidly growing customer base, including many of the world's largest enterprises across multiple industries.
Posted September 28, 2017
Regardless of industry, the ability to collect, manage, and intelligently leverage data will clearly be a differentiator for the foreseeable future. Executives in healthcare are acutely aware of the disruption being driven by this new paradigm and understand that this trend is impacting every sector, from banking to farming to manufacturing. Ultimately, investing time and resources in data collection and analysis is only valuable if it provides insight for making proactive, tactical decisions. Innovative companies today are using big data and analytics to drive attributable revenue and compete more effectively.
Posted September 27, 2017
Melissa, a provider of global contact data quality and identity verification solutions, has added geographic risk data to its location intelligence tools and services. Through a partnership with HazardHub, a provider of geographic risk datasets for U.S. properties, natural hazard data such as wind, fire, water, or earthquake risk can be associated with specific properties to enable a greater level of risk awareness and location intelligence. Coupled with Melissa's property and mortgage data enhancements, users have an end-to-end property intelligence solution that provides enriched property data supported by precise risk scores.
Posted September 26, 2017
TigerGraph is emerging from stealth, securing of $31M in Series A funding and launching TigerGraph - a native parallel graph database platform for enterprise applications along with the availability of both its Cloud Service and GraphStudio, TigerGraph's visual software development kit (SDK).
Posted September 20, 2017
Syncsort and ASG Technologies are combining their solutions and expertise in data quality and data governance to help companies with both data governance and regulatory compliance.
Posted September 13, 2017
On Sunday at Oracle OpenWorld, several Oracle user groups, including the IOUG, will bring the experiences of our users and experts to San Francisco and share with thousands of our peers. If you're coming to OpenWorld, I can't say enough about how important it is to participate in the Sunday Program.
Posted September 06, 2017
Quest Software, a global systems management and security software provider, is releasing Toad Edge, a new commercial database toolset that can manage next-generation open source database environments. This release will support MySQL, saving time, minimizing the MySQL learning curve, and mitigating risks that can be associated with building applications on an open source database platform.
Posted August 23, 2017
Centerbridge Partners, L.P., a private investment firm, has completed the $1.26 billion acquisition of enterprise software providers Syncsort Incorporated and Vision Solutions, Inc. from affiliates of Clearlake Capital Group, L.P. Headquartered in Pearl River, NY, the new company benefits from a dramatic increase in global presence, as well as significantly expanded product offerings, afforded by the combination.
Posted August 18, 2017
Referential integrity (RI) is a method for ensuring the "correctness" of data within a DBMS. People tend to oversimplify RI, stating that it is merely the identification of relationships between relational tables. It is actually much more than this. RI embodies the integrity and usability of a relationship by establishing rules that govern that relationship.
Posted August 09, 2017
BackOffice Associates, a provider of information governance, data stewardship, and data migration solutions, has announced that Bridge Growth Partners, a private equity firm, has signed an agreement to make a majority equity investment in the company.
Posted August 03, 2017
With more data streaming in from more sources, in more varieties, and being used more broadly than ever by more constituents, ensuring high data quality is becoming an enterprise imperative. In fact, as data is increasingly appreciated as the most valuable asset a company can have, says DBTA columnist Craig S. Mullins, data integrity is not just an important thing; it's the only thing. If the data is wrong, then there is no reason to even keep it, says Mullins.
Posted August 02, 2017
TIBCO Software, a provider of integration, API management, and analytics, is acquiring nanoscale.io, a provider of microservices technology and tooling. The acquisition extends and enhances TIBCO's leadership in the development of microservices and APIs that connect and integrate with a more expansive on-premises or hybrid cloud environment, and bolster its Connected Intelligence platform.
Posted July 27, 2017
It's been long acknowledged that data is the most precious commodity of the 21st-century business, and that all efforts and resources need to be dedicated to the acquisition and care of this resource. Lately, however, executives have become enamored with the vision of transforming their organizations into "data-driven" enterprises, which move forward into the future on data-supported insights. So, what, exactly, does the ideal "data-driven enterprise" look like?
Posted July 27, 2017
The data manager now sits in the center of a revolution swirling about enterprises. In today's up-and-down global economy, opportunities and threats are coming in from a number of directions. Business leaders recognize that the key to success in hyper-competitive markets is the ability to leverage data to draw insights that predict and provide prescriptive action to stay ahead of markets and customer preferences. For that, they need to keep up with the latest solutions and approaches in data management. Here are 12 of the key technologies turning heads—or potentially opening enterprise wallets—in today's data centers.
Posted July 19, 2017
Dell EMC has announced a new platform for implementing and sustaining a hybrid cloud based on Microsoft Azure Stack. The new Dell EMC Cloud for Microsoft Azure Stack is aimed at helping organizations standardize on the Microsoft Azure ecosystem and provides automated IT service delivery for traditional and cloud-native applications.
Posted June 29, 2017
New and emerging vendors offer fresh ways of dealing with data management and analytics challenges in areas such as data as a service, security as a service, cloud in a box, and data visualization. Here, DBTA looks at the 10 companies whose approaches we think are worth watching.
Posted June 16, 2017
BlueData, provider of Big-Data-as-a-Service (BDaaS) software, is releasing an enhanced version of BlueData EPIC, providing increased scalability, enhanced networking, and significant security and performance optimizations. With BlueData EPIC 3.0, enterprises can quickly and easily deploy large-scale production environments for big data analytics and data science running in Docker containers - either on-premises, in the public cloud, or in a hybrid architecture.
Posted June 16, 2017
Melissa, a provider of global data quality and identity verification solutions, has achieved certification in the EU-U.S. Privacy Shield Framework, which establishes principles to ensure privacy of customer data shared in the process of transatlantic commerce. According to Melissa, this certification supports Melissa's ongoing commitment to the security, privacy, and availability of customer data for a worldwide clientele.
Posted June 13, 2017
Syncsort, a provider of data integration solutions for next-generation analytics, has announced new solutions that bring together its industry-leading big data integration and recently acquired Trillium data quality software to address data governance and customer 360 initiatives within data lakes.
Posted June 06, 2017
Trifacta, a provider of data wrangling software, is launching the Spring '17 Wrangler Enterprise release, focusing on accelerating the expansion of data wrangling projects in enterprise environments. With the Spring '17 release, Trifacta now provides enhanced features that meet the growing needs for deploying data wrangling solutions at enterprise-wide scale.
Posted June 01, 2017
Qubole, the big data-as-a-service company , is building an autonomous data platform that will include Qubole Data Service (QDS) Community Edition, QDS Enterprise Edition, and QDS Cloud Agents. The solution can intelligently automate and analyze platform usage to make data teams more effective.
Posted May 26, 2017
Tricentis, a provider of software test automation, and Panaya, a provider of SaaS-based test management, are releasing a joint platform for autonomous software testing in SAP environments. Autonomous SAP testing generates SAP business process tests automatically, diving deep into applications and using machine learning to observe and understand patterns in end user transactions.
Posted May 24, 2017
Informatica, a provider of solutions for enterprise data management, has unveiled a metadata-driven artificial intelligence technology called "CLAIRE" with the latest release of the Informatica Intelligent Data Platform.
Posted May 23, 2017
You will often hear experienced practitioners and consultants suggest that there is both an art and a science to effective data governance. The art is in the details of fine-tuning a data governance program to fit your culture and address specific business needs. But the fundamental principles of data governance are best understood and executed through science.
Posted May 15, 2017
What are the enabling technologies that make enterprise architecture what it is today? There are a range of new-generation technologies and approaches shaping today's data environments. The key is putting them all together to help enterprise architecture fit into the enterprise's vision of itself as a data-driven organization. Tools and technologies emerging within today's data-driven enterprise include cloud, data lakes, real-time analytics, microservices, containers, Spark, Hadoop, and open source trends.
Posted May 15, 2017
Oracle has introduced new artificial intelligence-based customer experience applications to support B2B and B2C interactions. Helping organizations to avoid the need for additional processes and integrations, the applications are intended to allow organizations achieve immediate value and embrace more efficient approaches, according to Jack Berkowitz, vice president, Products & Data Science, Oracle Adaptive Intelligence.
Posted May 03, 2017
Cloudera, which last week began trading on the New York Stock Exchange under the symbol "CLDR," has announced the general availability of the Cloudera Data Science Workbench, a self-service tool for data scientists. The workbench, which was announced in beta at Strata+Hadoop World San Jose 2017, enables fast, easy and secure self-service data science for the enterprise.
Posted May 02, 2017