The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.

Hadoop Articles

Syncsort, a provider of "Big Iron to Big Data" software, is releasing new innovations in its Ironstream data integration software. The new updates include the ability to deliver mainframe log and application data in real-time directly to Elastic Logstash.

Posted March 21, 2018

Database types are flourishing, so much so that those who tout "polyglot persistence" insist that one size cannot fit all and focus on integrating multiple data stores. However, multi-model databases are also on the rise and may be a better fit than using multiple data stores. Jeff Fried, CTO of BA Insight, will discuss the pros and cons of both approaches at Data Summit 2018 during his session titled "Polyglot Persistence Versus Multi-Model Databases."

Posted March 14, 2018

BMC, a provider of IT solutions for the digital enterprise, is expanding its Control-M Managed File Transfer offering to include support for all file transfers from a single automation platform. With BMC's Control-M solution, companies have instant visibility into the status of file transfers and business application workloads.

Posted March 09, 2018

With data lakes being so new to organizations, an early failure can significantly set back the opportunity to fundamentally transform analytics. However, while the potential for big data innovation is significant, organizations are mired in slow, manual, and time-consuming processes for managing the tasks that turn raw big data into relevant business assets and insights. Without addressing these challenges in a systematic way, organizations find that data lake projects turn into labor-intensive, complex endeavors.

Posted March 08, 2018

It's time to submit nominations for the annual Database Trends and Applications Readers' Choice Awards Program in which the winning information management solutions, products, and services are chosen by you—the people who actually use them. This year, there are 29 categories in which to submit nominations.

Posted March 07, 2018

Startups with a mission to address an unanswered problem continue to emerge in the tech sector. These companies tap into new and still emerging technologies such as cloud, IoT, machine learning, containerization, AI, NoSQL, and blockchain to provide fresh solutions that simply were not possible before.

Posted March 07, 2018

MapR Technologies has extended advanced containers integration into the MapR Converged Data Platform. The company is enabling the deployment of stateful applications with its Data Fabric for Kubernetes, which provides persistent storage and full Kubernetes support with volume access. Persistent storage for stateful containers is a well-known problem in the container world, said Jack Norris, SVP of Data & Applications at MapR, noting that MapR is providing an "elegantly simple way" to solve that problem that is scalable, fast, and secure.

Posted March 06, 2018

Registration is now open for Data Summit 2018 presented by DBTA and Big Data Quarterly. For a limited time, super early bird pricing is also available for the conference which will take place May 22 - 23, 2018 at the Hyatt Regency Boston.

Posted February 28, 2018

Worldwide cognitive systems and artificial intelligence revenue has been forecast to grow beyond $47 billion in 2020, dramatically changing how we live and work. Will you be ready? Cognitive Computing Summit is coming to Boston in May to provide a 2-day immersion into the cognitive computing use cases, strategies, and software that every organization should know about now.

Posted February 09, 2018

It's no secret that data management has changed dramatically in the nearly 10 years since the onslaught of "big data." Nonetheless, experts agree, there has never been a better time to be a DBA—provided you are willing to continue learning and growing and to step in to fill gaps where they open up.

Posted February 05, 2018

An astounding array of new technologies and approaches have emerged on the database scene over the past few years that promise to turn the next 12 months into a time of unprecedented transformation for the database landscape. There are new developments, along with reinforcement of tried-and-true technologies, some of which may help make the jobs of data managers just a bit easier.

Posted February 01, 2018

As we enter a new year, new trends will take their turn in the spotlight. This is no different for the big data landscape. We have come a long way since the term "big data" swept the business world off its feet as the next frontier for innovation, competition, and productivity. Hadoop, NoSQL, and Spark have become members of the enterprise IT landscape, data lakes have evolved as a real strategy, and migration to the cloud has accelerated across service and deployment models.

Posted January 25, 2018

TimeXtender, a provider of self-service BI and analytics, is now a certified technology partner with Tableau Software, enabling customers to make business decisions on the fly. TimeXtender was selected as a new partner as its Discovery Hub works seamlessly with Tableau, allowing customers to make business decisions in a timely manner based on data that is important to them.

Posted January 25, 2018

Quest Software, a global systems management and security software provider, is making three new updates to the Toad product family, including Toad Edge v1.2, Toad Data Point v4.3 and Toad Intelligence Central v4.3. The new release of Toad Edge simplifies the development and management of next-generation open source database platforms, with added support for MariaDB and MySQL instances running on Microsoft Azure.

Posted January 25, 2018

Modern data warehousing is being shaped not only by the need for businesses to deliver data faster to more users, but also by the need for a richer picture of their operations afforded by a greater variety of data for analysis. A growing number of organizations are modifying their data warehouse infrastructures with new technologies, from in-memory databases to Hadoop - and a flourishing market of cloud solutions.

Posted January 08, 2018

Today's modern enterprise data warehouse (EDW), which has been in existence for more than 20 years, faces a growing set of challenges. The platform must incorporate data integration, quality, and governance effectively.

Posted January 05, 2018

In a world where new technologies are often presented to the industry as rainbows and unicorns, there is always someone in a cubicle trying to figure out how to solve business problems and just make these great new technologies work together. The truth is that all of these technologies take time to learn, and it also takes time to identify the problems that can be solved by each of them.

Posted January 05, 2018

With a new year come new ideas on how to disrupt the big data industry. From tried-and-true methods to the introduction of new solutions, several experts are predicting a surge in both old and new solutions, along with the rise of different roles that will power enterprises through 2018.

Posted January 04, 2018

Data lakes are often viewed as the ultimate silo breakers, integrating mountains of data from point solutions for ERP, CRM, enterprise data warehouse, cloud and on-premises applications. However, if the enterprise data lake is not leveraged appropriately, it often ends up being just a data dump or, worse still, a "data swamp."

Posted January 02, 2018

RedPoint Global, a provider of data management and customer engagement technology, is updating its RedPoint Data Management solution within the RedPoint Customer Data Platform. With RedPoint Data Management 8.0, organizations can harness massive amounts of data from an ever-growing number of touchpoints to create a truly unified customer profile - all in an expanded open garden environment.

Posted December 11, 2017

IBM is launching its next-generation Power Systems Servers incorporating its newly designed POWER9 processor, advancing performance improvements across popular AI frameworks. Built specifically for compute-intensive AI workloads, the new POWER9 systems are capable of improving the training times of deep learning frameworks by nearly which allows enterprises to build more accurate AI applications.

Posted December 05, 2017

There never has been a more interesting time to be involved in the data management field. Data not only has become "the new oil" but is also the catalyst that is powering organizations to new heights of success. The past year has seen the rise of powerful analytics and an embrace of new tools and platforms emerging to more effectively tap into the power that data offers. DBTA reached out to industry experts to document the most important trends shaping data management in 2018.

Posted December 01, 2017

Over the past few years, the scale, speed, and power of analytics have been dramatically transformed. The amount of data available from the internet, combined with advances in software to make use of it, has created a practice called "big data analytics." It can provide types of information that were not available in the recent past and it has the potential to do so in real-time.

Posted December 01, 2017

Data integration can seem like a never-ending quest as organizations try to combine and access data from disparate applications and sources. But as we move beyond relational as the only DBMS type that matters and embrace NoSQL and Hadoop data platforms, data integration can become more challenging and require new tools and approaches to achieve success.

Posted November 28, 2017

MapR Technologies, a pioneer in delivering one platform for all data, across every cloud, has announced the availability of the MapR Converged Data Platform 6.0, with new advancements to help organizations achieve greater value from their data through DataOps teams. The major system update from MapR includes innovations that automate platform health and security, and a database for next-generation applications.

Posted November 21, 2017

No longer the stuff of science fiction, the business uses for cognitive computing, artificial intelligence, and machine learning today include fields as diverse as medicine, marketing, defense, energy, and agriculture. Enabling these applications is the vast amount of data that companies are collecting from machine sensors, instruments, and websites and the ability to support smarter solutions with faster data processing.

Posted November 13, 2017

Hortonworks, which recently announced DataPlane Service (DPS), a product designed to address the new paradigm of data management, has announced the first generally available extensible service that DPS will support—Data Lifecycle Manager (DLM). DLM 1.0 is a hybrid cloud-focused solution that offers disaster recovery and replication with auto-tiering and backup and restore.

Posted October 31, 2017

Syncsort and CA Technologies have formed a partnership. The new integration between Syncsort DMX-h and CA Datacom and CA IDMS bridges the "big iron to big data gap," enabling enterprises to tap into valuable mainframe data and make it accessible to emerging next-generation platforms.

Posted October 26, 2017

MapR Technologies has introduced the MapR Data Science Refinery, a new solution that allows data scientists to access and analyze all data in place, and to collaborate on, build, and deploy machine learning models on the MapR Converged Data Platform.

Posted October 25, 2017

Bitwise, a data management services company, announced the launch of its Hadoop Adaptor for Mainframe Data, intended for converting any mainframe data in EBCDIC format to Hadoop-friendly formats such as ASCII, Avro, and Parquet. The data conversion solution addresses compatibility issues of ingesting mainframe data into the Hadoop data lake for advanced analytics, combining mainframe data with any other data sources in the data lake, and achieving faster analysis on mainframe data in Hadoop.

Posted October 23, 2017
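
To make the EBCDIC-to-ASCII step mentioned above concrete, here is a minimal sketch in Python. It is not Bitwise's adaptor and says nothing about how that product works; it only assumes the input uses the common `cp037` EBCDIC code page (real mainframe data may use a different one) and a hypothetical fixed-width record layout. The function names and the sample record are invented for illustration.

```python
# Illustrative sketch only: decode EBCDIC bytes with Python's built-in
# "cp037" codec. The code page and the record layout are assumptions.

def ebcdic_to_text(raw, codepage="cp037"):
    """Decode EBCDIC-encoded bytes into a Python string."""
    return raw.decode(codepage)


def split_fixed_width(record, widths):
    """Split a decoded record into fields of the given (assumed) widths."""
    fields = []
    pos = 0
    for width in widths:
        fields.append(record[pos:pos + width].strip())
        pos += width
    return fields


# Hypothetical 15-byte record: a 10-character name and a 5-digit amount.
sample = "HELLO     00042".encode("cp037")
text = ebcdic_to_text(sample)
print(split_fixed_width(text, [10, 5]))
```

Packed-decimal (COMP-3) and binary fields, which are common in mainframe records, cannot be handled by a plain character decode and are outside the scope of this sketch.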

Hadoop adoption in the enterprise is growing steadily, and with this momentum comes an increase in Hadoop-related projects. From real-time data processing with Apache Spark, to data warehousing with Apache Hive, to applications that run natively across Hadoop clusters via Apache YARN, these next-generation technologies are solving real-world big data challenges today.

Posted October 06, 2017

Data professionals and vendors converged at Strata Data in New York to trade tips and tricks for handling big data. Top of mind for most was the impact of machine learning and how it's continuing to evolve as the "next big thing."

Posted October 05, 2017

Dremio launched its data analytics platform in July, and at the Strata Data Conference in New York the company had the opportunity to showcase what it can do. The company's mission is to cut out the need for traditional ETL, data warehouses, cubes, and aggregation tables, as well as the associated infrastructure, so that users can be independent and self-directed in their use of data, thereby accelerating time to insight.

Posted October 03, 2017

AtScale, which provides a universal semantic platform for BI on big data, has completed a $25 million Series C financing round. Seeking to provide big data access to any data, anywhere, for any employee, AtScale enables enterprises to simplify their business intelligence infrastructure by allowing business users to continue working with the tools they know while providing the enterprise with a universal semantic layer to centrally manage data definitions, performance and security.

Posted October 03, 2017

At the Strata Data Conference in New York, Paxata, provider of the Adaptive Information big data prep platform, announced early availability of its Intelligent Ingest as part of its next major release. The new automated ingest capabilities are aimed at making it simpler for business consumers to rapidly incorporate data from any cloud or format to prepare data for business analysis.

Posted October 02, 2017

At the Strata Data conference in New York, Attunity, a provider of data integration and big data management software solutions, showcased the new release of its data integration platform designed to address the changing needs of companies with advanced analytics and data management initiatives. According to Kevin Petrie, senior director and technology evangelist at Attunity, many legacy data integration tools are not able to handle the necessary volume and variety of data feeding to the cloud at the required performance levels.

Posted October 02, 2017

Alation and Paxata have announced a partnership and integration to simplify the establishment of trust in the data lake. Alation is a provider of software for collaborative data cataloging to enable analysts and information stewards to search, query and collaborate for faster, more accurate insights. Paxata provides an enterprise-grade, self-service, scalable, intelligent platform that enables business consumers to quickly transform raw data into ready information.

Posted September 28, 2017

BlueData, provider of a Big-Data-as-a-Service (BDaaS) software platform, is enhancing its BlueData EPIC platform and extending the solution to Google Cloud Platform (GCP) and Microsoft Azure. This release adds new innovations and options for running Hadoop, Spark, and other Big Data workloads on Docker containers -- delivering on the requirements from its rapidly growing customer base, including many of the world's largest enterprises across multiple industries.

Posted September 28, 2017

Informatica has introduced a new set of solutions and enhancements for intelligent data lake management and enterprise data cataloging to improve regulatory compliance in the era of GDPR. The solutions also feature integration with Hortonworks Atlas and support for Cloudera Altus, expanding Informatica's coverage across hybrid enterprise deployments, on premises and in the cloud.

Posted September 27, 2017

MapR Technologies announced database innovations for data-intensive applications, including advancements for developers that enable rich applications, in-place and continuous machine learning/AI and SQL capabilities, and global real-time data integration and micro-services support.

Posted September 26, 2017

Hortonworks has announced the Hortonworks DataPlane Service (Hortonworks DPS) to help improve the process of provisioning and operating distributed data systems for data science, self-service analytics or data warehousing optimization.

Posted September 25, 2017

As companies grow increasingly data-centric in their decision making, product and services development, and their overall understanding of the world they work in, speed and agility are becoming critical capabilities. A common theme in big data and analytics today is "Industry 4.0," representing a new wave of technology that enables the automation necessary for scaling. There's compelling justification for this as companies seek to unlock business value from big data with two broad approaches: the democratization of data with greater access by more users, and the enablement of automation everywhere possible.

Posted September 20, 2017

The movement toward the instrumentation of everything and the democratization of data and analytics is resulting in more data flowing to more users, and is creating new challenges in data management.

Posted September 20, 2017