Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

Hadoop adoption in the enterprise is growing steadily and with this momentum is an increase in Hadoop-related projects. From real-time data processing with Apache Spark, to data warehousing with Apache Hive, to applications that run natively across Hadoop clusters via Apache YARN, these next-generation technologies are solving real-world big data challenges today.

Posted October 06, 2017

Data professionals and vendors converged at Strata Data in New York to trade tips and tricks for handling big data. Top of mind for most was the impact of machine learning and how it's continuing to evolve as the "next big thing."

Posted October 05, 2017

Dremio launched its data analytics platform in July and at Strata Data Conference in New York the company had the opportunity to showcase what the company can do. The company's mission is to cut out the need for traditional ETL, data warehouses, cubes, and aggregation tables, as well as the infrastructure in order to enable users to be independent and self-directed in their use of data, thereby accelerating time to insight.

Posted October 03, 2017

AtScale, which provides a universal semantic platform for BI on big data, has completed a $25 million Series C financing round. Seeking to provide big data access to any data, anywhere, for any employee, AtScale enables enterprises to simplify their business intelligence infrastructure by allowing business users to continue working with the tools they know while providing the enterprise with a universal semantic layer to centrally manage data definitions, performance and security.

Posted October 03, 2017

At the Strata Data Conference in New York, Paxata, provider of the Adaptive Information big data prep platform, announced early availability of its Intelligent Ingest as part of its next major release. The new automated ingest capabilities are aimed at making it more simple for business consumers to rapidly incorporate data from any cloud or format to prepare data for business analysis. 

Posted October 02, 2017

At the Strata Data conference in New York, Attunity, a provider of data integration and big data management software solutions, showcased the new release of its data integration platform designed to address the changing needs of companies with advanced analytics and data management initiatives. According to Kevin Petrie, senior director and technology evangelist at Attunity, many legacy data integration tools are not able to handle the necessary volume and variety of data feeding to the cloud at the required performance levels.

Posted October 02, 2017

Alation and Paxata have announced a partnership and integration to simplify the establishment of trust in the data lake.  Alation is a provider of software for collaborative data cataloging to enable analysts and information stewards to search, query and collaborate for faster, more accurate insights. Paxata provides an enterprise-grade, self-service, scalable, intelligent platform that enables business consumers to quickly transform raw data into ready information.

Posted September 28, 2017

BlueData, provider of a Big-Data-as-a-Service (BDaaS) software platform, is enhancing its BlueData EPIC platform and extending the solution to Google Cloud Platform (GCP) and Microsoft Azure. This release adds new innovations and options for running Hadoop, Spark, and other Big Data workloads on Docker containers -- delivering on the requirements from its rapidly growing customer base, including many of the world's largest enterprises across multiple industries.

Posted September 28, 2017

Informatica has introduced a new set of solutions and enhancements for intelligent data lake management and enterprise data cataloging to improve regulatory compliance in the era of GDPR. The solutions also feature integration with Hortonworks Atlas and support for Cloudera Altus, expanding Informatica's coverage across hybrid enterprise deployments, on premises and in the cloud.

Posted September 27, 2017

MapR Technologies announced database innovations for data-intensive applications, including advancements for developers that enable rich applications, in-place and continuous machine learning/AI and SQL capabilities, and global real-time data integration and micro-services support.

Posted September 26, 2017

Hortonworks has announced the Hortonworks DataPlane Service (Hortonworks DPS) to help improve the process of provisioning and operating distributed data systems for data science, self-service analytics or data warehousing optimization.

Posted September 25, 2017

As companies grow increasingly data-centric in their decision making, product and services development, and their overall understanding of the world they work in, speed and agility are becoming critical capabilities. A common theme in big data and analytics today is "Industry 4.0," representing a new wave of technology that enables the automation necessary for scaling. There's compelling justification for this as companies seek to unlock business value from big data with two broad approaches: the democratization of data with greater access by more users, and the enablement of automation everywhere possible.

Posted September 20, 2017

The movement toward the instrumentation of everything and the democratization of data and analytics is resulting in more data flowing to more users, and is creating new challenges in data management.

Posted September 20, 2017

Many people are unsure of the differences between deep learning, machine learning, and artificial intelligence. Generally speaking, and with minimal debate, it is reasonably well-accepted that artificial intelligence can most easily be categorized as that which we have not yet figured out how to solve, while machine learning is a practical application with the know-how to solve problems, such as with anomaly detectio

Posted September 20, 2017

Syncsort is releasing a new platform that will deliver an agile, efficient, and powerful solution that will improve data stored and processed in data lakes. Trillium Quality for Big Data integrates Trillium data quality capabilities with the Intelligent Execution (IX) technology from its DMX-h Big Data integration solution.

Posted September 19, 2017

Actian, a provider of software for data management, analytics and integration, has announced support for Apache Spark in the latest release of Actian Vector in Hadoop (VectorH). Actian Vector technology exploits vectorized processing and multi-level in-memory  acceleration to improve performance on Hadoop data stores. It supports single node, clustered, and hybrid computing environments that span on-premise and the cloud.

Posted September 19, 2017

Each year, tens of thousands of data professionals from well over 100 countries gather at Oracle OpenWorld in San Francisco. Leaders of two major Oracle users' groups—David Start, president of the Independent Oracle Users Group, and Alyssa Johnson, president of the Oracle Applications Users Group—share what they have planned for their members at Oracle OpenWorld 2017, taking place Oct. 1-5.

Posted September 07, 2017

On Sunday at Oracle OpenWorld, several Oracle user groups, including the IOUG, will bring the experiences of our users and experts to San Francisco and share with thousands of our peers. If you're coming to OpenWorld, I can't say enough about how important it is to participate in the Sunday Program.

Posted September 06, 2017

MapR has introduced the MapR Orbit Cloud Suite which provides a comprehensive set of cloud computing capabilities for the MapR Converged Data Platform to enable organizations to build data fabrics that manage data across one or more clouds, hybrid clouds, or to the edge.

Posted August 30, 2017

Docker has announced the latest release of Docker Enterprise Edition (EE), a container as a service (CaaS) platform for managing and securing Windows, Linux, and mainframe applications across both on premises and cloud infrastructure.

Posted August 28, 2017

Databricks, founded by the team that created Apache Spark, has secured $140 million in a Series D funding round led by Andreessen Horowitz. The new funding brings Databricks' total capital raised to $247 million, and will accelerate the company's investment in making artificial intelligence (AI) achievable for enterprise organizations with its Unified Analytics Platform.

Posted August 22, 2017

Centerbridge Partners, L.P., a private investment firm, has completed the $1.26 billion acquisition of enterprise software providers Syncsort Incorporated and Vision Solutions, Inc. from affiliates of Clearlake Capital Group, L.P. Headquartered in Pearl River, NY, the new company benefits from a dramatic increase in global presence, as well as significantly expanded product offerings, afforded by the combination.

Posted August 18, 2017

These days, end users—be they employees or consumers visiting a site—expect information delivered in seconds, if not nanoseconds. Applications tied into networks of connected devices and sensors are powering operations and making adjustments on a real-time basis.

Posted August 16, 2017

Kyvos Insights, a big data analytics company, is releasing Kyvos 4.0, providing users with new capabilities for creating data cubes with near limitless scalability and performance. Kyvos 4.0 delivers new levels of scalability, performance, and support for concurrent users, enabling organizations to provide self-service, interactive business intelligence (BI) on big data for all of their users across the enterprise.

Posted August 15, 2017

IBM announced a new all-flash, high-performance data and file management solution for enterprise clients running exabyte-scale big data analytics, cognitive and AI applications. The combined flash and storage software solution has been certified with the Hortonworks Data Platform (HDP) to provide clients with more choice in selecting the right platform for their big data analytics on data processing engines like Hadoop and Spark.

Posted August 14, 2017

In the new world of big data and the data-driven enterprise, data has been likened to the new oil, a company's crown jewels, and the transformative effect of the advent of electricity. Whatever you liken it to, the message is clear that enterprise data is of high value. And, after more than 10 years, there is no technology more aligned with advent of big data than Hadoop.

Posted August 02, 2017

The desire to compete on analytics is driving the adoption of big data and cloud technologies that enable enterprises to inexpensively store and process large volumes of data. But, building a modern data architecture can be confusing and time-consuming. From NoSQL and in-memory databases, to Hadoop and Spark, technologies are available that offer new and distinct capabilities to the world of enterprise data management.

Posted July 27, 2017

It's been long acknowledged that data is the most precious commodity of the 21st-century business, and that all efforts and resources need to be dedicated to the acquisition and care of this resource. Lately, however, executives have become enamored with the vision of transforming their organizations into "data-driven" enterprises, which move forward into the future on data-supported insights. So, what, exactly, does the ideal "data-driven enterprise" look like?

Posted July 27, 2017

The data manager now sits in the center of a revolution swirling about enterprises. In today's up-and-down global economy, opportunities and threats are coming in from a number of directions. Business leaders recognize that the key to success in hyper-competitive markets is the ability to leverage data to draw insights that predict and provide prescriptive action to stay ahead of markets and customer preferences. For that, they need to keep up with the latest solutions and approaches in data management. Here are 12 of the key technologies turning heads—or potentially opening enterprise wallets—in today's data centers.

Posted July 19, 2017

MicroStrategy, a provider of enterprise analytics and mobility software, is releasing MicroStrategy 10.8, updating critical capabilities across enterprise analytics, enterprise mobility, embedded analytics, enterprise cloud, and enterprise IoT.

Posted July 05, 2017

Snowflake Computing, a provider of cloud data warehouse technology, is extending its concept of the modern data warehouse to what it calls a "data sharehouse."

Posted June 22, 2017

SAP and Accenture are expanding their collaboration to co-innovate, co-develop, and jointly go to market with digital solutions based on the new SAP Leonardo digital innovation system. The long-standing strategic partners will focus on embedding digital technologies including additional machine learning, analytics, and the Internet of Things (IoT) at the core of clients' businesses to deliver even greater value from their SAP investments.

Posted June 21, 2017

Syncsort, a provider of data integrity and integration solutions for next-generation analytics, has announced new capabilities in its mainframe data access and integration solution that populates Hadoop data lakes with changes in mainframe data.

Posted June 20, 2017

New and emerging vendors offer fresh ways of dealing with data management and analytics challenges in areas such as data as a service, security as a service, cloud in a box, and data visualization. Here, DBTA looks at the 10 companies whose approaches we think are worth watching.

Posted June 16, 2017

A new era of computing is unfolding with big data, cloud, and cognitive all converging at once. This confluence will transform how we do business and it's impacting all industries.

Posted June 16, 2017

The world of data management is constantly changing. Each year, the DBTA 100 spotlights the companies that are dealing with evolving market demands through innovation in software, services, and hardware.

Posted June 15, 2017

IBM and Hortonworks are expanding their partnership focused on extending data science and machine learning to more developers and across the Apache Hadoop ecosystem. The companies are combining the Hortonworks Data Platform (HDP) with the IBM Data Science Experience and IBM Big SQL into new integrated solutions designed to help users better analyze and manage data for better decision making.

Posted June 13, 2017

Attunity Ltd., a provider of data integration and big data management software solutions, is launching a new solution, Attunity Compose for Hive, which automates the process of creation and continuous loading of operational and historical data stores in a data lake.

Posted June 13, 2017

Addressing the rise of hybrid deployments, Hortonworks has introduced a new software support subscription to provide seamless support to organizations as they transition from on-premise to cloud. Separately, Hortonworks also announced the general availability of Hortonworks Dataflow (HDF) 3.0, a new release of its open source data-in-motion platform, which enables customers to collect, curate, analyze and act on all data in real-time, across the data center and cloud.

Posted June 12, 2017

Hadoop adoption is growing and so is the commitment to data lake strategies. Data security, governance, integration, and access have all been identified as critical success factors for data lake deployments.

Posted June 09, 2017

The demand for speed and agility are among the key drivers of the growing DevOps movement, which seeks to better align software development and IT operations. Yet, challenges still exist.

Posted June 07, 2017

Databricks has introduced a new offering to simplify the management of Apache Spark workloads in the cloud. "Databricks Serverless" is a managed computing platform for Apache Spark that allows teams to share a pool of computing resources and automatically isolates users and manages costs. The new offering aims to remove the complexity and cost of users managing their own Spark clusters.

Posted June 06, 2017

MapR Technologies, Inc., provider of a converged data platform that integrates analytics with operational processes in real time, has announced MapR-XD, a cloud-scale data store to manage files and containers. As part of the MapR Converged Data Platform, MapR-XD supports any data type from the edge to the data center and multiple cloud environments with automatic policy-driven tiering from hot, warm or cold data to enable customers to create global data fabrics which are ready for analytical and operational applications.

Posted June 06, 2017

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18

Sponsors