Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

To shed light on the enterprise and technology issues IT professionals will be facing in 2017 as business or organizational leadership seeks strategies to leverage the "big data" phenomenon, the fourth annual edition of the Big Data Sourcebook is now available for download.

Posted December 01, 2016

Dataguise through its DgSecure platform is now supporting sensitive data discovery on Amazon Redshift and Amazon RDS, as well as Amazon Simple Storage Service (S3). The platform will now scan for sensitive information stored on Amazon Redshift, RDS, and S3 and provide ongoing monitoring of sensitive data in S3 throughout its lifecycle.

Posted December 01, 2016

What's ahead for 2017 in terms of big data and IoT? IT executives reflect on the impact that Spark, blockchain, data lakes, cognitive computing, and AI and machine learning, and other cutting-edge approaches may have on data management and analytics over the year ahead.

Posted November 30, 2016

SUSE is acquiring OpenStack IaaS and Cloud Foundry PaaS Talent and Technology Assets from HPE. The agreement aims to accelerate SUSE's entry into the growing Cloud Foundry Platform-as-a-Service (PaaS) market.

Posted November 30, 2016

AtScale, which provides a self-service BI platform for big data, has announced an expansion of its services. With this announcement, the company says it is introducing a BI platform that enables businesses to work seamlessly across all of big data, on premise and in the cloud. In addition to Hadoop, AtScale has announced preview availability of support for data stored in Teradata, Google Dataproc and BigQuery, expanding on the company's existing support for Microsoft Azure and HDInsight.

Posted November 21, 2016

Aerospike, a provider of NoSQL solutions, is releasing a new version of its Aerospike platform, transforming how organizations store, access, and analyze data. The new version of Aerospike includes features such as SortedMap, durable delete, IPv6, improved cluster management, and updated network naming.

Posted November 17, 2016

Databricks has announced that, in collaboration with industry partners, it has broken the world record in the CloudSort Benchmark, a third-party industry benchmarking competition for processing large datasets. Databricks was founded by the team that created the Apache Spark project.

Posted November 16, 2016

Expanding the capabilities for customers to take advantage of the elasticity of Apache Hadoop and Apache Spark in the cloud to power new workloads and analytic applications, Hortonworks, Inc. has announced the availability of Hortonworks Data Cloud on the AWS Cloud.

Posted November 15, 2016

MicroStrategy Incorporated is launching MicroStrategy Desktop at no cost to users, allowing professionals to utilize popular data sources and build insightful visualizations. MicroStrategy Desktop, available for Mac and PC, is a data discovery tool that allows users to access data on their own and build dashboards.

Posted November 03, 2016

New data sources such as sensors, social media, and telematics along with new forms of analytics such as text and graph analysis have necessitated a new data lake design pattern to augment traditional design patterns such as the data warehouse. Unlike the data warehouse - an approach based on structuring and packaging data for the sake of quality, consistency, reuse, ease of use, and performance - the data lake goes in the other direction by storing raw data that lowers data acquisition costs and provides a new form of analytical agility.

Posted November 03, 2016

Driven by the demands of an always-on global economy, the widespread proliferation of data is combining with an expectation to leverage seamlessly-integrated data in near-real time. However, data integration methods aren't keeping up.

Posted November 02, 2016

Trifacta, a provider of data wrangling solutions, is launching Wrangler Edge, a platform designed for analyst teams wrangling diverse data outside of big data environments. "We are packing the Trifacta product and adding enterprise features such as the ability to schedule jobs to handle larger data volumes to connect to diverse sources," said Will Davis, director of product marketing. "We also added collaboration and sharing features as well all without requiring organizations to manage a large Hadoop infrastructure."

Posted November 01, 2016

With data flowing into enterprises from so many different sources, and at varying speeds and times, effective solutions are needed to enable insights to be uncovered for faster decision making. To delve into the issues involved in making big data usable more quickly within organizations, DBTA recently presented a webinar featuring executives from the Federal Home Loan Mortgage Corp., known as Freddie Mac.

Posted October 27, 2016

SAP has completed its acquisition of Altiscale, which provides a high-performance, scalable Big Data-as-a-Service (BDaaS) solution that includes full operational services. The company announced this acquisition at Strata + Hadoop 2016 in New York City. With this acquisition Altiscale will operate as a focused and integrated BDaaS offering from SAP to help accelerate and operationalize Big Data deployment in the enterprise.

Posted October 26, 2016

Paxata is releasing Paxata Connect to extend the Paxata Platform with a connectivity framework that creates a nexus to acquire, shape, and publish meaningful data for faster time to value. With Connect, information architects and developers can take advantage of out-of-the-box connectors, build their own repeatable data services and pipelines, and maintain transparency and oversight to ensure data provides a greater and faster return.

Posted October 19, 2016

Talend, a provider of cloud and big data integration software, has formed a strategic partnership with T-Systems. T-Systems, a German subsidiary of Deutsche Telekom, is using Talend Big Data Integration software to streamline the collection and cleansing of data as part of T-Systems' Big Data platform services.

Posted October 11, 2016

A new world of self-service BI brings with it its own issue of data chaos. When everyone is looking at the data their own way, people find different answers to the same questions.

Posted October 10, 2016

Organizational issues such as governance and skills—not technology requirements—are the greatest challenges that IT and corporate managers are facing in the emerging world of big data. To get a better handle on the complex new world big data is catalyzing, executives and professionals recognize they must reimagine and re-architect the concept of the "data center"—and what ultimately is coming out of may be a surprise to everyone. These are some key takeaways from a recent survey of 319 corporate and IT managers, conducted by Unisphere Research, a division of Information Today, Inc., in partnership with Cloudera and Intel.

Posted October 07, 2016

Vormetric, a Thales company, is releasing a new platform called Thales Orchestrator, to help reduce the cost of protecting data at rest across organizations. Thales Orchestrator's features include live data transformation, key management-as-a-service, Bring Your Own (encryption) Key management for AWS with Thales HSMs, vaultless tokenization, and Docker Encryption and Access Controls. Docker data-at-rest encryption and access controls that help to assure container images and file systems are controlled and secure.

Posted October 06, 2016

Splice Machine, provider of an SQL RDBMS powered by Hadoop and Spark, now supports native PL/SQL on Splice Machine. Announced at Strata + Hadoop World in NYC, the new capabilities are available through the Splice Machine Enterprise Edition.

Posted October 05, 2016

Pepperdata unveiled a new offering that enables customers of Amazon Elastic MapReduce (EMR) to gain granular visibility into their clusters' run time performance. Even after an Amazon EMR cluster has completed its work and is terminated, users will be able to access fine-grained monitoring data that allows customers to view a run and analyze it, as well as compare it with historical data to improve future performance.

Posted October 05, 2016

Choosing when to leverage cloud infrastructure is a topic that should not be taken lightly. There are a few issues that should be considered when debating cloud as part of a business strategy.

Posted October 04, 2016

Syncsort is incorporating new open metadata management capabilities in its DMX-h data integration software that, along with its seamless integration with Cloudera Navigator, aim to make big data governance easier.DMX-h provides organizations with a single interface for accessing and integrating all enterprise data, including IBM z mainframes, and the flexibility to use the metadata repository that best meets their needs, on premise and in the cloud.

Posted October 04, 2016

NoSQL and Hadoop—two foundations of the emerging agile data architecture—have been on the scene for several years now, and, industry observers say, adoption continues to accelerate—especially within mainstream enterprises that weren't necessarily at the cutting edge of technology in the past.

Posted October 04, 2016

MapR Technologies recently announced support for event-driven microservices in the MapR Converged Data Platform. The goal, the company says, is to leverage continuous analytics, automated actions, and faster response to impact business as it happens.

Posted October 03, 2016

Zaloni, the data lake company, unveiled new platform updates at Strata + Hadoop World 2016 including new enhancements to Bedrock Data Lake Management Platform and its Mica self-service data preparation solution. Bedrock helps businesses govern and manage data across the enterprise, and Bedrock 4.2 adds new capabilities around data privacy, security, and data lifecycle management.

Posted October 03, 2016

MariaDB Corporation is updating its MaxScale platform, adding a data streaming integration with Kafka, enhanced security, and high availability capabilities. MariaDB MaxScale is a next-generation database proxy that manages administrative functions like security, scalability, data streaming and high availability, enabling the database to focus on core functionality to drive faster innovation.

Posted October 03, 2016

At Strata + Hadoop World, Hortonworks showcased its technology solutions for streaming analytics, security, governance, and Apache Spark at scale.

Posted September 30, 2016

Attendees of Strata + Hadoop saw their fair share of solutions that tout that they are "next big thing" to solve a multitude of big data problems. Eric Sammer, CTO and co-founder, and Bryce Hein, vice president of Marketing, at Rocana observed that the focus is more on how platforms can help issues rather than the infrastructure behind it.

Posted September 29, 2016

Cloudera has added new technology enhancements to its data management and analytics platform to make it easier for companies to take advantage of elastic, on-demand cloud infrastructure for business value from all their data. The move to the cloud has become a top priority for CIOs, said Charles Zedlewski, vice president, of products at Cloudera, at Strata + Hadoop World 2016 in NYC.

Posted September 29, 2016

MathWorks showcased the latest release of MATLAB, which is used in the development of analytics and algorithms to help solve engineering and scientific problems, at Strata + Hadoop World 2016 in New York.

Posted September 29, 2016

Dataguise announced general availability of the Dataguise DgSecure Dashboard, a dashboard for visualization of all sensitive data throughout the enterprise—including government-protected PII, PHI, and PCI data—and demonstrated the capability at Strata + Hadoop World Conference in New York City.

Posted September 29, 2016

At Strata + Hadoop World 2016, Kognitio announced a competition to find the best use case or application that has its newest offering, Kognitio-on-Hadoop, as part of the solution. According to Roger Gaskell, CTO of Kognitio, the company is looking for solutions that are innovative in terms of application or something more common place but is now being done at scale.

Posted September 29, 2016

Attunity Ltd., a provider of big data management software solutions, is introducing a new platform for SAP, a data replication solution optimized to deliver SAP application data application data in real-time for big data analytics on-premises or in the cloud. Attunity Replicate for SAP aims to transform complex data structures into easily accessible data models across a wide variety of analytics platforms.

Posted September 28, 2016

SAP is releasing a next generation data warehouse solution for running a real-time digital enterprise on-premise and in the cloud. The new solution, SAP BW/4HANA, will be available on Amazon Web Services (AWS) and SAP HANA Enterprise Cloud (HEC).

Posted September 28, 2016

Data lakes are quickly transitioning from interesting idea to priority project. A recent study, "Data Lake Adoption and Maturity," from Unisphere Research showed that nearly half of respondents have an approved budget or have requested budget to launch a data lake project. What's driving this rapid rush to the lake?

Posted September 27, 2016

ODPi, a nonprofit organization providing big data solutions, has announced that several new solution and application providers have signed onto the ODPi Interoperable Compliance Program. DataTorrent, IBM, Pivotal, SAS, Syncsort, WanDisco, and Xaivent will now work together across a wider range of commercial Apache Hadoop platforms.

Posted September 27, 2016

At Strata + Hadoop World, MapR Technologies announced support for microservices that leverage continuous analytics, automated actions, and rapid response to better impact business as it happens. The new capabilities in the MapR Platform range from microservices application monitoring and management to integrated support for agile microservices application development.

Posted September 27, 2016

At Strata + Hadoop World, Pentaho announced five new improvements, including SQL on Spark, to help enterprises overcome big data complexity, skills shortages and integration challenges in complex, enterprise environments. According to Donna Prlich, senior vice president, product management, Product Marketing & Solutions, at Pentaho, the enhancements are part of Pentaho's mission to help make big data projects operational and deliver value by strengthening and supporting analytic data pipelines.

Posted September 26, 2016

Informatica is expanding its Data Lake Management solution to strengthen and deepen its overall platforms, enabling data analysts to easily find, prepare, govern, and protect data of any size and velocity to more efficiently and confidently drive business value from Hadoop-based data lakes.

Posted September 21, 2016

Oracle executive chairman and CTO Larry Ellison presented his annual opening keynote at OpenWorld 2016 on Sunday night. Amid announcements of new products and services, his remarks repeatedly returned to three key themes.

Posted September 21, 2016

In his keynote on Monday at Oracle OpenWorld 2016, Oracle CEO Mark Hurd showcased customer success stories and offered three new predictions for the future of cloud, which, he said, represents a generational shift in the IT market.

Posted September 21, 2016

"In this coming year, you'll see us aggressively moving into infrastructure-as-a-service," Larry Ellison, Oracle's chief technology officer and executive chairman of the board, said to kick off the company's OpenWorld conference Sunday night at the Moscone Center. In the first of his two scheduled keynote addresses, Ellison went on to outline a number of strategic announcements that aim to strengthen the company's offerings, as well as to help it compete with Amazon.com, one of its top challengers.

Posted September 19, 2016

To help organizations continue to get the most from their data, Big Data Quarterly has published the second annual "Big Data 50," a list of companies driving big data innovation. The Big Data 50 includes forward-thinking companies that are expanding what is possible in terms of managing and deriving value from data.

Posted September 14, 2016

With the growth of data variety, volume, and velocity, innovative data management approaches are needed. To help organizations get the most from their data, Big Data Quarterly presents the second annual "Big Data 50," our list of companies driving innovation.

Posted September 14, 2016

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15

Sponsors