Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

Describing it as potentially the most important new open source project in a decade, IBM announced a major commitment to Apache Spark.

Posted June 15, 2015

Data is only going to continue to grow and so must data processing capabilities.

Posted June 15, 2015

JethroData has closed an $8.1 million Series B financing round led by Square Peg Capital, and including existing investor Pitango Venture Capital. This funding round comes on the heels of the general availability of JethroData's SQL-on-Hadoop software, which uses indexes to query data up to 100 times faster than alternative SQL on Hadoop solutions.

Posted June 10, 2015

MapR Technologies is working with Microsoft to offer the MapR Distribution including Apache Hadoop on Azure cloud services. MapR is architected to provide a scalable and agile Hadoop platform and NoSQL database for big and fast data applications. The MapR Distribution will also be integrated with the Azure Data Lake, enabling users to deploy tiered analytical and storage capabilities in the cloud.

Posted June 10, 2015

Concurrent has announced general availability of Cascading 3.0, a platform for building and deploying enterprise applications on Hadoop. "This new, available version of Cascading will enable our users even further by simplifying application development, accelerating time to market and allowing enterprises to leverage existing, and more importantly, new and emerging data infrastructure and programming skills," said Chris Wensel, founder and CTO, Concurrent, Inc.

Posted June 10, 2015

Pentaho is now offering support for integration with SAP HANA and Amazon Elastic MapReduce (EMR). Dubbed Pentaho 5.4, available now, the solution offers new capabilities that build on a pragmatic and future-proofed platform for big data orchestration and analytics at scale.

Posted June 10, 2015

Syncsort is working with Dell to help businesses improve operational efficiency and lower costs by shifting expensive workloads and associated data from enterprise data warehouses (EDW) to Hadoop.

Posted June 10, 2015

Map R is releasing version 5.0 of the MapR Distribution including Hadoop that will process big and fast data on a single data platform that enables a new class of real-time applications.

Posted June 09, 2015

While the influx of data provides many benefits to an organization, it can come with some challenges as well. The cost and lack of elasticity due to the volume and multiple data sources have become difficult issues for businesses to deal with.

Posted June 09, 2015

To help users extract insights from data lakes,Teradata has made a multi-year commitment to contribute to Presto's open source development. Based on a three-part roadmap, Teradata's says its contributions will be 100% open source under the Apache license and will advance Presto's code base, scalability, iterative querying, and ability to query multiple data repositories.

Posted June 08, 2015

New and emerging companies are taking aim at data management challenges to help customers get more value from their data. Here, DBTA showcases the approaches of 10 companies we think are worth watching.

Posted June 01, 2015

The world of information technology is often described as an ecosystem. And just like an ecosystem that exists in nature, IT is affected by internal and external factors. In this issue of Database Trends and Applications magazine, we introduce the third annual list of companies that matter in data, a reflection of the IT ecosystem as it exists now.

Posted June 01, 2015

Cloudera is now offering support for Capgemini's new reference architecture for the SAP HANA platform and Cloudera Enterprise. "By bringing the power of Cloudera's enterprise data hub offering to the ecosystem in support of SAP HANA, we can enable Capgemini's clients to expand the amount of data they have within their environment in a cost-efficient manner," said Tim Stevens, vice president of corporate and business development at Cloudera.

Posted May 28, 2015

Many DBAs are now tasked with managing multi-vendor environments, and handling a variety of data types. Increasingly, DBAs are turning to strategies such as database automation to be able to concentrate more on the big picture of moving their enterprises forward.

Posted May 28, 2015

As the influx of data continues to consume organizations, marketers are overwhelmed at sorting through it all to gain effective insights. Data visualization can help, according to Amanda Gessert, senior manager of business intelligence at Merkle, who took part in a Data Summit 2015 session called "Increasing Your Productivity Through Data Visualization."

Posted May 27, 2015

The pressure to compete on analytics and leverage the value of increasingly large and varied data stores shows no sign of abating. To focus the spotlight on the products that are most important to DBTA readers, the DBTA Readers' Choice Awards provides the opportunity to select from more than 300 nominees across 29 categories.

Posted May 27, 2015

At Data Summit 2015 in New York City, Tony Shan, chief architect, Wipro, gave a talk on the key components of a successful big data methodology and shared lessons learned from real world big data implementations. According to Shan, there is an 8-step process for a big data framework with specific techniques and methods.

Posted May 26, 2015

Building on its native support for a variety of database platforms - including Apache Hive for Hadoop, MongoDB Enterprise, and traditional relational database management systems (RDBMS) - Embarcadero Technologies' data modeling tool, ER/Studio Data Architect version 10.0, has been certified by HortonWorks and Mongo DB Enterprise.

Posted May 21, 2015

Rosslyn Analytics, a provider of big data cloud technology, has announced it is one of the first to offer companies analytics as a service on Azure. The big data cloud analytics platform, powered by Azure, provides self-service management from source to analytics and enables business and IT users to interact with, change, and analyze data using a combination of self-service data integration, cleansing and enrichment tools and machine learning and visualization technologies.

Posted May 20, 2015

In a DBTA roundtable webcast on Wednesday, May 27, three of the industry's leading Hadoop and cloud experts - Gartner Research vice president Merv Adrian, Hortonworks vice president of strategy Shaun Connolly, and Microsoft general manager of platform and cloud Lance Olson - will discuss key technology solutions such as Apache Hadoop and cloud, and emerging best practices for leveraging them.

Posted May 20, 2015

Oracle is shipping a new big data product called Oracle Big Data Spatial and Graph. Spatial and graph analytics has been available as an option for Oracle Database for more than 10 years, and with this introduction the company is bringing spatial and graph analytics to Hadoop and NoSQL.

Posted May 20, 2015

Ian Abramson, principal senior consultant, SWI Systemware Innovation Corporation, Oracle Ace, and IOUG past president, provided a look back at the evolution of BI and a peek at what may lie ahead for analytics in a talk titled "Analytics in the Time of Big Data," as part of the IOUG track at Data Summit 2015 in New York.

Posted May 20, 2015

MapR Technologies, Inc., a provider of a distribution for Apache Hadoop, is including Apache Drill 1.0 in the MapR Distribution.

Posted May 19, 2015

Organizations have begun to realize the importance that big data can provide them. The technologies and issues involved in becoming a data driven enterprise were explored in a panel discussion at Data Summit 2015 in New York City, featuring David Mariani, CEO of AtScale; Andy Schroepfer, chief strategy officer with HOSTING; and Wendy Gradek, senior manager, R&D Analytics, EMC.

Posted May 19, 2015

Time is running out to vote for the 2015 DBTA Readers' Choice Awards. This year, there are more than 300 nominees across 29 categories.

Posted May 19, 2015

Google white papers have inspired many great open source projects. What has been missing until now, however, has been a way of bringing these technologies together such that any data-centric organization can benefit from the capabilities of each technology across its entire data center, and in new ways not documented by any single white paper. This is called the "Zeta Architecture."

Posted May 19, 2015

The demand for effective data management is intensifying. At the same time, the database market has expanded into a wide array of solutions—from traditional relational database management systems to alternative databases such as NoSQL, NewSQL, cloud, and in-memory offerings.

Posted May 19, 2015

RedPoint Global was founded in 2006 by Dale Renner, Lewis Clemmens, and George Corugedo, who previously had worked together at Accenture. Based in Wellesley, Mass., RedPoint collaborates with clients around the world in 11 different verticals. "We have always been very focused on the data, and recognize that a lot of business problems live and die by the quality of the data," says Corugedo.

Posted May 19, 2015

Hadoop is contributing to the success of data analytics. Anad Rai, IT manager at Verizon Wireless, examined the differences between traditional versus big data at Data Summit 2015 in a session titled "Analytics: Traditional Versus Big Data." The presentation, which was part of the IOUG track moderated by Alexis Bauer Kolak, education manager at the IOUG, showed how big data technologies are helping data discovery and improving the transformation of information and knowledge into wisdom.

Posted May 14, 2015

The data lake is one of the hottest topics in the data industry today. It is a massive storage reservoir that allows data to be stored in its rawest forms. Hadoop Day at Data Summit 2015 concluded with a panel on everything data lake featuring James Casaletto, solutions architect for MapR, Joe Caserta, president and founder of Caserta Concepts, and George Corugedo, CTO with RedPoint Global Inc.

Posted May 14, 2015

With the influx of big data solutions and technologies comes a bevy of new problems, according to Data Summit 2015 panelists Miles Kehoe, search evangelist at Avalon Consulting, and Anne Buff, business solutions manager for SAS best practices at the SAS Institute. Kehoe and Buff opened the second day of Data Summit with a keynote discussion focusing on resolving data conundrums.

Posted May 14, 2015

To transform data into value, IT must move from thinking about what it does to data, and instead focus on business outcomes and what can be done with the data to advance the business, according to Edd Dumbill, vice president, strategy, Silicon Valley Data Science, who gave the welcome keynote at Data Summit 2015.

Posted May 14, 2015

Data has changed and, with this change, cracks in the armor of traditional data warehousing approaches are forming. The concept of the data warehouse is still sound. However, businesses can be more successful by acknowledging that the traditional enterprise data warehouse cannot solve all problems today.

Posted May 14, 2015

At Data Summit 2015 in New York, James Casaletto of MapR and David Teplow of Integra provided deep dives into the world of Hadoop, past, present, and future.

Posted May 12, 2015

Splice Machine is partnering with Talend to enable customers to simplify data integration and streamline data workflows on Hadoop. Through this partnership, organizations building operational data lakes with Splice Machine can augment Talend's data integration technology with its data quality capabilities.

Posted May 12, 2015

Pentaho users will now be able to use Apache Spark within Pentaho thanks to a new native integration solution that will enable the orchestration of all Spark jobs. Pentaho Data Integration (PDI), an effort initiated by Pentaho Labs, will enable customers to increase productivity, reduce maintenance costs, and dramatically lower the skill sets required as Spark is incorporated into big data projects.

Posted May 12, 2015

Pivotal has made updates to its big data suite that include upgrades to the Pivotal HD enterprise-grade Apache Hadoop distribution, which is now based on the Open Data Platform core, and performance improvements for Pivotal Greenplum Database.

Posted May 05, 2015

As organizations embark on big data projects, how do they choose among all the diverse players across NoSQL, NewSQL, and Hadoop? In this special DBTA roundtable webcast, you'll learn why open standards matter to your big data investment.

Posted May 05, 2015

The Spring 2015 release of the SnapLogic Elastic Integration Platform extends the platform's cloud and big data integration capabilities to the Internet of Things (IoT) with support for Message Queuing Telemetry Transport (MQTT), a lightweight machine-to-machine connectivity protocol.

Posted May 05, 2015

In a recent DBTA webcast, Shane Johnson, senior product marketing manager, Couchbase, discussed the relationship between NoSQL and Hadoop, detailing the multiple ways to integrate NoSQL databases with Hadoop. "It's not Hadoop or Couchbase Server. It's Hadoop and Couchbase Server," said Johnson.

Posted April 28, 2015

Splice Machine, a provider of Hadoop RDMS, announced that it is partnering with mrc (michaels, ross & cole ltd), to allow Splice Machine's Hadoop RDBMS to be certified and integrated with mrc's m-Power platform. "Our partnership with mrc gives businesses a solution that can speed real-time application deployment on Hadoop with the staff and tools they currently have, while also offering affordable scale-out on commodity hardware for future growth," said Monte Zweben, co-founder and CEO, Splice Machine.

Posted April 28, 2015

Pivotal HAWQ is now available on the Hortonworks Data Platform (HDP), enabling the benefits of SQL on Hadoop to be leveraged by enterprises that are investing in HDP. This marks the first time that the features and capabilities of Pivotal HAWQ have been made available outside of Pivotal. The availability aligns with a common Open Data Platform (ODP) Core that allows users to leverage the best-of-breed technology across providers.

Posted April 27, 2015

The future will flourish with machines. We've been told this in pop culture for decades, from the helpful robots of the Jetsons, to the infamous Skynet of the Terminator movies, to the omniscient "computer" of Star Trek. Smart, connected devices will be ubiquitous and it's up to us, the humans, to decide what's next. But the Internet of Things (IoT) is about more than devices and data.

Posted April 23, 2015

SUSE and Veristorm are partnering to provide certified high-performance Hadoop solutions that run directly on Linux on IBM z Systems, IBM Power Systems, and x86-64. Customers with IBM z Systems can team SUSE Linux Enterprise Server for System z with Veristorm zDoop, a commercial distribution of Hadoop supported on mainframes.

Posted April 23, 2015

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21

Sponsors