Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

In 2014, the big data drumbeat continued to pound, major DBMS vendors expanded their product offerings, Microsoft hired a new CEO, and a range of new technology offerings were introduced. In retrospect, what stands out?

Posted January 29, 2015

To help organizations leverage the full range of big data to drive better decision making, Novetta has launched data refinement, entity resolution and analysis software that it says will power large-scale analytics on all data in Hadoop. The solution is now certified to run on Cloudera CDH and Hortonworks HDP.

Posted January 23, 2015

It is no secret that we are in the data age. Data comes at us from all directions, in all shapes and sizes. Incumbent vendors and startups constantly add new features, build on top of emerging open source projects, and claim to solve the next wave of challenges. Within the Hadoop ecosystem alone, there are (at least) 11 Hadoop-related open source projects. Making sense of it can be a time-consuming headache. To bring clarity and peace of mind, here are the top 5 big data predictions for 2015 and beyond.

Posted January 21, 2015

Registration is now open for Data Summit 2015, providing the opportunity to connect with the best minds in the industry, learn what works, and chart your course forward in an increasingly data-driven world. The event is designed to offer a comprehensive educational experience that guides attendees through the key issues in data management and analysis today.

Posted January 21, 2015

In 2014, we continued to watch big data enable all things "big" about data and its business analytics capabilities. We also saw the emergence (and early acceptance) of Hadoop Version 2 as a data operating platform, with cornerstones of YARN (Yet Another Resource Negotiator) and HDFS (Hadoop Distributed File System). In 2015, the mainstream adoption with enterprise data strategies and acceptance of the data lake will continue as data management and governance practices provide further clarity. The cautionary tale of 2014, to ensure that business outcomes rather than the hype of previous years drive big data adoption, will likewise continue.

Posted January 21, 2015

During a live event, Larry Ellison, Oracle's executive chairman of the board and CTO, outlined a new strategy for reducing customer costs and increasing value with the company's next generation of engineered systems. In the presentation today, Ellison emphasized two key points.

Posted January 21, 2015

Xplenty, which provides a big data processing platform powered by Hadoop, is partnering with Segment, which provides a customer data hub. Segment is a universal integration layer that supports customer data collection. Through a single platform, it collects, translates, and routes data to analytics and marketing tools.

Posted January 20, 2015

The manageability track at COLLABORATE 15 - IOUG Forum is designed to provide DBAs of every experience level and industry with the knowledge set to streamline IT management processes at their organization and accelerate their transformation to cloud.

Posted January 14, 2015

MapR, a provider of an enterprise grade distributed data platform including Hadoop, has announced a relationship with SAS, a provider of business analytics software and services. "If I were to summarize the journey the two companies are on, it is about getting data results bigger and faster," explained Jack Norris, CMO, MapR Technologies. The partnership of both organizations will allow for additional flexibility and control of Hadoop-based data.

Posted December 29, 2014

As analytics continues to play a larger role in the enterprise, the need to leverage and protect the data looms larger. According to IDC, the big data and analytics market will reach $125 billion worldwide in 2015. Here are 10 predictions from industry experts about data and analytics in 2015.

Posted December 19, 2014

In its fourth big data-related acquisition this year, Teradata announced it has acquired RainStor, a privately held company specializing in online big data archiving on Hadoop. RainStor's technology offers three key advantages, explains Chris Twogood, vice president of products and services at Teradata. It enables extreme data compression, with the ability to compress data from 10x to 40x; RainStor data is immutable, which is important for compliance and security regulations; and it is all accessible by SQL.

Posted December 18, 2014

2015 is going to be a big year for big data in the enterprise, according to Oracle. Neil Mendelson, Oracle vice president of big data and advanced analytics, shared Oracle's "Top 7" big data predictions for 2015. "The technology is moving very quickly and it is gaining to the point where a broader set of people can get into it - not just because it is affordable - but because they no longer require specialized skills in order to take advantage of it," he said.

Posted December 17, 2014

Data is increasingly being recognized as a rich resource flowing through organizations from a continually growing range of sources. But to realize its full potential, this data must be accessed by an array of users to support both real-time decision making and historical analysis, integrated with other information, and still kept safe from hackers and others with malicious intent. Fortunately, leading vendors are developing products and services to help. Here, DBTA presents the list of Trend-Setting Products in Data and Information Management for 2015.

Posted December 17, 2014

BlueData, which provides EPIC Enterprise, software to enable enterprises to create a self-service cloud experience on premise, has announced the growth of its big data partner ecosystem. Thirteen new companies spanning infrastructure, big data distributions, ETL/BI applications and system integrators have joined the partner program to accelerate the adoption of big data private cloud on-premises.

Posted December 16, 2014

There is still time to submit a speaking proposal for DBTA's Data Summit 2015, which will take place at the New York Hilton Midtown, May 11-13, 2015.

Posted December 08, 2014

Real-time fraud analytics provider Argyle Data is partnering with Hortonworks, a contributor to and provider of enterprise Apache Hadoop. By joining the Hortonworks Technology Partner Program, Argyle says it can rely on Hadoop to help strengthen its ability to drive and lead efforts related to advanced analytics and emerging technologies, including petabyte-scale data storage, machine learning and deep-packet inspection (DPI).

Posted December 02, 2014

GoGrid, an infrastructure-as-a-service provider specializing in multi-cloud solutions, is partnering with Cloudera, a provider of Hadoop-based software and services. The partnership will allow companies to evaluate and run the platform for big data through Cloudera Live. Traditional methods of deploying Hadoop require on-premises work just to test potential solutions. The GoGrid-Cloudera partnership allows customers to run Cloudera with GoGrid's cloud infrastructure. What makes GoGrid unique is its 1-Button Deploy orchestration process, according to John Keagy, founder and CEO of GoGrid.

Posted December 02, 2014

To help IT and business stakeholders take action to benefit from the emerging technologies and trends in information management, Database Trends and Applications has just published the second annual Big Data Sourcebook, a free resource.

Posted November 25, 2014

Oracle has announced an updated version of Oracle GoldenGate 12c. With GoldenGate 12c, customers can implement real-time data integration and transactional data replication between on-premises and cloud environments and across a broader set of heterogeneous platforms, achieving faster time to value and a greater return from their data assets. "Inherent in the concept of integration is that we can effectively cover both like and unlike platforms, and that we offer our customers the ability to effectively capture and move their data regardless of which systems, platforms, and vendors their data originates from," said Jeff Pollock, vice president of product management at Oracle.

Posted November 19, 2014

Splice Machine today announced the general availability of its Hadoop RDBMS, a platform to build real-time, scalable applications, that incorporates new features that emerged from charter customers using the beta offering. With the additional new features and the validation from beta customers, Splice Machine 1.0 can support enterprises struggling with their existing databases and seeking to scale out affordably, said Monte Zweben, co-founder and CEO, Splice Machine.

Posted November 19, 2014

The call for speakers for Data Summit 2015 at the New York Hilton Midtown, May 11-13, 2015, is now officially open. The deadline for submitting proposals is December 5, 2014.

Posted November 19, 2014

Concurrent has announced the latest version of Driven, a big data application performance-monitoring and management system. Driven is purpose-built to address the challenges of enterprise application development and deployment for business-critical data applications, delivering control and performance management for enterprises seeking to achieve operational excellence.

Posted November 18, 2014

Seagate Technology, a provider of storage solutions, has introduced ClusterStor Hadoop Workflow Accelerator. The solution is expected to be a boon to computationally intensive high performance data analytics environments, enabling them to achieve a significant reduction in data transfer time.

Posted November 17, 2014

Talend has introduced a new release of its integration platform. The 5.6 release sets new benchmarks for big data productivity and profiling, innovates in MDM with efficiency controls, and broadens Internet of Things (IoT) device connectivity.

Posted November 12, 2014

Rocket Software has announced Rocket Data Virtualization version 2.1, a mainframe data virtualization solution for universal access to data, regardless of location, interface or format.

Posted November 10, 2014

Hadoop is one of the best-known technologies within the big data realm. However, deploying a Hadoop environment is not a simple task. To help address the challenges for prospective Hadoop customers, Cloudera, which offers analytic data management based on Apache Hadoop, and CenturyLink, which provides managed services in the cloud, have formed a partnership.

Posted November 04, 2014

Platfora, which provides a big data analytics platform built natively on Hadoop and Spark, has introduced Platfora 4.0 with advanced visualizations, geo-analytics capabilities, and collaboration features to enable users with a range of skill levels to work iteratively with data at scale.

Posted October 28, 2014

Protegrity, a provider of data security solutions, has announced an expanded partnership with Hadoop platform provider Hortonworks. Protegrity Avatar for Hortonworks extends the capabilities of HDP native security with Protegrity Vaultless Tokenization (PVT) for Apache Hadoop, Extended HDFS Encryption, and the Protegrity Enterprise Security Administrator, for advanced data protection policy, key management and auditing.

Posted October 28, 2014

Big data continues to grow at an exponential rate for many enterprises. One issue that continues to grow as well is the threat to data security.

Posted October 28, 2014

At SAP TechEd & d-code, SAP announced new innovations for the latest release of SAP HANA, the fall update of SAP HANA Cloud Platform, and a new SAP API Management technology.

Posted October 22, 2014

Attunity has introduced Replicate 4.0, which provides high-performance data loading and extraction for Apache Hadoop. The solution has been certified with the Hortonworks and Cloudera Hadoop distributions.

Posted October 22, 2014

Apache Hadoop has been a great technology for storing large amounts of unstructured data, but to do analysis, users still need to reference data from existing RDBMS-based systems. This topic was addressed in "From Oracle to Hadoop: Unlocking Hadoop for Your RDBMS with Apache Sqoop and Other Tools," a session at the Strata + Hadoop World conference, presented by Guy Harrison, executive director of Research and Development at Dell Software, David Robson, principal technologist at Dell Software, and Kathleen Ting, a technical account manager at Cloudera and a co-author of O'Reilly's Apache Sqoop Cookbook.

Posted October 22, 2014

In his presentation at the Strata + Hadoop World conference, titled "Unseating the Giants: How Big Data is Causing Big Problems for Traditional RDBMSs," Monte Zweben, CEO and co-founder of Splice Machine, addressed the topic of scale-up architectures as exemplified by traditional RDBMS technologies versus scale-out architectures, exemplified by SQL on Hadoop, NoSQL and NewSQL solutions.

Posted October 22, 2014

MapR Technologies, one of the top-ranked distributors for Hadoop, has announced that MapR-DB is now available for unlimited production use in the freely downloadable MapR Community Edition. "From a developer standpoint, they can combine the best of Hadoop, which is deep predictive analytics across the data, as well as a NoSQL database for real-time operations," explained Jack Norris, chief marketing officer for MapR Technologies.

Posted October 21, 2014

Companies are facing the "big squeeze" created by IT budgets that are relatively flat, growing by only 3% to 4% a year, versus data growth that is averaging 30% to 40%, and a consensus that data is a valuable commodity that cannot be thrown away, said Monte Zweben, CEO and co-founder of Splice Machine in his presentation at the Strata + Hadoop World conference.

Posted October 21, 2014

Datameer has introduced Datameer 5.0 with Smart Execution, a technology that examines dataset characteristics, analytics tasks and available system resources to determine the most appropriate execution framework for each workload.

Posted October 21, 2014

At Strata + Hadoop World in New York, Microsoft announced an update to Microsoft Azure HDInsight, its cloud-based distribution of Hadoop. Customers can now process millions of Hadoop events in near real time, with Microsoft's preview of support for Apache Storm clusters in Azure HDInsight. In addition, as part of its integration with the Azure platform, Hortonworks announced that the Hortonworks Data Platform (HDP) has achieved Azure Certification.

Posted October 20, 2014

With Cloudera 5.2 the focus is on building products to deliver on the promise of the enterprise data hub that Cloudera introduced last year, said Clarke Patterson, senior director of product marketing at Cloudera. In particular, new capabilities make the technology more accessible to users who are not data scientists and also increase the level of security, two hurdles which can stand in the way of Hadoop adoption.

Posted October 15, 2014

EMC and Pivotal have announced the Data Lake Hadoop Bundle 2.0, generally available today. The bundle includes EMC's Data Computing Appliance (DCA), a high-performance big data computing appliance for deployment and scaling of Hadoop and advanced analytics; Isilon scale-out NAS (network attached storage); the Pivotal HD Hadoop distribution; and the Pivotal HAWQ parallel SQL query engine. The idea is to provide a turn-key offering that combines compute, analytics and storage for customers building scale-out data lakes for enterprise predictive analytics.

Posted October 14, 2014
