Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

As analytics continues to play a larger role in the enterprise, the need to leverage and protect the data looms larger. According to the IDC, the big data and analytics market will reach $125 billion worldwide in 2015. Here are 10 predictions from industry experts about the data and analytics in 2015.

Posted December 19, 2014

In its fourth big data-related acquisition this year, Teradata announced it has acquired RainStor, a privately held company specializing in online big data archiving on Hadoop. RainStor's technology offers three key advantages, explains Chris Twogood, vice president of products and services at Teradata. It enables extreme data compression with the ability to compress data from 10s to 40x, Rainstor data is immutable which is important for compliance and security regulations, and it is all accessible by SQL.

Posted December 18, 2014

2015 is going to be a big year for big data in the enterprise, according to Oracle. Neil Mendelson, Oracle vice president of big data and advanced analytics, shared Oracle's "Top 7" big data predictions for 2015. "The technology is moving very quickly and it is gaining to the point where a broader set of people can get into it - not just because it is affordable - but because they no longer require specialized skills in order to take advantage of it," he said.

Posted December 17, 2014

Data is increasingly being recognized as a rich resource flowing through organizations from a continually growing range of sources. But to realize its full potential, this data must be accessed by an array of users to support both real-time decision making and historical analysis, integrated with other information, and still kept safe from hackers and others with malicious intent. Fortunately, leading vendors are developing products and services to help. Here, DBTA presents the list of Trend-Setting Products in Data and Information Management for 2015.

Posted December 17, 2014

BlueData, which provides EPIC Enterprise, software to enable enterprises to create a self-service cloud experience on premise, has announced the growth of its big data partner ecosystem. Thirteen new companies spanning infrastructure, big data distributions, ETL/BI applications and system integrators have joined the partner program to accelerate the adoption of big data private cloud on-premises.

Posted December 16, 2014

There is still time to submit a speaking proposal for DBTA's Data Summit 2015, which will take place at the New York Hilton Midtown, May 11-13, 2015.

Posted December 08, 2014

Real-time fraud analytics provider Argyle Data is partnering with Hortonworks, a contributor to and provider of enterprise Apache Hadoop. By joining the Hortonworks Technology Partner Program, Argyle says it can rely on Hadoop to help strengthen its ability to drive and lead efforts related to advanced analytics and emerging technologies, including petabyte-scale data storage, machine learning and deep-packet inspection (DPI).

Posted December 02, 2014

GoGrid, an infrastructure-as-a-service provider specializing in multi-cloud solutions, is partnering with Cloudera, a provider of Hadoop-based software and services. The partnership will allow companies to evaluate and run the platform for big data through Cloudera Live. Traditional methods of deploying Hadoop require on-premise work to be done to just test potential solutions. The GoGrid-Cloudera partnership allows customers to run Cloudera with GoGrid's cloud infrastructure. What makes GoGrid unique is its 1-Button Deploy orchestration process, according to John Keagy, founder and CEO of GoGrid.

Posted December 02, 2014

To help IT and business stakeholders take action to benefit from the emerging technologies and trends in information management, Database Trends and Applications has just published the second annual Big Data Sourcebook, a free resource.

Posted November 25, 2014

Oracle has announced an updated version of Oracle GoldenGate 12c. With GoldenGate 12c, customers can implement real-time data integration and transactional data replication between on-premises and cloud environments and across a broader set of heterogeneous platforms, achieving faster time to value and a greater return from their data assets. "Inherent in the concept of integration is that we can effectively cover both like and unlike platforms, and that we offer our customers the ability to effectively capture and move their data regardless of which systems, platforms, and vendors their data originates from," said Jeff Pollock, vice president of product management at Oracle.

Posted November 19, 2014

Splice Machine today announced the general availability of its Hadoop RDBMS, a platform to build real-time, scalable applications, that incorporates new features that emerged from charter customers using the the beta offering. With the additional new features and the validation from beta customers, Splice Machine 1.0 can support enterprises struggling with their existing databases and seeking to scale-out affordably, said Monte Zweben, co-founder and CEO, Splice Machine.

Posted November 19, 2014

The call for speakers for Data Summit 2015 at the New York Hilton Midtown, May 11-13, 2015, is now officially open. The deadline for submitting proposals is December 5, 2014.

Posted November 19, 2014

Concurrent has announced the latest version of Driven, a big data application performance-monitoring and management system. Driven is purpose-built to address the challenges of enterprise application development and deployment for business-critical data applications, delivering control and performance management for enterprises seeking to achieve operational excellence.

Posted November 18, 2014

Seagate Technology, a provider of storage solutions, has introduced ClusterStor Hadoop Workflow Accelerator. The solution is expected to be a boon to computationally intensive high performance data analytics environments, enabling them to achieve a significant reduction in data transfer time.

Posted November 17, 2014

Talend has introduced a new release of its integration platform. The 5.6 release sets new benchmarks for big data productivity and profiling, innovates in MDM with efficiency controls, and broadens Internet of Things (IoT) device connectivity.

Posted November 12, 2014

Rocket Software has announced Rocket Data Virtualization version 2.1, a mainframe data virtualization solution for universal access to data, regardless of location, interface or format.

Posted November 10, 2014

Hadoop is one of the best-known technologies within the big data realm. However, deploying a Hadoop environment is not a simple task. To help address the challenges for prospective Hadoop customers, Cloudera, which offers analytic data management based on Apache Hadoop, and CenturyLink, which provides managing services in the cloud, have formed a partnership.

Posted November 04, 2014

Platfora, which provides a big data analytics platform built natively on Hadoop and Spark, has introduced Platfora 4.0 with advanced visualizations, geo-analytics capabilities, and collaboration features to enable users with a range of skill levels to work iteratively with data at scale.

Posted October 28, 2014

Protegrity, a provider of data security solutions, has announced an expanded partnership with Hadoop platform provider Hortonworks. Protegrity Avatar for Hortonworks extends the capabilities of HDP native security with Protegrity Vaultless Tokenization (PVT) for Apache Hadoop, Extended HDFS Encryption, and the Protegrity Enterprise Security Administrator, for advanced data protection policy, key management and auditing.

Posted October 28, 2014

Big data continues to grow at an exponential rate for many enterprises. One issue that continues to grow as well is the threat to data security.

Posted October 28, 2014

At SAP TechEd & d-code, SAP announced new innovations for the latest release of SAP HANA, the fall update of SAP HANA Cloud Platform, and a new SAP API Management technology.

Posted October 22, 2014

Attunity has introduced Replicate 4.0 which provides high-performance data loading and extraction for Apache Hadoop. The solution has been certified with the Hortonworks and Cloudera Hadoop distributions.

Posted October 22, 2014

Apache Hadoop has been a great technology for storing large amounts of unstructured data, but to do analysis, users still need to reference data from existing RDBMS based systems. This topic was addressed in "From Oracle to Hadoop: Unlocking Hadoop for Your RDBMS with Apache Sqoop and Other Tools," a session at the Strata + Hadoop World conference, presented by Guy Harrison, executive director of Research and Development at Dell Software, David Robson, principal technologist at Dell Software, and Kathleen Ting, a technical account manager at Cloudera and a co-author of O'Reilly's Apache Sqoop Cookbook.

Posted October 22, 2014

In his presentation at the Strata + Hadoop World conference, titled "Unseating the Giants: How Big Data is Causing Big Problems for Traditional RDBMSs," Monte Zweben, CEO and co-founder of Splice Machine, addressed the topic of scale-up architectures as exemplified by traditional RDBMS technologies versus scale-out architectures, exemplified by SQL on Hadoop, NoSQL and NewSQL solutions.

Posted October 22, 2014

MapR Technologies, one of the top ranked distributors for Hadoop, has announced that MapR-DB is now available for unlimited production use in the freely-downloadable MapR Community Edition. "From a developer standpoint, they can combine the best of Hadoop, which is deep predictive analytics across the data, as well as a NoSQL database for real-time operations," explained Jack Norris, chief marketing officer for MapR Technologies.

Posted October 21, 2014

Companies are facing the "big squeeze" created by IT budgets that are relatively flat, growing by only 3% to 4% a year, versus data growth that is averaging 30% to 40%, and a consensus that data is a valuable commodity that cannot be thrown away, said Monte Zweben, CEO and co-founder of Splice Machine in his presentation at the Strata + Hadoop World conference.

Posted October 21, 2014

Datameer has introduced Datameer 5.0 with Smart Execution, a technology that examines dataset characteristics, analytics tasks and available system resources to determine the most appropriate execution framework for each workload.

Posted October 21, 2014

At Strata + Hadoop World in New York, Microsoft announced an update to Microsoft Azure HDInsight, its cloud-based distribution of Hadoop. Customers can now process millions of Hadoop events in near real time, with Microsoft's preview of support for Apache Storm clusters in Azure HDInsight. In addition, as part of its integration with the Azure platform, Hortonworks announced that the Hortonworks Data Platform (HDP) has achieved Azure Certification.

Posted October 20, 2014

With Cloudera 5.2 the focus is on building products to deliver on the promise of the enterprise data hub that Cloudera introduced last year, said Clarke Patterson, senior director of product marketing at Cloudera. In particular, new capabilities make the technology more accessible to users who are not data scientists and also increase the level of security, two hurdles which can stand in the way of Hadoop adoption.

Posted October 15, 2014

Generally available today, EMC and Pivotal have announced the Data Lake Hadoop Bundle 2.0 that includes EMC's Data Computing Appliance (DCA), a high-performance big data computing appliance for deployment and scaling of Hadoop and advanced analytics, Isilon scale-out NAS (network attached storage), as well as the Pivotal HD Hadoop distribution and the Pivotal HAWQ parallel SQL query engine. The idea is to provide a turn-key offering that combines compute, analytics and storage for customers building scale-out data lakes for enterprise predictive analytics.

Posted October 14, 2014

Building on its data lake approach, Pivotal today announced the next step in this vision with the implementation of an architecture that builds upon disk-based storage with memory-centric processing frameworks.

Posted October 14, 2014

The new Dell In-Memory Appliance for Cloudera Enterprise is designed to provide customers with a processing engine combined with interactive analytics in a preconfigured and scalable solution, and will begin shipping Oct. 15, 2014.

Posted October 14, 2014

Dataguise, a provider of security and data governance solutions for big data, has expanded its DgSecure platform to support Hadoop in the cloud, including full support for Amazon EMR (Elastic MapReduce). Additionally, big data cloud service providers, Altiscale and Qubole, have joined Dataguise's Big Data Protection Partner Program (BDP3) to leverage DgSecure in providing comprehensive discovery, protection and visibility to sensitive data for their cloud-based Hadoop customers.

Posted October 13, 2014

Splunk, which provides software for machine-generated big data analysis, has announced Splunk Enterprise 6.2, Splunk Mint, and Splunk Hunk 6.2. "What we are doing with this release is fundamentally broadening the number of users that can do advanced analytics," stated Shay Mowlem, VP, product marketing at Splunk.

Posted October 13, 2014

IBM is adding new analytics capabilities to the mainframe platform, helping enable better data security and providing clients with the ability to integrate Hadoop big data. By applying analytic tools to business transactions as they are occurring, mainframe systems can enable clients to have true real-time insights. With the analytics on the System z platform, clients can also incorporate social media into their real-time analytic.

Posted October 13, 2014

Teradata is introducing Teradata Loom 2.3, a platform that provides integrated metadata management, data lineage, and ata wrangling for enterprise Hadoop. Teradata has also launched Teradata Cloud for Hadoop, a turnkey, full service cloud environment, and a broad technology and marketing partnership with Cloudera. "Increasingly, customers want a one-stop shop for their data analytics needs," said Chris Twogood, vice president of products and services at Teradata.

Posted October 09, 2014

One feature of the big data revolution is the acknowledgement that a single database management system architecture cannot meet all needs. However, the Lambda Architecture provides a useful pattern for combining multiple big data technologies to achieve multiple enterprise objectives. First proposed by Nathan Marz, it attempts to provide a combination of technologies that together can provide the characteristics of a web-scale system that can satisfy requirements for availability, maintainability, and fault-tolerance.

Posted October 08, 2014

Join DBTA for a webcast on Thursday, October 9, to learn about the key use cases, data replication strategies and methods for exploring data more efficiently through Hadoop.

Posted October 06, 2014

Big analytics and visualization company Datameer is releasing version 5.0 of the company's data analytics application for Hadoop. "Our vision overall is to make data simple and accessible for everyone," said Matt Schumpert, director of product management with Datameer, about version 5.0 improvements.

Posted October 01, 2014

Ron Bodkin founded Think Big Analytics to help organizations gain value from big data. Before that, he was vice president of engineering at Quantcast where he led the data science and engineering teams deploying Hadoop and NoSQL for batch and real-time decision making. In this interview, Bodkin who is CEO of Think Big, now a Teradata company, discusses the challenges organizations face in getting big data projects off the ground and what they need to consider when they embark on projects to leverage data from the Internet of Things and social media.

Posted September 25, 2014

Many in the industry have begun to look to data lakes and Hadoop as the future for data storage. To help shed light on the data lake approach, the pros and cons of this data repository were considered in a recent Unisphere webcast presented by Peter Evans, BI and analytics product evangelist and product technologist consultant, Dell Software; and Elliot King, Unisphere Research analyst.

Posted September 25, 2014

Pages
1
2
3
4

Sponsors