Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

The converging forces of open source, with its rapid crowd-sourced innovation, cloud, with its unlimited capacity and on-demand deployment options, and NoSQL database technologies, with their ability to handle unstructured data, are helping companies address the new challenges and opportunities presented by big data. Here are the winners of the 2014 DBTA Readers' Choice Awards for Best Big Data Solution.

Posted August 04, 2014

Novetta Identity Analytics has achieved Hortonworks Certification, and Novetta has joined the Hortonworks Technology Partner Program. Novetta Identity Analytics provides a central, multi-dimensional view of the entities across the data silos and uncovers the relationships within and among those entities to support customer intimacy, churn prediction, risk profiling, fraud analysis and detection, and other use cases.

Posted August 04, 2014

Tata Consultancy Services (TCS), an IT services, consulting and business solutions organization, has formed a new partnership with MapR Technologies. The two companies are developing turn-key solutions to address big data challenges.

Posted July 29, 2014

Databricks and SAP have collaborated on a Databricks-certified Apache Spark distribution offering for the SAP HANA platform. This production-ready distribution offering is the first result of a new partnership between Databricks and SAP.

Posted July 23, 2014

In "Big Data at Work," Tom Davenport explains to readers why big data is important to them and their organizations, what technology they actually need to manage it, and where to start capitalizing on its potential. Here, the author shares an excerpt from his recent book.

Posted July 23, 2014

Helping companies get more from big, unstructured data, Actian has unveiled the Actian Analytics Platform - Hadoop SQL Edition. In addition, as a result of a partnership with Logi Analytics, the Actian Analytics Platform combines with Logi Info to offer customers a comprehensive analytics platform to connect, visualize, analyze and act on big data.

Posted July 22, 2014

InfiniDB has released version 4.6 of its fourth generation columnar data platform, which includes enhanced support for large-scale join operations and support for additional data load commands to speed the extraction, transformation, and load (ETL) process. The company also introduced a new 60-day enterprise trial that includes InfiniDB Enterprise Manager, a management console to simplify the administration of InfiniDB, and provide real-time visibility into the performance and availability of users' InfiniDB servers.

Posted July 15, 2014

Oracle today introduced Oracle Big Data SQL, which allows customers to run one SQL query across Hadoop, NoSQL, and Oracle Database, minimizing data movement while overcoming data silos. According to Oracle, the new solution helps customers gain a competitive advantage by making it easier to uncover insights faster, and allows them to leverage existing SQL skills while protecting data security and enforcing governance. Oracle Big Data SQL runs on Oracle Big Data Appliance and can work in conjunction with Oracle Exadata Database Machine.

Posted July 15, 2014

As part of a collaboration to make Apache Hadoop and Spark accessible for everyday analysts, Alteryx and Databricks will become the primary committers to SparkR, a subset of the overall Apache Spark framework. In addition, Alteryx and Databricks are forming a technology and go-to-market partnership to accelerate the adoption of SparkR and SparkSQL, in order to help analysts get greater value from Spark as the leading open source in-memory engine.

Posted July 08, 2014

Data Lake Hadoop, a new bundle from Pivotal and EMC, combines scale-out enterprise storage and big data and analytics to offer a production-grade HDFS solution. The bundle consists of pre-configured EMC Isilon X410 nodes, HAWQ subscriptions and Pivotal HD, Pivotal's enterprise version of Apache Hadoop.

Posted July 08, 2014

Following a recently announced partnership with Databricks, DataStax has integrated Spark for real-time analytics into DataStax Enterprise. In addition, the 4.5 release adds the ability to merge Cassandra data with Hadoop for integration of operational and historical data. The new release has been certified with Hadoop vendors Cloudera and Hortonworks and DSE has been certified for their platforms.

Posted July 01, 2014

Pentaho has announced version 5.1 of its business analytics and data integration platform. The new release enables code-free analytics directly on the NoSQL database MongoDB, simplifies the data preparation process for data scientists, and offers full support for MapReduce 2.0 (YARN).

Posted June 24, 2014

Data security specialist Dataguise has introduced Dataguise for Data Governance suite, a solution suite for big data governance that will initially support Oracle, IBM DB2, SQL Server, Teradata, Cloudera, Hortonworks, MapR and Pivotal HD.

Posted June 20, 2014

Big data analytics hold great promise, but the present methods of mining and managing big data are still evolving and pose serious security and privacy challenges. Confronting these challenges is essential if the potential of big data is to be fully exploited.

Posted June 17, 2014

The database market is heating up again. Recent venture funding and acquisition announcements from key Hadoop and NoSQL startups are drawing attention to the big data space.

Posted June 17, 2014

Today, many businesses are looking for ways to rationalize their mainframe capacity in order to contain MIPS growth as well as defer costly upgrades. Unfortunately, some of these stopgap measures involve costly trade-offs, such as keeping only the most recent data or even archiving critical data to tape. This can mean that valuable information goes untapped.

Posted June 11, 2014

With the latest release of its flagship product, Talend, a provider of big data integration software, improves the product's performance and scalability on Hadoop by an average of 45% and addresses the challenge that the limited pool of knowledgeable Hadoop developers poses for many companies.

Posted June 10, 2014

Dataguise, a provider of big data security and protective intelligence solutions, has introduced the Dataguise DgSecure data protection platform with the MapR Sandbox for Hadoop, and also announced the Dataguise DgSecure data protection platform with the Hortonworks HDP 2.1 Sandbox. "In order to realize the benefits of big data, enterprises are getting more proactive about addressing security, privacy and risk in Hadoop," said Patty Nghiem, vice president of marketing and business development at Dataguise.

Posted June 10, 2014

Cloudera has acquired Gazzang to add enterprise-grade data encryption and key management. The move is designed to address the challenges associated with securing and processing sensitive and legally protected data within the Hadoop ecosystem, and also fulfill a requirement posed by compliance regulations such as HIPAA-HITECH, PCI-DSS, FERPA and the EU Data Protection Directive.

Posted June 10, 2014

MarkLogic CEO Gary Bloom and his team brought the MarkLogic World Tour 2014 to Wall Street to showcase customer use cases and explain the key new features coming out in MarkLogic 8. The overarching theme for the new release is ease of use, said Bloom.

Posted June 04, 2014

Actian Corporation has announced that the Actian Analytics Platform, an end-to-end analytics platform that runs natively in Hadoop, is addressing the current challenges facing business analysts who want to use SQL on Hadoop.

Posted June 03, 2014

MapR has launched a Hadoop application gallery to make it easier for companies to find solutions within the Hadoop ecosystem. The company also announced a partnership with Syncsort targeted at helping customers move their mission-critical workloads to Hadoop.

Posted June 03, 2014

RainStor 5.5 has been certified to run on Cloudera Enterprise 5. This new certification enables Cloudera customers to run RainStor natively on HDFS, while offering enterprise-grade security features. "RainStor running on Cloudera Enterprise 5 is a significant step forward for customers taking a serious look at Hadoop," said Mark Cusack, chief architect, RainStor.

Posted May 22, 2014

Fast decision-making depends on real-time data movement that allows businesses to gather data from multiple locations into Hadoop as well as conventional data warehouses. Unfortunately, traditional ETL tools use slow data-scraping techniques that put a heavy load on operational systems and cannot meet the low latency required by many businesses.

Posted May 21, 2014

Concurrent CEO Gary Nakamura says the latest release of Cascading will give enterprises the flexibility to build data-oriented applications on Hadoop once, and then run the applications on the platform that best meets their business needs. "What we are providing is a standard way to develop data-centric applications without the risk of having to rewrite those applications when distributions or the providers of the computation engines underneath it change direction one day."

Posted May 13, 2014

New Cloudera certification enables customers to use Tungsten Replicator 3.0 to replicate transactions from operational database systems such as MySQL, MariaDB and Oracle to Cloudera Enterprise 5 in real-time. The Cloudera certification confirms that the data replicated by Tungsten Replicator matches the source and target to ensure data quality, and does not stress either side of the replication stream in the process.

Posted May 12, 2014

DataStax and Databricks are partnering to integrate Cassandra and Spark. "More and more, we see customers in the community wanting to do analytics on data in as real time as possible. That is what this is really about," said Martin Van Ryswyk, executive vice president of engineering, DataStax.

Posted May 12, 2014

The value proposition for the Splice Machine database, according to Monte Zweben, CEO and cofounder of Splice Machine, is that it enables companies to replace traditional RDBMSs when they hit a wall, either from a performance or cost perspective, with a full-featured, transactional SQL database on Hadoop, to power both operational applications and real-time analytics.

Posted May 12, 2014

It's an inevitable fact that every software system will have problems, but an enterprise-grade Hadoop infrastructure puts minimizing and managing these system errors at the forefront. When considering a distribution's dependability, you should evaluate a Hadoop distribution's position in five foundational necessities.

Posted May 08, 2014

MongoDB's Kelly Stirman and Cloudera's Yuri Bukhan recently talked with DBTA about the companies' new partnership and what it will mean for the big data ecosystem in the future. There is a need to demystify big data, they say, so that organizations can understand what technologies are right for their individual needs.

Posted May 08, 2014

Just after data is created, there is high value attached to it. As data begins to age, its value does not diminish, but the nature of that value begins to change. For many enterprises that are dealing with large data volumes, timely data access can be a major issue, especially when customers demand quick response times. Let's examine the challenge of processing data in real time reliably and meeting customers' expectations for quick responses.

Posted April 30, 2014

TEKsystems, an IT staffing solutions provider, says that employers are finding it increasingly difficult to hire business intelligence and security experts.

Posted April 23, 2014

Hadoop distribution companies Cloudera, Hortonworks and MapR have joined a new Big Data Protection Partner Program launched by Dataguise, a provider of data privacy protection and risk management analytics.

Posted April 22, 2014

MapR has added the complete Apache Spark technology stack to the MapR Distribution for Hadoop. Spark is an in-memory processing framework that provides speed, programming ease, and advantages for real-time processing.

Posted April 15, 2014

New offerings for IBM System z are aimed at helping customers with rapid development and deployment of mobile applications as well as the ability to integrate them with core business processes, applications, and data. As part of this effort, IBM is enabling the industry's first commercial Hadoop for Linux on System z - zDoop software - provided through Veristorm, an IBM partner.

Posted April 14, 2014

IBM, which marked the 50th anniversary of the mainframe today, looked to the future by rolling out new mobile, storage, cloud, and Hadoop for Big Data offerings for System z. According to IBM, as it celebrates this landmark occasion, more than 70% of enterprise data resides on a mainframe and 71% of all Fortune 500 companies have their core businesses on a mainframe.

Posted April 08, 2014

Splice Machine is the newest member of the more than 800-member Cloudera Connect Partner Program. According to Splice Machine, its technology enables Cloudera users to tap into real-time updates with transactional integrity and standard ANSI SQL, which the company says are necessary features for organizations that are looking to become real-time, data-driven businesses.

Posted April 07, 2014

InfiniDB has announced the results of a new, independent benchmark from Radiant Advisors that examined the performance of leading open source SQL-on-Hadoop query engines, including InfiniDB for Hadoop 4.0.

Posted April 07, 2014

Teradata has introduced Teradata Database 15 with a new software product called Teradata QueryGrid that provides virtual compute capability within and beyond the Teradata Unified Data Architecture. The company also announced the Teradata Active Enterprise Data Warehouse 6750 platform with new capabilities to support customers' most demanding real-time workloads.

Posted April 07, 2014
