Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

Pentaho has announced version 5.1 of its business analytics and data integration platform. The new release enables code-free analytics directly on the NoSQL database MongoDB, simplifies the data preparation process for data scientists, and offers full support for MapReduce 2.0 (YARN).

Posted June 24, 2014

Data security specialist Dataguise has introduced Dataguise for Data Governance suite, a solution suite for big data governance that will initially support Oracle, IBM DB2, SQL Server, Teradata, Cloudera, Hortonworks, MapR and Pivotal HD.

Posted June 20, 2014

In "Big Data at Work," Tom Davenport explains to readers why big data is important to them and their organizations, what technology they actually need to manage it, and where to start capitalizing on its potential. Here, the author shares an excerpt from his recent book.

Posted June 17, 2014

Big data analytics hold great promise, but the present methods of mining and managing big data are still evolving and pose serious security and privacy challenges. Confronting these challenges is essential if the potential of big data is to be fully exploited.

Posted June 17, 2014

The database market is heating up again. Recent venture funding and acquisition announcements from key Hadoop and NoSQL startups are drawing attention to the big data space.

Posted June 17, 2014

Today, many businesses are looking for ways to rationalize their mainframe capacity in order to contain MIPS growth as well as defer costly upgrades. Unfortunately, some of these stopgap measures involve costly trade-offs, such as keeping only the most recent or even archiving critical data to tape. This can mean that valuable information goes untapped.

Posted June 11, 2014

With the latest release of its flagship product, Talend, a provider of big data integration software, enhances Talend's performance and scalability on Hadoop by an average of 45% and addresses the challenge posed for many companies by the limited pool of knowledgeable Hadoop developers.

Posted June 10, 2014

Dataguise, a provider of big data security and protective intelligence solutions, has introduced the Dataguise DgSecure data protection platform with the MapR Sandbox for Hadoop, and also announced the Dataguise DgSecure data protection platform with the Hortonworks HDP 2.1 Sandbox. "In order to realize the benefits of big data, enterprises are getting more proactive about addressing security, privacy and risk in Hadoop," said Patty Nghiem, vice president of marketing and business development at Dataguise.

Posted June 10, 2014

Cloudera has acquired Gazzang to add enterprise-grade data encryption and key management. The move is designed to address the challenges associated with securing and processing sensitive and legally protected data within the Hadoop ecosystem, and also fulfill a requirement posed by compliance regulations such as HIPAA-HITECH, PCI-DSS, FERPA and the EU Data Protection Directive.

Posted June 10, 2014

MarkLogic CEO Gary Bloom and his team brought the MarkLogic World Tour 2014 to Wall Street to showcase customer use cases and explain the key new features coming out in MarkLogic 8. The overarching theme for the new release is ease of use, said Bloom.

Posted June 04, 2014

Actian Corporation has announced that the Actian Analytics Platform, an end-to-end analytics platform that runs natively in Hadoop, is addressing the current challenges facing business analysts who want to use SQL on Hadoop.

Posted June 03, 2014

MapR has launched a Hadoop application gallery to make it easier for companies to find solutions within the Hadoop ecosystem. The company also announced a partnership with Syncsort targeted at helping customers move their mission-critical workloads to Hadoop.

Posted June 03, 2014

RainStor 5.5 has been certified to run on Cloudera Enterprise 5. This new certification enables Cloudera customers to run RainStor natively on HDFS, while offering enterprise-grade security features. "RainStor running on Cloudera Enterprise 5 is a significant step forward for customers taking a serious look at Hadoop," said Mark Cusack, chief architect, RainStor.

Posted May 22, 2014

Fast decision-making depends on real-time data movement that allows businesses to gather data from multiple locations into Hadoop as well as conventional data warehouses. Unfortunately, traditional ETL tools use slow data-scraping techniques that put a heavy load on operational systems and cannot meet the low latency required by many businesses.

Posted May 21, 2014

Concurrent CEO Gary Nakamura says the latest release of Cascading will give enterprises the flexibility to build data-oriented applications on Hadoop once, and then run the applications on the platform that best meets their business needs. "What we are providing is a standard way to develop data-centric applications without the risk of having to rewrite those applications when distributions or the providers of the computation engines underneath it change direction one day."

Posted May 13, 2014

New Cloudera certification enables customers to use Tungsten Replicator 3.0 to replicate transactions from operational database systems such as MySQL, MariaDB and Oracle to Cloudera Enterprise 5 in real-time. The Cloudera certification confirms that the data replicated by Tungsten Replicator matches the source and target to ensure data quality, and does not stress either side of the replication stream in the process.

Posted May 12, 2014

DataStax and Databricks are partnering to integrate Cassandra and Spark. "More and more, we see customers in the community wanting to do analytics on data in as real time as possible. That is what this is really about," said Martin Van Ryswyk, executive vice president of engineering, DataStax.

Posted May 12, 2014

The value proposition for the Splice Machine database, according to Monte Zweben, CEO and cofounder of Splice Machine, is that it enables companies to replace traditional RDBMSs when they hit a wall, either from a performance or cost perspective, with a full-featured, transactional SQL database on Hadoop, to power both operational applications and real-time analytics.

Posted May 12, 2014

It's an inevitable fact that every software system will have problems, but an enterprise-grade Hadoop infrastructure puts minimizing and managing these system errors at the forefront. When considering a distribution's dependability, you should evaluate a Hadoop distribution's position in five foundational necessities.

Posted May 08, 2014

MongoDB's Kelly Stirman and Cloudera's Yuri Bukhan recently talked with DBTA about the companies' new partnership and what it will mean for the big data ecosystem in the future. There is a need to demistify big data, they say, so that organizations can understand what technologies are right for their individual needs.

Posted May 08, 2014

Just after data is created, there is high value attached to it. As data begins to age, its value does not diminish, but the nature of that value begins to change. For many enterprises that are dealing with large data volumes, timely data access can be a major issue, especially when customers demand quick response times. Let's examine the challenge of processing data in real time reliably and meeting customers' expectations for quick responses.

Posted April 30, 2014

TEKsystems, an IT staffing solutions provider, says that employers are finding it increasingly difficult to hire business intelligence and security experts.

Posted April 23, 2014

Hadoop distribution companies Cloudera, Hortonworks and MapR have joined a new Big Data Protection Partner Program launched by Dataguise, a provider of data privacy protection and risk management analytics.

Posted April 22, 2014

MapR has added the complete Apache Spark technology stack to the MapR Distribution for Hadoop. Spark is an Spark is an in-memory processing framework that provides speed, programming ease, and advantages for real-time processing.

Posted April 15, 2014

New offerings for IBM System z are aimed at helping customers with rapid development and deployment of mobile applications as well as the ability to integrate them with core business processes, applications, and data. As part of this effort, As part IBM is enabling the industry's first commercial Hadoop for Linux on System z - zDoop software - provided through through Veristorm, an IBM partner.

Posted April 14, 2014

IBM, which marked the 50th anniversary of the mainframe today, looked to the future by rolling out new mobile, storage, cloud, and Hadoop for Big Data offerings for System z. According to IBM, as it celebrates this landmark occasion, more than 70% of enterprise data resides on a mainframe and 71% of all Fortune 500 companies have their core businesses on a mainframe

Posted April 08, 2014

Splice Machine is the newest member of the more than 800-member Cloudera Connect Partner Program. According to Splice Machine, its technology enables Cloudera users to tap into real-time updates with transactional integrity and standard ANSI SQL, which the company says are necessary features for organizations that are looking to become real-time, data-driven businesses.

Posted April 07, 2014

InfiniDB has announced the results of a new, independent benchmark from Radiant Advisors that examined the performance of leading open source SQL-on-Hadoop query engines, including InfiniDB for Hadoop 4.0

Posted April 07, 2014

Teradata has introduced the Teradata Database 15 with a new software product called Teradata QueryGrid that provides virtual compute capability within and beyond the Teradata Unified Data Architecture. The company also announced Teradata Active Enterprise Data Warehouse 6750 platform with new capabilities to support customers' most demanding real-time workloads.

Posted April 07, 2014

About 3 years ago, the AMP (Algorithms, Machines, People) lab was established at U.C. Berkeley to attack the emerging challenges of advanced analytics and machine learning on big data. The resulting Berkeley Data Analytics Stack—particularly the Spark processing engine—has shown rapid uptake and tremendous promise.

Posted April 04, 2014

A new partnership between Hortonworks and LucidWorks, which provides a search development platform leveraging Apache Solr, will enable users throughout an organization to easily access and gain insight from big data sets that were previously available only to developers, analyst and data scientists.

Posted April 03, 2014

The theme for COLLABORATE 14-IOUG Forum is "Become Your Office Superhero," because, while you may look like a mild mannered technical resource in meetings or at your desk, you fight a daily battle to protect your organization's data, improve performance and generate new business opportunities. COLLABORATE is your chance to recharge your superpowers and to take on new skills.

Posted April 02, 2014

Continuent, Inc., a provider of open source database clustering and replication solutions, has announced the availability of Continuent Tungsten Replicator 3.0, an open source replication solution for Hadoop.

Posted April 01, 2014

Cloudera has launched what it describes as the industry's first hands-on Cloudera Certified Professional: Data Scientist (CCP:DS) data science certification. According to Cloudera, it is launching the data science certification program now to address to address a pressing challenge in the IT industry: job openings for data scientists are currently outpacing the supply of these in-demand workers, a situation that is aggravated by the fact that there has historically not been a clearly established skill set or university degree that an individual could acquire to qualify as a data scientist.

Posted March 28, 2014

The need for better Hadoop security is widely acknowledged. However, the transformative potential of big data is spurring the industry to quickly fill Hadoop's security gaps. To keep pace with these developments, organizations must keep a close watch on the new tools and practices being deployed.

Posted March 27, 2014

Cloud technologies and frameworks have matured in recent years and enterprises are realizing the benefits cloud adoption presents. The future of cloud deployments will involve rapid adoption of new technology frameworks beyond Hadoop, open standards in the area of cloud security, identity, and trust, as well as a universal and simple query language for aggregating data from legacy and emerging data stores.

Posted March 27, 2014

Datameer, which provides a self-service and schema-free big data analytics application for Hadoop, has introduced Datameer 4.0 which enables big data analytics workflow with visual insights at every step of analysis.

Posted March 27, 2014

Today, businesses are ending up with more and more critical dependency on their data infrastructure. If underlying database systems are not available, manufacturing floors cannot operate, stock exchanges cannot trade, retail stores cannot sell, banks cannot serve customers, mobile phone users cannot place calls, stadiums cannot host sports games, gyms cannot verify their subscribers' identity. Here is a look at some of the trends and how they are going to impact data management professionals.

Posted March 26, 2014

Continuing to expands its Asia-Pacific presence, San Jose-based MapR has opened a new Melbourne, Australia office.

Posted March 24, 2014

Cloudera has closed on a new round of funding for $160 million which will be used to further drive the enterprise adoption of and innovation in Hadoop and promote the enterprise data hub (EDH) market; support geographic expansion into Europe and Asia; expand its services and support capabilities; and scale the field and engineering organizations. The funding round was led by T. Rowe Price, and included an investment by Google Ventures and an affiliate of MSD Capital, L.P., the private investment firm for Michael S. Dell and his family.

Posted March 18, 2014

Pivotal has introduced Pivotal HD 2.0 and Pivotal GemFire XD, which along with the HAWQ query engine, form the foundation for the Business Data Lake architecture, a big data application framework for enterprise

Posted March 17, 2014

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15

Sponsors