Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

AtScale is collaborating with Microsoft to add the AtScale Intelligence Platform to Azure HDInsight, an Apache Hadoop-based service in the cloud. By deploying AtScale on HDInsight, customers can connect any business intelligence (BI) tool to Hadoop in minutes, according to the vendor.

Posted July 28, 2015

The Hortonworks Data Platform (HDP) version 2.3 is now available, with enhancements for user experience for both operators and developers, new security and data governance capabilities, a new cluster monitoring service for support subscription customers.

Posted July 22, 2015

CA Technologies has enhanced its DevOps portfolio to make the job of IT operations teams easier. The enhanced monitoring solutions for agile operations provide the speed and scale organizations need to rapidly deploy new applications, monitor dynamic environments, and continually optimize application quality.

Posted July 20, 2015

Kyvos Insights, a big data analytics company, emerged from stealth mode to introduce a solution called Kyvos that it says provides insights from all corporate data, regardless of size and granularity.

Posted July 13, 2015

Responding to a growing need to deliver more data to the right people at the right time, Cloudera and Teradata have announced the Teradata Appliance for Hadoop with Cloudera—the enterprise-ready Hadoop distribution.

Posted July 09, 2015

RedPoint Global's two platforms, Interaction and Data Management, are now available on the Microsoft Azure Marketplace, giving data-driven marketers access RedPoint's data and campaign management tools on the cloud for the first time. "We want to go into this cloud world with our software and we're changing the paradigm of the way people even buy software and the way they do their processing of their data," said Dale Renner, CEO and founder of RedPoint. "What we're doing is basically allowing our clients to very rapidly deploy solutions where they don't even have to buy all the software."

Posted July 07, 2015

Fortscale Security Ltd., a provider of user behavior analytics software for enterprise security, is partnering with Cloudera to enable customers to build end-to-end Hadoop-based security analytics solutions to collect, manage and analyze logs for discovery of security threats. Fortscale has certified its user behavior analytics solution on Cloudera Enterprise.

Posted July 01, 2015

What matters most in data management in 2015? There are a lot of moving parts that data managers and professionals need to attend to in today's enterprises. Databases need to be wide open and accessible to all parts of the business, but at the same time, secure and free of tampering. Unstructured forms of data—such as log data, documents, graphics, video, and social data—need to be prepared and ready for analysis in the same way structured files have been ready for years.

Posted June 25, 2015

Pentaho is now offering support for integration with SAP HANA and Amazon Elastic MapReduce (EMR). Dubbed Pentaho 5.4, available now, the solution offers new capabilities that build on a pragmatic and future-proofed platform for big data orchestration and analytics at scale.

Posted June 24, 2015

Splice Machine has achieved certification on the MicroStrategy Analytics Platform, enabling faster query times and real-time updates. "Enterprises of all sizes are dealing with massive data sets on a daily basis, so there is a demand for a scalable, affordable database that can integrate seamlessly with a powerful unified analytics platform," said Monte Zweben, co-founder and CEO of Splice Machine.

Posted June 23, 2015

MapR Technologies is collaborating with Google Cloud Platform to provide $500 of credit to spend on Google Cloud Platform services for each person who registers for its free Hadoop On-Demand Training program. The incentive gives students an opportunity to complete labs using Google's cloud infrastructure at no charge, further supporting Hadoop skills development.

Posted June 23, 2015

Dataguise has rolled out DgSecure version 5.0, the latest generation of its DgSecure platform. The new release aims to enable businesses to scale sensitive data discovery, automate data protection, and achieve a comprehensive view of their sensitive data assets across the full spectrum of big data and traditional data repositories, both on-premise and in popular cloud platforms.

Posted June 23, 2015

Syncsort is partnering with Dell and Cloudera to release a joint solution that will help enterprises reduce costs and improve business analytics. The partnership will create a Dell-Syncsort architecture that will combine Syncsort's Hadoop extract-transform-load tool, DMX-h, with Cloudera's Enterprise Data Hub, enabling customers to offload targeted ETL workloads and data from data warehouses to Cloudera's platform, increasing performance and freeing up valuable cycles in the EDW.

Posted June 23, 2015

JethroData has closed an $8.1 million Series B financing round led by Square Peg Capital, and including existing investor Pitango Venture Capital. This funding round comes on the heels of the general availability of JethroData's SQL-on-Hadoop software, which uses indexes to query data up to 100 times faster than alternative SQL on Hadoop solutions.

Posted June 10, 2015

MapR Technologies is working with Microsoft to offer the MapR Distribution including Apache Hadoop on Azure cloud services. MapR is architected to provide a scalable and agile Hadoop platform and NoSQL database for big and fast data applications. The MapR Distribution will also be integrated with the Azure Data Lake, enabling users to deploy tiered analytical and storage capabilities in the cloud.

Posted June 10, 2015

Concurrent has announced general availability of Cascading 3.0, a platform for building and deploying enterprise applications on Hadoop. "This new, available version of Cascading will enable our users even further by simplifying application development, accelerating time to market and allowing enterprises to leverage existing, and more importantly, new and emerging data infrastructure and programming skills," said Chris Wensel, founder and CTO, Concurrent, Inc.

Posted June 10, 2015

Syncsort is working with Dell to help businesses improve operational efficiency and lower costs by shifting expensive workloads and associated data from enterprise data warehouses (EDW) to Hadoop.

Posted June 10, 2015

Map R is releasing version 5.0 of the MapR Distribution including Hadoop that will process big and fast data on a single data platform that enables a new class of real-time applications.

Posted June 09, 2015

To help users extract insights from data lakes,Teradata has made a multi-year commitment to contribute to Presto's open source development. Based on a three-part roadmap, Teradata's says its contributions will be 100% open source under the Apache license and will advance Presto's code base, scalability, iterative querying, and ability to query multiple data repositories.

Posted June 08, 2015

New and emerging companies are taking aim at data management challenges to help customers get more value from their data. Here, DBTA showcases the approaches of 10 companies we think are worth watching.

Posted June 01, 2015

Cloudera is now offering support for Capgemini's new reference architecture for the SAP HANA platform and Cloudera Enterprise. "By bringing the power of Cloudera's enterprise data hub offering to the ecosystem in support of SAP HANA, we can enable Capgemini's clients to expand the amount of data they have within their environment in a cost-efficient manner," said Tim Stevens, vice president of corporate and business development at Cloudera.

Posted May 28, 2015

Many DBAs are now tasked with managing multi-vendor environments, and handling a variety of data types. Increasingly, DBAs are turning to strategies such as database automation to be able to concentrate more on the big picture of moving their enterprises forward.

Posted May 28, 2015

At Data Summit 2015 in New York City, Tony Shan, chief architect, Wipro, gave a talk on the key components of a successful big data methodology and shared lessons learned from real world big data implementations. According to Shan, there is an 8-step process for a big data framework with specific techniques and methods.

Posted May 26, 2015

Building on its native support for a variety of database platforms - including Apache Hive for Hadoop, MongoDB Enterprise, and traditional relational database management systems (RDBMS) - Embarcadero Technologies' data modeling tool, ER/Studio Data Architect version 10.0, has been certified by HortonWorks and Mongo DB Enterprise.

Posted May 21, 2015

Rosslyn Analytics, a provider of big data cloud technology, has announced it is one of the first to offer companies analytics as a service on Azure. The big data cloud analytics platform, powered by Azure, provides self-service management from source to analytics and enables business and IT users to interact with, change, and analyze data using a combination of self-service data integration, cleansing and enrichment tools and machine learning and visualization technologies.

Posted May 20, 2015

Oracle is shipping a new big data product called Oracle Big Data Spatial and Graph. Spatial and graph analytics has been available as an option for Oracle Database for more than 10 years, and with this introduction the company is bringing spatial and graph analytics to Hadoop and NoSQL.

Posted May 20, 2015

MapR Technologies, Inc., a provider of a distribution for Apache Hadoop, is including Apache Drill 1.0 in the MapR Distribution.

Posted May 19, 2015

Google white papers have inspired many great open source projects. What has been missing until now, however, has been a way of bringing these technologies together such that any data-centric organization can benefit from the capabilities of each technology across its entire data center, and in new ways not documented by any single white paper. This is called the "Zeta Architecture."

Posted May 19, 2015

The demand for effective data management is intensifying. At the same time, the database market has expanded into a wide array of solutions—from traditional relational database management systems to alternative databases such as NoSQL, NewSQL, cloud, and in-memory offerings.

Posted May 19, 2015

RedPoint Global was founded in 2006 by Dale Renner, Lewis Clemmens, and George Corugedo, who previously had worked together at Accenture. Based in Wellesley, Mass., RedPoint collaborates with clients around the world in 11 different verticals. "We have always been very focused on the data, and recognize that a lot of business problems live and die by the quality of the data," says Corugedo.

Posted May 19, 2015

Hadoop is contributing to the success of data analytics. Anad Rai, IT manager at Verizon Wireless, examined the differences between traditional versus big data at Data Summit 2015 in a session titled "Analytics: Traditional Versus Big Data." The presentation, which was part of the IOUG track moderated by Alexis Bauer Kolak, education manager at the IOUG, showed how big data technologies are helping data discovery and improving the transformation of information and knowledge into wisdom.

Posted May 14, 2015

The data lake is one of the hottest topics in the data industry today. It is a massive storage reservoir that allows data to be stored in its rawest forms. Hadoop Day at Data Summit 2015 concluded with a panel on everything data lake featuring James Casaletto, solutions architect for MapR, Joe Caserta, president and founder of Caserta Concepts, and George Corugedo, CTO with RedPoint Global Inc.

Posted May 14, 2015

With the influx of big data solutions and technologies comes a bevy of new problems, according to Data Summit 2015 panelists Miles Kehoe, search evangelist at Avalon Consulting, and Anne Buff, business solutions manager for SAS best practices at the SAS Institute. Kehoe and Buff opened the second day of Data Summit with a keynote discussion focusing on resolving data conundrums.

Posted May 14, 2015

To transform data into value, IT must move from thinking about what it does to data, and instead focus on business outcomes and what can be done with the data to advance the business, according to Edd Dumbill, vice president, strategy, Silicon Valley Data Science, who gave the welcome keynote at Data Summit 2015.

Posted May 14, 2015

Splice Machine is partnering with Talend to enable customers to simplify data integration and streamline data workflows on Hadoop. Through this partnership, organizations building operational data lakes with Splice Machine can augment Talend's data integration technology with its data quality capabilities.

Posted May 12, 2015

Pentaho users will now be able to use Apache Spark within Pentaho thanks to a new native integration solution that will enable the orchestration of all Spark jobs. Pentaho Data Integration (PDI), an effort initiated by Pentaho Labs, will enable customers to increase productivity, reduce maintenance costs, and dramatically lower the skill sets required as Spark is incorporated into big data projects.

Posted May 12, 2015

Pivotal has made updates to its big data suite that include upgrades to the Pivotal HD enterprise-grade Apache Hadoop distribution, which is now based on the Open Data Platform core, and performance improvements for Pivotal Greenplum Database.

Posted May 05, 2015

The Spring 2015 release of the SnapLogic Elastic Integration Platform extends the platform's cloud and big data integration capabilities to the Internet of Things (IoT) with support for Message Queuing Telemetry Transport (MQTT), a lightweight machine-to-machine connectivity protocol.

Posted May 05, 2015

Splice Machine, a provider of Hadoop RDMS, announced that it is partnering with mrc (michaels, ross & cole ltd), to allow Splice Machine's Hadoop RDBMS to be certified and integrated with mrc's m-Power platform. "Our partnership with mrc gives businesses a solution that can speed real-time application deployment on Hadoop with the staff and tools they currently have, while also offering affordable scale-out on commodity hardware for future growth," said Monte Zweben, co-founder and CEO, Splice Machine.

Posted April 28, 2015

Pivotal HAWQ is now available on the Hortonworks Data Platform (HDP), enabling the benefits of SQL on Hadoop to be leveraged by enterprises that are investing in HDP. This marks the first time that the features and capabilities of Pivotal HAWQ have been made available outside of Pivotal. The availability aligns with a common Open Data Platform (ODP) Core that allows users to leverage the best-of-breed technology across providers.

Posted April 27, 2015

The future will flourish with machines. We've been told this in pop culture for decades, from the helpful robots of the Jetsons, to the infamous Skynet of the Terminator movies, to the omniscient "computer" of Star Trek. Smart, connected devices will be ubiquitous and it's up to us, the humans, to decide what's next. But the Internet of Things (IoT) is about more than devices and data.

Posted April 23, 2015

SUSE and Veristorm are partnering to provide certified high-performance Hadoop solutions that run directly on Linux on IBM z Systems, IBM Power Systems, and x86-64. Customers with IBM z Systems can team SUSE Linux Enterprise Server for System z with Veristorm zDoop, a commercial distribution of Hadoop supported on mainframes.

Posted April 23, 2015

While the new data stores and other software components are generally open source and incur little or no licensing costs, the architecture of the new stacks grows ever more complex, and this complexity is creating a barrier to adoption for more modestly sized organizations.

Posted April 22, 2015

To help organizations answer questions with data spread across disparate analytics systems and data repositories, Teradata has expanded its QueryGrid technologies. "With this announcement we have our foot on the gas pedal," Imad Birouty, director of product marketing, Teradata. "We have seven updates. We are announcing new connectors that are on their way, announcing that we have delivered on the connectors that we previously announced, and we are refreshing previously-released connector versions of the technologies."

Posted April 20, 2015

Unstructured data types and new database management systems are playing an increasing role in the modern data ecosystem, but structured data in relational database management systems (RDBMS) remains the foundation of the information infrastructure in most companies. In fact, structured data still makes up 75% of data under management for more than two-thirds of organizations, with nearly one-third of organizations not yet actively managing unstructured data at all, according to a new survey commissioned by Dell Software and conducted by Unisphere Research, a division of Information Today, Inc.

Posted April 15, 2015

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15

Sponsors