Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

Clearlake Capital Group, L.P. has completed the acquisition of Syncsort. "Our organic product investments and related software acquisitions over the last couple of years have been singularly focused on one of the most important secular growth opportunities in the technology industry today: demand for software that liberates data and existing budgets for use in advanced machine learning and business intelligence systems," said Lonne Jaffe, CEO of Syncsort.

Posted November 18, 2015

With the Hadoop ecosystem expanding rapidly, Hortonworks' Shaun Connolly recently discussed how Hadoop and related technologies are growing and being used, as well as the factors for a successful deployment.

Posted November 13, 2015

Franz, Inc., an early innovator in artificial intelligence (AI) and a supplier of semantic graph database technology, is releasing Gruff v6.0, the company's graph visualization software for exploring and discovering connections within data. Gruff provides novice users and graph experts with the ability to visually build queries and visualize connections between data without writing code, which speeds discovery and enhances the ability to uncover hidden connections within data.

Posted November 10, 2015

The concept of the data lake has become a hot topic. The data lake retains data in its original format to allow the data to be more flexible for everyone involved. While this sounds fine in theory, it is more complicated in practice due to the need for governance and security.

Posted November 09, 2015

Almost every commercial endeavor and, indeed, almost every human undertaking, has software at its core. Yet, with software at the core of so much of our society, it's surprising to realize it's getting harder and harder to actually make a living selling software. In his recent book, "The Software Paradox," Stephen O'Grady - co-founder of analyst firm RedMonk - provides a cohesive and persuasive analysis of what those of us in the software business have been experiencing for several years - it's getting increasingly difficult to generate revenues selling "shrink-wrapped" software.

Posted November 09, 2015

To better manage the data explosion now and with scalable options for the future, existing data architecture is evolving beyond traditional databases, data stores, data warehouses, and the like into a more unfiltered repository known as the data lake.

Posted November 05, 2015

Following its separation from Hewlett-Packard Company, Hewlett Packard Enterprise made its debut today as an enterprise technology provider with $53 billion in annual revenue and a comprehensive product portfolio. HP announced plans a year ago to separate into two new publicly traded companies: one comprising HP's enterprise technology infrastructure, software and services businesses, doing business as Hewlett Packard Enterprise, and the other comprising HP's personal systems and printing businesses, operating as HP Inc. and retaining the current logo.

Posted November 02, 2015

There's unrelenting pressure on businesses to compete on analytics and to be able to anticipate customer needs and trends ahead of the curve. Enterprises are looking to expand BI and analytics capabilities as far and wide as technologies and budgets will allow them to go. As a result, the continuing advance of analytic capabilities across the enterprise has reached a "tipping point."

Posted October 28, 2015

Oracle has announced new services - Oracle Big Data Preparation Cloud Service, Oracle GoldenGate Cloud Service, Oracle Big Data Discovery Cloud Service and Oracle NoSQL Database Cloud Service - that complement Oracle's Big Data Cloud Service and Big Data SQL Cloud Service, to form a comprehensive portfolio of solutions for big data in the cloud.

Posted October 28, 2015

AtScale, a company specializing in connecting business users to their data in Hadoop, is releasing a new version of its platform, introducing a new tool that will enhance performance of big data on Hadoop.

Posted October 21, 2015

MapR is including Apache Drill 1.2 in its Apache Hadoop distribution and is also now offering a new Data Exploration Quick Start Solution, leveraging Drill to help customers get started more rapidly with big data projects. Apache Drill is an open source, low-latency query engine for Hadoop that delivers secure, interactive SQL analytics at petabyte scale. With the two announcements, MapR says customers and partners will be able to more quickly leverage Drill to get fast business insights from all their data in Hadoop and other sources. MapR also released a comprehensive SQL-based test framework to the open source community.

Posted October 21, 2015

Teradata is introducing a series of platforms that will capture streaming IoT data, land that data in Hadoop, and use advanced Aster machine learning natively in Hadoop to analyze data.

Posted October 20, 2015

Concurrent is releasing the next version of its Driven platform, enabling application performance monitoring and management across heterogeneous Hadoop and Spark environments within a single, comprehensive solution.

Posted October 20, 2015

Splice Machine has released version 1.5 of its Hadoop RDBMS, which adds multiple enterprise-ready features. The new release adds functionality and performance improvements to enable companies to increase the benefits of using Splice Machine to support real-time applications, run operational data lakes, and accelerate their ETL pipelines.

Posted October 20, 2015

IBM Power Systems were recently showcased at Strata + Hadoop World in New York City, where IBM showed how Power Systems provides the world's superior platform for Systems of Insight, driving digital business in a waitless world. At the event, Power Systems highlighted new capabilities for Spark and Hadoop on the world's first system designed for data.

Posted October 14, 2015

Clearlake Capital Group, L.P., a private investment firm with a sector-focused approach, has reached a definitive agreement to acquire Syncsort Inc., a global big data enterprise software provider. Financial terms of the transaction were not disclosed. Headquartered in Woodcliff Lake, NJ, Syncsort provides enterprise-grade software that spans "big iron to big data," including fast-growing analytical platforms such as Apache Hadoop, Splunk, Apache Spark, and the cloud, as well as more mature platforms such as the IBM z Systems mainframe.

Posted October 14, 2015

At Strata + Hadoop World 2015, Attunity announced the release of Attunity Replicate Express, a downloadable edition of its data replication and loading software. The solution, which answers a growing demand for more accessible real-time big data analytics, is freely available to download online. The new solution supports ingesting data to and from Oracle, SQL Server, and Hadoop Data Lakes for test and development environments.

Posted October 14, 2015

We live in an "as-it-happens" world, and consumers expect on-demand everything—TV, taxi service, you name it. That same culture is now (and rightly so) expected in businesses. Don't just think about "good enough"—think and plan for real time. The technology is here now to leverage real time in your business for shorter feedback loops, improved time-to-market, and happier customers. The sooner, the better. It's never too early for real time. "Good enough" simply isn't, well, good enough anymore.

Posted October 13, 2015

In what is being hailed as the biggest tech merger ever, Dell Inc. and EMC Corp. today formally announced they have signed a definitive agreement under which Dell will acquire EMC. The total transaction is valued at $67 billion. The deal is expected to close in the second or third quarter of Dell's fiscal year which ends February 3, 2017 (within the months of May to October 2016). The industry is going through a "tremendous transformation," with the old style of IT being "pretty quickly disrupted" yet this rapid change is also presenting "incredibly rich" opportunities, said Joe Tucci, chairman and chief executive officer of EMC, during a conference call with media and industry analysts.

Posted October 12, 2015

ClearStory Data, a company specializing in bringing business-oriented data intelligence to everyone, introduced new advancements to its Spark-native Intelligent Data Harmonization and blending capabilities that will help make users more self-reliant.

Posted October 12, 2015

MapR Technologies has added native JSON support to the MapR-DB NoSQL database. The in-Hadoop document database will allow developers to quickly deliver scalable applications that also leverage continuous analytics on real-time data. A developer preview of MapR-DB with sample code is available for download and general availability of these new capabilities in MapR-DB will be available in Q4 2015.

Posted October 07, 2015

Built on Hadoop, Kyvos gives business users and analysts the ability to query billions of rows of data within seconds. Kyvos' technology allows users to pre-process data and build cubes on Hadoop for faster performance and instant responses. With this partnership, Kyvos can connect Tableau users to their Hadoop data within minutes, the companies say. "It's a benefit to Tableau because it opens up the data that's available to the business user through Tableau and improves the response time," said Ajay Anand, vice president of product management and marketing at Kyvos. "They've been very supportive with what we are trying to do."

Posted October 06, 2015

One of the noticeable changes this year at Strata + Hadoop World 2015 was the rise of Apache Spark, an engine for large scale data processing. In recent months, many companies have extended support to Spark, which can be complementary to Hadoop, but can also be deployed without it.

Posted October 05, 2015

Syncsort is continuing to grow its platform's capabilities by announcing new integration with two active open source platforms, Apache Kafka and Apache Spark, enabling users to better handle real-time, large-scale data processing, analytics, and feeds.

Posted October 01, 2015

At Strata + Hadoop World 2015, SAP showcased its portfolio of big data solutions, including the HANA platform that offers real-time integration of big data and information held in Hadoop with business processes and operational systems, Lumira and SAP BI tools that enable data discovery on Hadoop along with data wrangling capabilities, SAP Data Services, and the newest SAP product for the Hadoop world, HANA Vora, which takes advantage of an in-memory query engine for Apache Spark and Hadoop to speed queries. SAP HANA Vora can be used as a stand-alone, or in concert with SAP HANA platform to extend enterprise-grade analytics to Hadoop clusters and provide enriched, interactive analytics on Hadoop and HANA data.

Posted October 01, 2015

Teradata Corp. has accelerated its roadmap for the open source Presto by delivering ODBC (Open Database Connectivity)/JDBC (Java Database Connectivity) drivers for free. Presto is an open source SQL query engine which supports big data analytics.

Posted October 01, 2015

Pepperdata, a provider of solutions that optimize cluster performance in Hadoop, showed off a new feature of its platform that will help measure and allocate the costs of increasing workloads across distributed systems at Strata + Hadoop World in NYC. With this new chargeback feature IT teams can clearly see how much capacity each user or workload requires and allocate costs back to departments that share a centralized, multi-tenant Hadoop deployment.

Posted October 01, 2015

Objectivity, which recently introduced ThingSpan, a purpose-built information fusion platform intended to simplify and accelerate companies' ability to deploy and derive value from industrial Internet of Things (IoT) applications, has announced plans to support Intel's TAP (Trusted Analytics Platform) at Strata + Hadoop World in NYC. ThingSpan is aimed at helping companies "that are drowning in data but thirsty for answers in time," said Jay Jarrell, CEO and president of Objectivity, during an interview at the conference.

Posted September 30, 2015

At Strata + Hadoop World in New York City, Talend, a provider of data integration software for the cloud and big data, is announcing a new version of its platform, now offering support for Apache Spark and Spark Streaming. Talend 6 will leverage over 100 Spark components to deliver rapid data processing speed and enable any company to convert streaming big data or IoT sensor information into immediate actionable insights.

Posted September 30, 2015

DataTorrent is teaming up with two big companies that will allow it to provide access to better security and make adoption of Hadoop easier. DataTorrent is partnering with Cisco to allow integration between its DataTorrent RTS platform and Cisco's Application Centric Infrastructure (ACI) through the Application Policy Infrastructure Controller (APIC), offering a unified management architecture for enterprises to manage their big data applications along with network and security. DataTorrent is also integrating its platform with Microsoft Azure HDInsight via the Microsoft Azure Marketplace.

Posted September 29, 2015

Pivotal has confirmed it will continue its commitment to advancing open source by contributing its technology to the Apache Software Foundation (ASF). Pivotal's contribution of the HAWQ advanced SQL on Hadoop analytics and MADlib machine learning technologies will cement Hadoop's place as the cornerstone of advanced data science, business intelligence, and data warehousing, according to Pivotal.

Posted September 29, 2015

Arcadia Data, a provider of a unified visual analytics and business intelligence (BI) platform for big data, is releasing Arcadia Enterprise, a solution that will run natively in Hadoop. The company says the platform bypasses the restrictions of legacy BI and visualization tools by allowing users to work directly with their data on Hadoop. "We give the analyst the ability to do free-form exploration of the highest granularity of data in the Hadoop system," said Priyank Patel, co-founder and chief product officer at Arcadia.

Posted September 29, 2015

The Hortonworks DataFlow (HDF) support subscription is now available. HDF, powered by Apache NiFi, a top-level open source project, is intended to help organizations take advantage of data related to the Internet of Anything (IoAT). It aims to make it easier to automate and secure data flows and to collect, conduct, and curate real-time business insights and actions derived from any data, from anything, anywhere. "By flowing that data into HDP, our customers are able to rapidly bring these new data elements under management in a completely secure and purely open way," said Tim Hall, vice president of product management at Hortonworks.

Posted September 29, 2015

Cloudera has launched a public beta release of RecordService, a new high-performance security layer for Apache Hadoop that centrally enforces role-based access control policies across the platform. Complementing Apache Sentry (incubating), which provides unified policy definition, RecordService delivers complete row- and column-based security, and dynamic data masking, for every Hadoop access engine. The announcement was made at Strata + Hadoop World in New York City.

Posted September 28, 2015

Pentaho, a Hitachi Data Systems company, and Melissa Data, a provider of global contact data quality solutions, have formed a partnership to create new data quality plug-ins for Pentaho's big data integration and analytics platform.

Posted September 24, 2015

MemSQL, a provider of real-time databases for transactions and analytics, has announced Spark Streamliner, an integrated Spark solution to give enterprises immediate access to real-time analytics.

Posted September 24, 2015

MarkLogic, which bills itself as the only enterprise NoSQL database provider, completed a $102 million financing round earlier this year that it will use to accelerate the pace of growth in the $36 billion operational database market. Recently, Big Data Quarterly spoke with Joe Pasqua, executive vice president of products at MarkLogic, about the changing database management market, and what MarkLogic is doing to meet emerging enterprise customer requirements.

Posted September 24, 2015

To help companies get more value from big data, SAP has introduced HANA Vora, a new in-memory computing engine that leverages and extends the Apache Spark execution framework to provide enriched, interactive analytics on Hadoop. HANA Vora is a completely new product built from the ground up and is aimed at better processing of data to make business decisions.

Posted September 23, 2015

MapR Technologies, Inc., a provider of a distribution for Apache Hadoop, has extended its support for SAS, a provider of business analytics software and services. According to the vendors, the collaboration between SAS and MapR provides advanced analytics with ease of data preparation and integration with legacy systems, assurance of SLAs, and security and data governance compliance. Additionally, joint customers can cost-effectively grow their big data storage infrastructure without relying on storage area network (SAN) or network-attached storage (NAS).

Posted September 22, 2015

Platfora, a data discovery platform native to Hadoop, is reaching another milestone for its solution, bringing modern data preparation and real-time capabilities to a wide audience.

Posted September 22, 2015

Data has continued to grow at an exponential pace, and along with that trend, more businesses are beginning to take advantage of data. Businesses have begun to rely more and more on their IT departments to leverage their data more quickly than their competition. However, it is difficult to get the most out of data as fast as organizations would like. Recently, DBTA held a special roundtable webcast to provide education on new data management technologies and techniques for meeting the increasing requirements posed by modern applications for speed, scale and flexibility.

Posted September 15, 2015

Anyone who thought Hadoop was a fly-by-night technology was wrong. Hadoop has rapidly evolved—improving and gaining mainstream adoption as a technology and framework for enabling data applications previously out of reach for all but the savviest of companies. The open source Apache Hadoop developer community (and distribution vendors) continuously contributes advances to meet the demands of companies seeking more powerful—and useful—data applications, while also focusing on requirements for improved data management, security, metadata, and governance. Hadoop is not only stable but worthy of consideration for core IT strategies.

Posted September 14, 2015

By now, we all know the value of big data analytics. In the 1990s, data mining and business intelligence (BI) efforts were used mostly for after-the-fact reporting and problem solving. The need for these capabilities will always remain, and high levels of confidence and investment in these disciplines will continue. However, the current focus on big data analytics emphasizes predicting and influencing the future, based on real-time analysis of 100% of enterprise data.

Posted September 14, 2015

There are a lot of moving parts that data managers and professionals need to attend to in today's enterprises. Here are the eight things that matter the most in today's market.

Posted September 09, 2015
