Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

With the growing appreciation of data as a valuable resource and the pressure on organizations to become data-driven, the role of database administration has become more critical than ever, and database administrators are increasingly appreciated for the critical role they play in delivering value to the organizations. Key concerns as far as database administration are availability, security, and integration across a variety of data types and storage mechanism so that analysts and business users are empowered to be able uncover key insights when they need it. It's a lot to think about, and that is why strong database administration solutions are highly valued.

Posted August 03, 2016

In 2016, Hadoop marked its 10th anniversary and now represents much more than a platform for the storage and batch processing of vast quantities of data from disparate sources in many formats. The Apache Hadoop framework, consisting of Hadoop Common, Hadoop Distributed File System (HDFS); Hadoop YARN, and Hadoop MapReduce, remains central to most big data projects and to the creation of data lakes, but Hadoop has also expanded to represent a large ecosystem of more than 100 interconnected open source Hadoop-related projects.

Posted August 03, 2016

If information is the lifeblood of organizations today, then delivering information where it is needed faster can be considered a matter of business health, and in some cases, even business survival.

Posted August 03, 2016

There is more data available to organizations than ever before, but the goal remains the same - to unlock nuggets of gold, the useful information that will result in competitive advantage for the organization, allowing it to react to customer's needs with lightning speed, uncover new opportunities, and act fast to counter competitive threats.

Posted August 03, 2016

Over the years, MultiValue technology has maintained its base of committed advocates despite the decades-long trend toward relational database management systems. And, now with an expanding appreciation for polyglot persistence, or put more simply, the selection of the best tool for the job, there is a growing recognition that different data management systems offer different benefits with some simply better suited for certain requirements than others.

Posted August 03, 2016

While relational database technology is still the undisputed leader when it comes to enterprise data management, it is also becoming increasingly apparent that it is no longer the only game in town. By now it is clear that Not Only SQL or NoSQL technology represents a key piece of a data management picture that is increasingly diverse.

Posted August 03, 2016

Cloud solutions and services continue to grow in acceptance for many reasons - the ease of deployment and upgrades, elastic scalability, and the pay as you go simplicity. Cloud database options offer an approach that allows organizations to run a database on a cloud platform, either running the DB on the cloud independently or selecting a database service that is maintained by a public cloud service provider.

Posted August 03, 2016

The rise of new data types over the past 10 years has led to new ways of thinking about data, and new data storage and management technologies such as Hadoop, NewSQL, and NoSQL. However, despite all the new technologies that have emerged in the last 10 years, one thing is clear: the relational database management system which has been the enterprise workhorse for decades will remain a critical component of the data architecture.

Posted August 03, 2016

Today, data is being recognized and appreciated as an asset, and even, some have suggested, a kind of currency. But beyond the obvious businesses built on data - such as Airbnb's rental business, Uber's car service app, and Alibaba's online marketplace - every business today is striving to become a data-driven organization, with turn-on-a-dime agility and rapid insights into customer behaviors and desires.

Posted August 03, 2016

The rapid expansion of The IT market shows no signs of abating. The growth of data in all its forms—from traditional data sources and newer sources such as social media and connected devices—is driving swift innovation. To address the need to secure, integrate, and draw meaningful insights from all this data, a steady flow of products and services, as well as new features to long-established offerings, continues to emerge.

Posted August 03, 2016

Splice Machine is teaming up with Incedo, a technology solutions provider specializing in data and analytics, product engineering, and emerging technologies, to generate solutions that will help enterprises manage data and accelerate data processing.

Posted August 03, 2016

Talend S.A., a provider of big data and cloud integration solutions, successfully launched its IPO. The offering included 5,250,000 American Depositary Shares (ADSs), each representing one of its ordinary shares, at a price to the public of $18 per ADS, which was higher than anticipated - and then rose sharply.

Posted July 29, 2016

There's a new buzzword on the loose, the data lake. At first glance, a data lake could be easily mistaken for a data warehouse. The two big data concepts have a common focus on analytics and they may, in certain situations, produce roughly equivalent output. But that's about where their similarities end.

Posted July 27, 2016

Cloudera, a provider of data management and analytics platform built on Apache Hadoop and additional open source technologies, has introduced Cloudera Navigator Optimizer, alongside the production release of Cloudera Enterprise 5.8. First launched in beta in late 2015, Navigator Optimizer, now generally available, is targeted at customers looking to modernize their analytic database or augment their data warehouse solution with Hadoop, and helps to provide insights for predictably offloading key workloads. The tool also provides DBAs with the usage visibility to manage Hadoop data models and guidance to optimize query performance.

Posted July 21, 2016

Database administration is undergoing some significant changes these days. The DBA, traditionally, is the technician responsible for ensuring the ongoing operational functionality and efficiency of an organization's databases and the applications that access that data. But modern DBAs are relied upon to do far more than just stoke the fires to keep database systems performing

Posted July 20, 2016

Splice Machine, provider of an RDBMS powered by Hadoop and Spark, has announced a cloud-based sandbox for developers to put its just launched open source Community Edition to the test. The company is making available an open source standalone and cluster download, and has announced the general availability of V2.0, and the launch of its developer community site.

Posted July 19, 2016

Monte Zweben, CEO and co-founder of the company that was founded in 2012, talks with Big Data Quarterly about why Splice Machine has rolled out an open source Community Edition - and why it is doing so now

Posted July 18, 2016

RedPoint Global, a provider of data management and customer engagement software, has announced integration with Microsoft Azure HDInsight to support enhanced data management capabilities via Hadoop deployments on Microsoft Azure. RedPoint is a member of the Microsoft Partner Network, and the new integration evolved from its participation in the Microsoft Enterprise Cloud Alliance Program.

Posted June 30, 2016

Actian Corporation is releasing an updated version of the Actian Vector in Hadoop (VectorH) database, enabling Spark users to have a new powerful way to help derive true business value from their data.

Posted June 29, 2016

Hortonworks, Inc. unveiled new innovations at Hadoop Summit that will improve the Hortonworks Data Platform (HDP), allowing enterprises to accumulate, analyze, and act on data.

Posted June 29, 2016

Qubole unveiled a new feature to its Qubole Data Service (QDS) called auto-caching, a next-generation disk cache for cloud storage systems that works across different data engines.

Posted June 28, 2016

Hortonworks, Inc.,is partnering with AtScale to resell AtScale's technology, providing users with the ability to query data without any data movement from any business intelligence tool.

Posted June 28, 2016

Dataguise has announced the availability of Dataguise DgSecure 6.0, the company's data security platform. According to the vendor, DgSecure 6.0 offers a monitoring solution for all data source types, allowing users to quickly understand what, where, and how sensitive data is being detected, protected, and accessed across the enterprise.

Posted June 28, 2016

Attunity Ltd. is releasing a new version of Attunity Visibility for Hadoop with enhanced technology to enable comprehensive data usage analytics for large-scale and fast-growing Hadoop Data Lake environments. The new release brings "a very unique, needed solution to the fast and growing market of Hadoop," said Itamar Ankorion, EVP of business development and corporate strategy at Attunity.

Posted June 28, 2016

MapR Technologies is introducing a new initiative that will help support Hadoop deployments and increase user and administrator productivity.

Posted June 28, 2016

Pepperdata is unveiling a new tool that will evaluate and assess Hadoop clusters and provide visibility into current cluster conditions.

Posted June 27, 2016

At the 2016 Hadoop Summit in San Jose, Teradata announced the certification of multiple BI and visualization solutions on the Teradata Distribution of Presto.

Posted June 27, 2016

Trifacta, a provider of data wrangling software, is deepening technical integration with the Hortonworks Data Platform (HDP) and the industry's first certification for Apache Atlas, a data governance and metadata framework for Hadoop.

Posted June 23, 2016

For those who haven't encountered the term, the "trough of disillusionment" is a standard phase within the Gartner hype cycle. New technologies are expected to pass from a "peak of inflated expectations" through the trough of disillusionment before eventually reaching the "plateau of productivity." Most new technologies are expected to go through this trough, so it's hardly surprising to find big data entering this phase.

Posted June 22, 2016

The data manager now sits in the center of a revolution swirling about enterprises. In today's up-and-down global economy, opportunities and threats are coming in from a number of directions. Business leaders recognize that the key to success in hyper-competitive markets is the ability to leverage data to draw insights that predict and provide prescriptive action to stay ahead of markets and customer preferences. For that, they need to keep up with the latest solutions and approaches in data management. Here are 12 of the key technologies turning heads—or potentially opening enterprise wallets—in today's data centers.

Posted June 22, 2016

Announcing a new version of its "data lake in-a-box," Koverse, Inc. has released the Koverse Platform Version 2.0, which provides enhancements for organizations trying to extract value out of their investments in big data and analytics. With this introduction, the company says it is offering enterprises a 30-day guarantee to bring their data into production with a data lake capable of delivering insights for real-world organizational challenges.

Posted June 21, 2016

With the increase in data sources, data types, and data management platforms, new obstacles can also appear, creating difficulties in combining data for important insights. During educational presentations on industry trends and technologies, keynotes, discussions, and hands-on workshops at Data Summit 2016, the philosophies and technical approaches that can help organizations be successful at putting their data to work were addressed.

Posted June 17, 2016

What's on the horizon for big data, analytics, and business intelligence as technology evolves faster and faster? In Data Summit's 2016 closing keynote John O'Brien, principal analyst and CEO, at Radiant Advisors, discussed how technology will evolve and grow in the future.

Posted June 17, 2016

Hortonworks, Inc. is enhancing its Global Professional Services (GPS) program to support and enable Hortonworks Connected Data Platforms customers.

Posted June 16, 2016

Talend is "going all-in" with Amazon Web Services (AWS), now providing its entire product line on AWS cloud. The latest release of Talend Integration Cloud extends the company's ability to allow IT organizations to quickly "spin up and spin down" big data and data integration workloads running on Amazon Redshift or Amazon EMR.

Posted June 10, 2016

Cloudera is collaborating with Microsoft to build a new open source platform that will reduce the burden on application developers leveraging Spark. The two entities, together with other open source contributors, have built a new open source Apache licensed REST-based Spark Service, called Livy, which is still in early alpha development.

Posted June 09, 2016

Data Summit 2016 in New York City drew IT managers, data architects, application developers, data analysts, project managers, and business managers. Analytics, search, machine learning, and IoT were some of the key topics of discussion in educational presentations on industry trends and technologies, keynotes, and hands-on workshops.

Posted June 08, 2016

Progress is releasing a new package of platforms that will enable enterprises to tap into the full potential of digital business. Progress DigitalFactory is a new cloud-based platform that provides a holistic, extensible solution for businesses to create omni-channel digital experiences.

Posted June 08, 2016

This week at Spark Summit, data management companies are rolling out new Spark integrations and support at Spark Summit to enable their users to take advantage of the open source data processing framework. In addition, Databricks, the company founded by the team that created Apache Spark, has announced that the Databricks Community Edition (DCE) is now generally available.

Posted June 07, 2016

Emerging and newer vendors can offer fresh, innovative ways of dealing with data management and analytics challenges. Here, DBTA looks at the 10 companies whose approaches we think are worth watching.

Posted June 06, 2016

The IT landscape is always shifting and being contoured by external market forces and internal industry initiatives. Against this changing backdrop, each year, DBTA presents a list of 100 companies that matter in data, compelling us to pause and reflect on the market changes taking place.

Posted June 06, 2016

Teradata has introduced the Teradata Aster Connector for Spark, an integration of Apache Spark analytics with Teradata Aster Analytics. The connector enables pre-built analytics functions from both solutions to be executed from Aster Analytics, enabling anyone who can use Aster Analytics to also run advanced analytics on Spark without the need to learn or know Scala.

Posted June 06, 2016

NoSQL database technology vendor Couchbase has introduced a new Couchbase Spark Connector. According to Couchbase, the new Spark connector will enable businesses to gain business insights faster, enabling them to deliver better customer experiences through web, mobile and IoT applications.

Posted June 06, 2016

Today at Spark Summit, MapR Technologies is announcing a new enterprise-grade Apache Spark Distribution. "This is a Spark-focused distribution that combines Apache Spark with the real time, persistent, web-scale data layer of MapR," said Jack Norris, SVP, Data and Applications, MapR. The new Spark Distribution option for the MapR Converged Data Platform enables advanced analytics - including batch processing, machine learning, procedural SQL, and graph computation, and is a production-ready platform for Spark workloads on-premise and in the cloud.

Posted June 06, 2016

In the wide world of Hadoop today, there are seven technology areas that have garnered a high level of interest. These key areas prove that Hadoop is not just a big data tool; it is a strong ecosystem in which new projects coming along are assured of exposure and interoperability because of the strength of the environment.

Posted June 03, 2016

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21

Sponsors