Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

The demand for speed and agility is among the key drivers of the growing DevOps movement, which seeks to better align software development and IT operations. Yet, challenges still exist.

Posted June 07, 2017

Databricks has introduced a new offering to simplify the management of Apache Spark workloads in the cloud. "Databricks Serverless" is a managed computing platform for Apache Spark that allows teams to share a pool of computing resources and automatically isolates users and manages costs. The new offering aims to remove the complexity and cost of users managing their own Spark clusters.

Posted June 06, 2017

MapR Technologies, Inc., provider of a converged data platform that integrates analytics with operational processes in real time, has announced MapR-XD, a cloud-scale data store to manage files and containers. As part of the MapR Converged Data Platform, MapR-XD supports any data type from the edge to the data center and multiple cloud environments with automatic policy-driven tiering from hot, warm or cold data to enable customers to create global data fabrics which are ready for analytical and operational applications.

Posted June 06, 2017

In the last few years, a frequent topic of conversation within some of the largest corporations in the world has been the move to the cloud—how to prepare for it, how to address it, and how to benefit from it. Yet over the past several months, some are also talking about a more ambitious goal: to be cloud-only by 2025.

Posted June 01, 2017

Pythian, a technology services provider, is launching a customized analytics solution that integrates multiple data types from both internal and external sources. The new solution, "Kick Analytics As A Service" (Kick AaaS), gathers multi-source, multi-format data together in the cloud, and adds advanced analytics, machine learning and visualizations to ensure business users and business systems get the insights they need when they need them.

Posted May 31, 2017

Qubole, the big data-as-a-service company, is building an autonomous data platform that will include Qubole Data Service (QDS) Community Edition, QDS Enterprise Edition, and QDS Cloud Agents. The solution can intelligently automate and analyze platform usage to make data teams more effective.

Posted May 26, 2017

When people talk about the next generation of applications or infrastructure, what is often echoed throughout the industry is the cloud. On the application side, the concept of "serverless" is becoming less of a pipe dream and more of a reality. The infrastructure side has already proven that it is possible to deliver the ability to pay for compute on an hourly or more granular basis.

Posted May 15, 2017

What are the enabling technologies that make enterprise architecture what it is today? There is a range of new-generation technologies and approaches shaping today's data environments. The key is putting them all together to help enterprise architecture fit into the enterprise's vision of itself as a data-driven organization. Tools and technologies emerging within today's data-driven enterprise include cloud, data lakes, real-time analytics, microservices, containers, Spark, Hadoop, and open source trends.

Posted May 15, 2017

Dell EMC is announcing new data backup and protection solutions to enable customers to ensure data is secure, backed up and protected against disasters and outages.

Posted May 09, 2017

Advanced Systems Concepts, Inc. (ASCI) has released an update to its flagship platform that adds support for the Hadoop ecosystem as well as workflow performance. ActiveBatch Version 11 is designed to get data into the hands of end users in real time.

Posted May 08, 2017

Confluent, a provider of a streaming platform based on Apache Kafka, is rolling out Confluent Cloud, a fully managed streaming data service aimed at helping developers to focus on building streaming applications with Apache Kafka, rather than Kafka operations. Currently available via an early access program, the Confluent Cloud service will initially be available in Amazon Web Services, with support for Microsoft Azure and Google Cloud to be added in the future.

Posted May 08, 2017

Cloudera, which last week began trading on the New York Stock Exchange under the symbol "CLDR," has announced the general availability of the Cloudera Data Science Workbench, a self-service tool for data scientists. The workbench, which was announced in beta at Strata+Hadoop World San Jose 2017, enables fast, easy and secure self-service data science for the enterprise.

Posted May 02, 2017

Cloudera, a provider of a platform for machine learning and advanced analytics built on open source technologies including Hadoop, launched an IPO and is beginning public trading on the New York Stock Exchange under the symbol "CLDR."

Posted April 28, 2017

MicroStrategy is partnering with Alation, offering users of Alation Data access to a data catalog directly within the MicroStrategy interface that can seamlessly conduct self-service enterprise data discovery and analytics in the MicroStrategy platform. When Alation connects to an organization's data sources, it crawls and indexes data assets stored across different physical repositories, including databases, Hadoop files and data visualization tools, to produce a rich catalog.

Posted April 18, 2017

As mobile has pushed deeper into enterprises, there is a growing recognition that it may be possible to run significant parts of businesses from relatively small devices. While mobile devices may not be ready to run entire enterprises, in many cases, they certainly can run more limited functions.

Posted April 18, 2017

Hewlett Packard Enterprise (HPE) is announcing its HPE SecureData platform has achieved the industry's Federal Information Processing Standard (FIPS) 140-2 validation of Format-Preserving Encryption (FPE). HPE SecureData has the world's first FIPS-validated AES-FF1 encryption configuration option to operate in strict FIPS mode, according to the vendor.

Posted April 13, 2017

Innovative technologies, such as artificial intelligence, augmented reality, robotics, and IoT, have had, and will continue to have, broad impact that we don't yet fully understand. Organizations that adopt these technologies will require new business models and processes. We will need to understand who our customers are and what they expect. The world of work as we know it today will continue to evolve at a faster pace—this is why adaptability and resilience are critical to a vibrant career.

Posted April 12, 2017

MapR Technologies has released an updated version of the MapR Ecosystem Pack (MEP) program, a set of open source ecosystem projects that support applications running on the MapR Converged Data Platform with inter-project compatibility.

Posted April 10, 2017

By now we are all in agreement: The business of data is changing. Business users are more empowered to work with data; IT is becoming less about control and more about enablement. New data science job descriptions—such as the data scientist—are springing up as companies everywhere look for the right people with the right skill sets to squeeze more value from their data. Data itself is getting bigger, hardware more economical, and analytical software more "self-service." We've embraced the paradigm shift from traditional BI to iterative data discovery. It's a new era.

Posted April 07, 2017

As the Internet of Things (IoT) revolution works its way through marketing hype and seeks its place of valuable contribution within companies and industries, you might pause to wonder how IoT can create opportunities for your company. Yet that assessment is difficult in part because the buzz does not always align with reality. In short, it's no simple task to discern the true potential of IoT today, leaving one to wonder: What is realistic, what difference could IoT make in my company, and how mature are other companies in embracing IoT potential?

Posted April 07, 2017

The concept of data lakes is a great one, but if not done correctly, this treasure trove of information can quickly turn into a black abyss for data analysts and scientists, let alone business users.

Posted April 07, 2017

Make no mistake: Big data is promising, exciting, and effective—when done right. Once considered an overhyped buzzword, it's now a potential tool that leaders in every vertical want to harness. Unfortunately, the majority of new big data projects—about 55% of them, according to Gartner—are shuttered before they even get off the ground.

Posted April 07, 2017

There has been a sea change in how enterprises are thinking about Apache Hadoop and big data. Today, a majority of enterprises are thinking about the cloud first, not on-premises, and are increasingly relying on ecosystem standards to drive their Apache Hadoop distribution selection.

Posted April 07, 2017

It often seems that working around things is a full-time task in every area of information technology. When workarounds are conceived and deployed, people are not always in agreement.

Posted April 07, 2017

It is difficult to find someone not talking about or considering using containers to deploy and manage their enterprise applications. A container just looks like another process running on a system; a dedicated CPU and pre-allocated memory aren't required in order to run a container. The simplicity of building, deploying, and managing containers is among the reasons that containers are growing rapidly in popularity.

Posted April 07, 2017

GridGain Systems, provider of enterprise-grade in-memory computing platform solutions based on Apache Ignite, has obtained certifications from Hortonworks and Tableau and joined their technology partnership programs. GridGain says these relationships will make it easier for its customers to launch high performance big data systems built on Hortonworks that leverage in-memory computing and to visualize in-memory data held in GridGain using Tableau.

Posted March 31, 2017

Data analytics platform provider Looker closed an $81.5 million Series D funding round led by CapitalG, Alphabet's growth equity investment fund.

Posted March 30, 2017

Oracle has announced that Hearst has selected Oracle Cloud to provide its businesses with a common platform to accelerate business growth and global expansion.

Posted March 29, 2017

The Independent Oracle Users Group (IOUG) has represented the voice of data technologists and professionals for more than 20 years, and we are excited about how our community continues to grow and focus on peer-to-peer education and know-how. With that focus we are excited for our premier yearly event: COLLABORATE 17 - IOUG Forum.

Posted March 29, 2017

Talend, a provider of cloud and big data integration software, has announced that the newest version of its Talend Data Fabric integration solution has been certified on the MapR Converged Data Platform, which includes MapR-FS, MapR-DB and MapR Streams.

Posted March 29, 2017

Jethro, provider of an index-based SQL enterprise platform, is launching Jethro 3.0, combining the power of indexing architecture with "auto-cubes" to accelerate all possible business intelligence use cases using big data.

Posted March 28, 2017

MicroStrategy is releasing an enhanced version of its signature platform, delivering a new set of APIs that will allow users to connect to almost any data source. MicroStrategy 10.7 also adds integrations with Natural Language Generation (NLG) providers Automated Insights and Narrative Science, letting users add Intelligent Narratives to their dashboards alongside their reports, graphs, and visualizations.

Posted March 27, 2017

Alation and Trifacta say they are extending their partnership to jointly deliver an integrated solution for self-service data discovery and preparation that enables users to access the data catalog and data wrangling features within a single interface.

Posted March 15, 2017

SAP has announced advancements in the SAP Vora solution to help customers accelerate project implementations and improve their enterprise business analytics.

Posted March 15, 2017

Dataguise, a provider of sensitive data governance, has announced that DgSecure now provides sensitive data monitoring and masking in Apache Hive.

Posted March 15, 2017

The rise of big data, combined with the growing popularity of cloud, presents valuable new opportunities to leverage data with greater efficiency. But organizations also need to be aware of some key differences between on-premises and cloud deployments, says Charles Zedlewski, senior vice president, products, at Cloudera.

Posted March 15, 2017

MapR Technologies has added a small footprint edition of the MapR Converged Data Platform to address the need to capture, process, and analyze data generated by IoT devices close to the source. MapR Edge enables secure local processing, quick aggregation of insights on a global basis, and the ability to push intelligence back to the edge for faster business impact.

Posted March 14, 2017

Infoworks.io, Inc., which provides data warehousing on Hadoop, has closed $15 million in a Series B financing which it will use to scale go-to-market and customer success programs to meet customer demand.

Posted March 14, 2017

Impetus Technologies has announced StreamAnalytix 3.0, which adds support for Apache Spark-based batch processing and enriched online and offline machine learning features. The new capabilities are targeted at helping enterprises improve the performance of their analytical models and achieve more favorable business outcomes. StreamAnalytix 3.0 will be available under a beta program online by the end of April 2017.

Posted March 14, 2017

Zoomdata, the developer of a visual analytics platform for big data, has announced the launch of a new Smart Connector for the Vertica Advanced Analytics database from Hewlett Packard Enterprise (HPE). Vertica systems integrator Clarity Insights will offer customers pre-integrated Zoomdata packages for Vertica as well as other supported data sources.

Posted March 14, 2017

Teradata is introducing a new platform in the open source space that will deliver unprecedented efficiencies for companies creating data lakes. Teradata is launching Kylo, a data lake management software platform built using the latest open source capabilities such as Apache Hadoop, Apache Spark, and Apache NiFi.

Posted March 09, 2017

Arcadia Data, a provider of visual analytics software, has announced the launch of Arcadia Enterprise 4.0, with enhancements for building, branding, sharing, and embedding data-centric applications to help make Apache Hadoop and cloud-based data lakes more accessible and valuable to internal and external users.

Posted March 07, 2017

Ash Munshi, Pepperdata CEO, recently discussed the need for DevOps for big data, and the role of the Dr. Elephant project, which was open sourced in 2016 by LinkedIn and is available under the Apache v2 License.

Posted March 07, 2017

Today's successful organizations are data-driven, and many are building, maintaining, and accessing databases that scale well beyond the terabyte range. In fact, many have total data assets that now measure in the petabytes. But it's not just the size of databases that is expanding.

Posted March 01, 2017

Oracle has expanded the Oracle Cloud Platform's data integration offerings with the launch of Oracle Data Integrator Cloud. The new cloud service is aimed at speeding and simplifying cross-enterprise data integration to support real-time analytics.

Posted February 22, 2017

Qubole, which provides big data-as-a-service and is a silver level member of Oracle PartnerNetwork, is making its Qubole Data Service (QDS) available on the Oracle Cloud. With this offering, the Qubole Data Service is leveraging bare metal on Oracle Cloud to provide customers with fast access to analytics.

Posted February 22, 2017

Are enterprises more or less secure than 5 years ago? That's the big question of the moment, especially with ongoing revelations about state-sponsored hacking, as well as an unending stream of reports about customer and employee data being compromised by even the most seemingly security-conscious organizations. Awareness of data security is running at a fever pitch at the highest levels of government and business organizations. There have been plenty of technology advances, and awareness has grown. Still, the wave of breaches and threats never seems to abate, and likely never will.

Posted February 22, 2017
