Big Data Quarterly Articles



Platfora has added $30 million in growth-stage financing from new investors HSBC and Harmony Partners, with participation from earlier-stage backers, including Allegis Capital, Andreessen Horowitz, Battery Ventures, Citi Ventures, Cisco, Sutter Hill Ventures and Tenaya Capital. This new round brings Platfora's total amount raised to $95 million.

Posted December 08, 2015

IBM is acquiring Clearleap to boost its IBM Cloud platform and accelerate the delivery of video service as a strategic source of data for business. The acquisition is part of IBM's moves to help clients improve access to "dark data" which, because it is unstructured and dark to computer systems, cannot be effectively managed or exploited.

Posted December 08, 2015

Harnessing operational big data does not come with a "one size fits all" solution. Organizations are at different points in their data management cycles, and whether that they are building new applications or optimizing existing ones, each needs a unique solution. This was among the key points made during a special DBTA roundtable webinar on harnessing operational big data. The webinar featured Matt Allen, senior product marketing manager with MarkLogic, Kevin Petrie, senior director with Attunity, and Jason Paul Kazarian, senior architect with Hewlett Packard Enterprise (HPE).

Posted December 07, 2015

As 2015 draws to a close it's time to look back on the year's big changes in data management and reflect on some of the most insightful observations by leading data experts. Here, Big Data Quarterly presents a round-up of thought-provoking articles that explore some of the key advances of 2015, how organizations are harnessing what big data has to offer, and the challenges they face as they seek to compete on analytics.

Posted December 02, 2015

Hadoop distribution provider Cloudera has introduced Cloudera Enterprise 5.5, including Cloudera Navigator Optimizer, a new product targeted at helping organizations improve big data workload performance and efficiency. Cloudera Navigator Optimizer, now in beta, is expected to be generally available in 2016. The new release of Cloudera Enterprise has three main areas of focus, according to Anupam Singh, head of data management at Cloudera.

Posted November 19, 2015

Alpine Data has announced the general availability of the Alpine Custom Operator Framework, a methodology for developing custom algorithms that can be plugged into Alpine's parallel machine learning engine. Complementing Alpine Touchpoints, the Custom Operator Framework enables data science and business analyst teams to create, manage and distribute frequently-requested analytic assets to business users directly into their existing activities and workflows.

Posted November 17, 2015

Not too long ago, large volumes of information were available only to the select few organizations able to afford the expensive IT infrastructure to collect, store, manage, and analyze it—the big-budget companies with seemingly bottomless pockets, or the professional research services that built empires on collecting and interpreting data. But now, through the realized effects of Moore's law, along with the consumerization of BI tools, this data is increasingly available to everyone—and without having to dig deep into budgets or employ robust IT departments.

Posted November 13, 2015

As the business value of big data increases, vendors are offering cloud-hosted big data technology, known as big data as a service (BDaaS). Why choose BDaaS instead of on-premises deployments? Do you lack internal IT expertise in big data? Is the upfront cost for a cluster an issue? Or do you not have the luxury of time to build a cluster? These are all important questions when considering BDaaS.

Posted November 13, 2015

With the Hadoop ecosystem expanding rapidly, Hortonworks' Shaun Connolly recently discussed how Hadoop and related technologies are growing and being used, as well as the factors for a successful deployment.

Posted November 13, 2015

A typical organization loses about 5% of its revenues to fraud each year. The total cost of non-health insurance fraud in the U.S. is estimated to be more than $40 billion per year. These numbers stress the importance and need of finding sophisticated tools to both detect and prevent fraud. Big data and analytics offer a new valuable toolkit in the fight against fraud.

Posted November 13, 2015

"Caveat emptor" is Latin for "Let the buyer beware." In the realm of the modern information technology cloud, this sage advice rings especially true.

Posted November 13, 2015

With increased demand for mass customization and personalization, the emergence of Web 2.0, and one-to-one marketing, and the need for better risk management and timely fraud detection, the pressure is on for organizations to improve their ability to extract, understand, and exploit analytical patterns of customer behavior and strategic intelligence.

Posted November 12, 2015

To better manage the data explosion now and with scalable options for the future, existing data architecture is evolving beyond traditional databases, data stores, data warehouses, and the like into a more unfiltered repository known as the data lake.

Posted November 05, 2015

A new Intel IoT Platform reference architecture and new hardware and software products have been introduced as part of the company's effort to build out comprehensive offerings for the Internet of Things (IoT) marketplace.

Posted November 05, 2015

Trove is the database as a service component of OpenStack that lets administrators and DevOps manage multiple instances of different database management systems (DBMS), both relational and NoSQL, using a common infrastructure. It makes database capacity available that can be consumed on-demand. This makes it much easier and faster to select and operate a database - while retaining all its capabilities - in a secure private or public cloud. The result is that routine tasks like provisioning, and managing regular administrative tasks like clustering, replication, backup and restore are handled in a simple, unified way.

Posted November 04, 2015

MapR is including Apache Drill 1.2 in its Apache Hadoop distribution and is also now offering a new Data Exploration Quick Start Solution, leveraging Drill to help customers get started more rapidly with big data projects. Apache Drill is an open source, low-latency query engine for Hadoop that delivers secure, interactive SQL analytics at petabyte scale. With the two announcements, MapR says customers and partners will able to more quickly leverage Drill to get fast business insights from all their data in Hadoop and other sources. MapR also released a comprehensive SQL-based test framework to the open source community.

Posted October 21, 2015

Citus Data, which provides solutions to scale out PostgreSQL for real-time big data, has secured $9.5 million in Series A funding led by Khosla Ventures, with participation by previous Citus Data investors including Data Collective and Vaizra Investments. The company says the funding round will be used to grow the Citus Data development team and expand its sales and marketing efforts.

Posted October 20, 2015

Splice Machine has released version 1.5 of its Hadoop RDBMS, which adds multiple enterprise-ready features. The new release adds functionality and performance improvements to enable companies to increase the benefits of using Splice Machine to support real-time applications, run operational data lakes, and accelerate their ETL pipelines.

Posted October 20, 2015

In the future, the clear winners in the IoT Revolution will be those companies that not only embrace the Internet of Things, but use it to transform their businesses. Those winning companies will be the ones that integrate IoT into their operations, products, and customer interactions to create new business models and sources of value. In fact, McKinsey estimates there could be as much as $11 trillion per year by 2020 in new economic value from adopting IoT.

Posted October 13, 2015

We live in an "as-it-happens" world, and consumers expect on-demand everything—TV, taxi service, you name it. That same culture is now (and rightly so) expected in businesses. Don't just think about "good enough"—think and plan for real time. The technology is here now to leverage real time in your business for shorter feedback loops, improved time-to-market, and happier customers. The sooner, the better. It's never too early for real time. "Good enough" simply isn't, well, good enough anymore.

Posted October 13, 2015

MapR Technologies has added native JSON support to the MapR-DB NoSQL database. The in-Hadoop document database will allow developers to quickly deliver scalable applications that also leverage continuous analytics on real-time data. A developer preview of MapR-DB with sample code is available for download and general availability of these new capabilities in MapR-DB will be available in Q4 2015.

Posted October 07, 2015

One of the noticeable changes this year at Strata + Hadoop World 2015 was the rise of Apache Spark, an engine for large scale data processing. In recent months, many companies have extended support to Spark, which can be complementary to Hadoop, but can also be deployed without it.

Posted October 05, 2015

MarkLogic, which bills itself as the only enterprise NoSQL database provider, completed a $102 million financing round earlier this year that it will use to accelerate the pace of growth in the $36 billion operational database market. Recently, Big Data Quarterly spoke with Joe Pasqua, executive vice president of products at MarkLogic, about the changing database management market, and what MarkLogic is doing to meet emerging enterprise customer requirements.

Posted September 24, 2015

StreamSets Inc., a company that aims to speed access to enterprise big data, has closed a $12.5 million round of Series A funding. The single biggest barrier to a successful enterprise analytics platform is the effective and efficient ingest of data, the company says.

Posted September 24, 2015

Anyone who thought Hadoop was a fly-by-night technology was wrong. Hadoop has rapidly evolved—improving and gaining mainstream adoption as a technology and framework for enabling data applications previously out of reach for all but the savviest of companies. The open source Apache Hadoop developer community (and distribution vendors) continuously contributes advances to meet the demands of companies seeking more powerful—and useful—data applications, while also focusing on requirements for improved data management, security, metadata, and governance. Hadoop is not only stable but worthy of consideration for core IT strategies.

Posted September 14, 2015

By now, we all know the value of big data analytics. In the 1990s, data mining and business intelligence (BI) efforts were used mostly for after-the-fact reporting and problem solving. The need for these capabilities will always remain, and high levels of confidence and investment in these disciplines will continue. However, the current focus on big data analytics emphasizes predicting and influencing the future, based on real-time analysis of 100% of enterprise data.

Posted September 14, 2015

It's no secret that the world of information technology is changing fast. Data is being created in ways not possible a few years ago. It is now feasible to collect and analyze data from a wide range of sources, including mobile devices, machines, social media, documents, and emails. To help organizations navigate the rapidly changing big data landscape, Big Data Quarterly presents the "Big Data 50," a list of companies driving innovation.

Posted September 09, 2015

IBM reports that nearly 15 petabytes of data are created every day; eight times greater than the amount of information stored in all of the libraries in the U.S. To keep up, enterprises are deploying web-scale architectures that enable virtualization, compute and storage functionality on a vast scale.

Posted September 08, 2015

Databricks is enhancing its cloud-based platform to strengthen its security, manageability and ease of application development. According to the vendor, the new features securely manage data access for large teams while streamlining Spark application development and deployment for enterprises dealing with complex and fast-paced environments.

Posted September 02, 2015

Hadoop distribution provider Cloudera has released Cloudera Director 1.5, an integrated solution for deploying and managing enterprise-grade Hadoop in cloud environments that enables deployment flexibility through an open API. Google Cloud Platform (GCP) is the first to integrate with the API.

Posted August 12, 2015

One of the side effects of the growth in data is the vast variety that data now comes in. While it presents advantages, the newer data types and sources also make data integration more complex. To highlight new technologies and techniques for overcoming data silos and enabling the delivery of information across the enterprise - where and when it is needed - Database Trends and Applications recently hosted a special roundtable webinar sponsored by Attunity, MarkLogic, and SnapLogic. The webinar covered the key big data integration issues facing enterprises and presented real-world solutions and best practices.

Posted August 10, 2015

Faced suddenly with the challenge of quickly extracting valuable consumer data from a massive file for customer acquisition programs, DX Marketing selected Oracle Database Cloud Service and Oracle Marketing Cloud Service to replace its on-premise computing platform.

Posted August 07, 2015

Addressing the need for more big data skills training, Apache Hadoop distribution provider MapR Technologies has announced availability of a complete Apache HBase design and development curriculum on its free Hadoop On-Demand Training program.

Posted August 06, 2015

Unless you have been trapped under an elephant, you have heard a lot about big data. Today, most IT teams are either contemplating a Hadoop strategy or are well under way in the hopes of making data actionable.

Posted August 06, 2015

A typical organization loses 5% of its revenues to fraud each year, and in the U.S. alone the total cost of (non-health) insurance fraud is estimated at more than $40 billion per year, according to Bart Baesens, one of the authors of a new book on data science techniques for fraud detection. While opportunities for fraud are continuously evolving, fraud detection always has the biggest impact if it is done quickly. Fortunately, says Baesens, big data analytics can help to make early fraud detection possible.

Posted August 05, 2015

Predixion, a developer of cloud-based advanced analytics software, is partnering with PLAT.One to deliver predictive insights for IoT applications.

Posted July 30, 2015

With the continued increase in data over the recent years, data infrastructures are changing as well. The one-size-fits-all approach for data environments is going by the wayside due to the expanding range of data types. With the growing understanding that the world of big data is heterogeneous, businesses are now employing versatile hybrid data architectures. But while they provide the right tools for the myriad requirements of big data, hybrid architectures must be properly integrated and managed to deliver their full value.

Posted July 28, 2015

Continuing its global expansion, CoolaData, which provides a solution that tracks user behavior across all channels and aggregates the data to help monetize the insights, has opened an office in New York. The company's investors include 83North (formerly Greylock IL) and Carmel Ventures.

Posted July 22, 2015

The Hortonworks Data Platform (HDP) version 2.3 is now available, with enhancements for user experience for both operators and developers, new security and data governance capabilities, a new cluster monitoring service for support subscription customers.

Posted July 22, 2015

Franz, a supplier of semantic graph database technology, has added patented N-dimensional analysis capabilities to its flagship product AllegroGraph.

Posted July 20, 2015

Kyvos Insights, a big data analytics company, emerged from stealth mode to introduce a solution called Kyvos that it says provides insights from all corporate data, regardless of size and granularity.

Posted July 13, 2015

Responding to a growing need to deliver more data to the right people at the right time, Cloudera and Teradata have announced the Teradata Appliance for Hadoop with Cloudera—the enterprise-ready Hadoop distribution.

Posted July 09, 2015

Graph databases are especially good for managing data about the connections among resources, and they are here to stay as a vital mechanism for managing data. In some enterprises, graph databases are even replacing relational databases as the primary database of record.

Posted July 08, 2015

These days, everybody talks about big data and analytics but does anybody really know how to tackle it? Everybody thinks the others are doing it and hence also claim to do it as well, but it's likely that only the fortunate have had (positive) experience(s) with it so far.

Posted June 26, 2015

Describing it as potentially the most important new open source project in a decade, IBM announced a major commitment to Apache Spark.

Posted June 15, 2015

Data is only going to continue to grow and so must data processing capabilities.

Posted June 15, 2015

Syncsort is working with Dell to help businesses improve operational efficiency and lower costs by shifting expensive workloads and associated data from enterprise data warehouses (EDW) to Hadoop.

Posted June 10, 2015

Map R is releasing version 5.0 of the MapR Distribution including Hadoop that will process big and fast data on a single data platform that enables a new class of real-time applications.

Posted June 09, 2015

Unstructured data presents many challenges — it's hard to manage, datasets can be extremely large, and it does not have a pre-defined schema. Still, enterprises and service providers who manage to tame and mine unstructured data will have the ability to drive true business transformation based on the new insights it provides.

Posted June 05, 2015

MapR Technologies, Inc., a provider of a distribution for Apache Hadoop, is including Apache Drill 1.0 in the MapR Distribution.

Posted May 19, 2015

Pages
1
2
3
4
5
6
7
8
9

Newsletters

Subscribe to Big Data Quarterly E-Edition