NoSQL Central Articles
Pentaho Corporation, an open source business analytics company, has formed a strategic partnership with DataStax, a provider of big data solutions built upon the Apache Cassandra project, a high performance NoSQL database. The relationship will provide native integration between Pentaho Kettle and Apache Cassandra. This will merge the scalable, low-latency performance of Cassandra with Kettle's visual interface for high-performance ETL, as well as integrated reporting, visualization and interactive analysis capabilities. According to the companies, organizations seeking to leverage their big data have found it difficult to implement and employ analytics technologies. "One of the big challenges today is ease of use of these tools," says Ian Fyfe, Pentaho's chief technology evangelist. Because the tools are often built on open source projects, it "takes a lot of deep skills to use these systems, and these are skills that are hard to find," he explains.
Posted March 21, 2012
In years to come, we might remember October 2011 as the month the big database vendors gave in to the dark side and embraced Hadoop. In October, both Microsoft and Oracle announced product offerings which included and embraced Hadoop as the enabler of their "big data" solution. The last of the big three database vendors - IBM - embraced Hadoop back in 2010.
Posted February 09, 2012
Businesses are struggling to cope with and leverage an explosion of complex and connected data. This need is driving many companies to adopt scalable, high performance NoSQL databases - a new breed of database solutions - in order to expand and enhance their data management strategies. Traditional "relational" databases will not be able to keep pace with "big data" demands as they were not designed to manage the types of relationships that are so essential in today's applications.
Posted January 25, 2012
Database Month, a new conference "for all things database," starts this week with a series of evening events in NYC. "We took the principle of a conference and spread it out over the entire second half of January to form what is essentially a database festival," Eric David Benari, chairman of Database Month, tells 5 Minute Briefing.
Posted January 17, 2012
MapR Technologies, Inc., a provider of an Apache Hadoop distribution, introduced a new release that expands support for C/C++ API access and Windows and Mac clients, and is also available as a virtual machine. MapR version 1.2 also includes the underpinnings to support MapReduce 2.0, which expands the types of applications that can take advantage of a Hadoop cluster.
Posted December 13, 2011
Jaspersoft, a provider of business intelligence software, has released Jaspersoft 4.5. Designed with the non-technical user in mind, Jaspersoft 4.5 delivers drag-and-drop analytics and reporting on any flavor of big data including Apache Hadoop, NoSQL and analytic databases. Key new features include an enhanced analysis user interface, an improved in-memory engine with intelligent query push-down for greater performance, and native, low-latency access to non-relational data like Hadoop HBase and MongoDB. In addition, the new release provides enhanced Excel output and extended REST APIs.
Posted December 13, 2011
As the leading provider of relational database software, it's hardly surprising that Oracle initially gave little or no credence to the NoSQL movement that emerged in 2009. Indeed, an Oracle white paper from May 2011 concluded with the recommendation to "Go for the tried and true path," and avoid NoSQL databases.
Posted December 01, 2011
Versant Corporation, a provider of data management software, announced the re-launch of their database developer community, now live on their website. The community is an online portal to help developers apply Versant's NoSQL database technologies, and includes live forums and videos, webcasts, samples and documentation for managing complex object and big data solutions.
Posted November 29, 2011
MapR Technologies, Inc., the provider of an advanced distribution for Apache Hadoop, has unveiled the MapR Training Academy, a free internet-based source for training videos and documentation to help the Hadoop user community accelerate the use of Hadoop for more business users. MapR Academy provides an introduction to Hadoop as a big data analytics platform and delves into the architecture, design and administration of Hadoop.
Posted November 15, 2011
Quest Software has begun shipping a version of its Toad data management toolkit for cloud or NoSQL databases, available as a fully functional, commercial-grade product, for free. While the term 'cloud databases' is in the product name, the toolkit is intended for a variety of non-relational databases in both cloud and on-premises settings, John Whittaker, senior manager of product marketing for Quest, tells 5 Minute Briefing. "It's for big data, NoSQL, non-relational databases, which are commonly called 'cloud databases' in the industry," he explains. "We look at them as cloud databases because they were born out of the likes of Yahoo and Google." However, he adds, "this technology doesn't have to live in the cloud. It can be in a variety of places."
Posted November 08, 2011
MapR Technologies, Inc., a provider of an advanced distribution for Apache Hadoop, and Lucid Imagination, a commercial company for Apache Lucene and Apache Solr search technology, have announced a strategic partnership. The agreement will enable Lucid Imagination to integrate the dependability, ease of use and performance of the MapR Distribution with its LucidWorks search platform. Enterprise users will be able to take advantage of the platform's underlying Lucene/Solr search innovations to better utilize the benefits of big data analytics. "The power to leverage MapR's Hadoop distribution with Lucid Works offers organizations advantages including full data protection with unlimited scale," said Frank Calderon, president and COO of Lucid Imagination.
Posted November 02, 2011
Neo Technology has launched Spring Data Neo4j 2.0, a scalable graph database, which it describes as the best NoSQL database for Spring. The announcement was made at the SpringOne 2GX 2011 Conference by Neo, a platinum sponsor of the event. This new release gives users the ability to develop applications that leverage complex, connected data, quickly on the Spring Framework. Neo also announced new customers, and named key executive additions, including Rod Johnson, Rickard Öberg, and Lars Nordwall to its team.
Posted November 02, 2011
The Oracle NoSQL Database, which was announced at Oracle OpenWorld, is now available for download from the Oracle Technology Network (OTN). Oracle also said that its new engineered system, the Oracle Big Data Appliance, an optimized platform for running the Oracle NoSQL Database and Oracle's other big data offerings that was also unveiled at OpenWorld, will ship in Q1 of the 2012 calendar year.
Posted November 02, 2011
DataStax, a provider of solutions based on the open source Apache Cassandra database platform, announced it is shipping an enterprise database platform designed to enable the management of both real-time and analytic workloads from a single environment. The new platform, DataStax Enterprise, is designed to leverage the features of Cassandra to provide performance enhancements and cost savings over traditional database management solutions, the vendor claims.
Posted October 26, 2011
One of the greatest achievements in artificial intelligence occurred earlier this year when IBM's Watson supercomputer defeated the two reigning human champions in the popular Jeopardy! TV show. Named after the IBM founder Thomas Watson and not - as you may have thought - Sherlock Holmes' famous assistant, Watson was the result of almost 5 years of intensive effort by IBM, and the intellectual successor to "Deep Blue," the first computer to beat a chess grand master.
Posted October 15, 2011
Pervasive Software Inc., a provider of solutions for cloud-based and on-premises data analysis and integration, has announced that Pervasive DataRush for Hadoop v5.1 is a Cloudera Certified Technology. "With this certification, users have the assurance that our highly parallel big data engine has been tested and validated to work with Cloudera's Distribution Including Apache Hadoop," says Mike Hoskins, Pervasive CTO and general manager, Pervasive Big Data Products and Solutions.
Posted September 29, 2011
Michael Stonebraker is widely recognized as one of the pioneers of the relational database. While at Berkeley, he co-founded the INGRES project, which implemented the relational principles published by Edgar Codd in his seminal papers. The INGRES project became the basis for the commercial Ingres RDBMS, which, during the 1980s, provided some of the most significant competition to Oracle.
Posted August 11, 2011
Cloudera Inc., a provider of Apache Hadoop-based data management software, released a solution intended to simplify the process of running and managing Hadoop throughout the operational lifecycle of a system. The new release, Cloudera Enterprise 3.5, includes new automated service, configuration and monitoring tools, and one-click security for Hadoop clusters. The software is targeted at enterprise data sites of all sizes, Charles Zedlewski, VP of product management for Cloudera, tells 5 Minute Briefing. "Cloudera Enterprise was developed to help businesses run Apache Hadoop use cases in production at all scopes of scale and criticality," he says.
Posted July 12, 2011
Datameer, Inc., a provider of end user analytics solutions built on Apache Hadoop, has announced its partnership with MapR Technologies Inc., and support for the MapR distribution for Apache Hadoop. The partnership will include joint technology and marketing efforts designed to accelerate the adoption and usage of Hadoop-based analytics. As part of the partnership, a full-featured, trial version of the DAS will be included in the MapR distribution and will be available for download.
Posted July 12, 2011
Jaded IT professionals and managers, as well as market analysts, weary and wary from decades of overblown analyst claims about emerging new technologies, "paradigm shifts" and "enchanted quadrants," will take heart in a new series of Unisphere Research studies being released over the next several months. The first of these, "The Post-Relational Reality Sets In: 2011 Survey on Unstructured Data," has just been released, and tackles the current dimensions and impact of unstructured data on enterprise IT practices, technologies, policies, purchasing priorities and the evaluation of new technologies.
Posted June 22, 2011
Both HBase and Cassandra can deal with large data sets, and provide high transaction rates and low latency lookups. Both allow map-reduce processing to be run against the database when aggregation or parallel processing is required. Why, then, would a merge of Cassandra and Hadoop be a superior solution?
Posted June 08, 2011
SnapLogic, a provider of application integration software, has introduced a solution aimed at enabling easy connection and reliable large data integration between business applications, cloud services, social media and Hadoop. The product, called SnapReduce, transforms SnapLogic data integration pipelines directly into MapReduce tasks, making Hadoop processing more accessible and resulting in optimal Hadoop cluster utilization. "This is Hadoop for humans," says Gaurav Dhillon, CEO of SnapLogic.
Posted May 25, 2011
RainStor, an infrastructure software company specializing in online data retention (OLDR), today announced that RainStor 4.5 can be deployed using Cloudera's Distribution including Apache Hadoop. "We are focused on storing and retaining data in its original form but at a much lower priced footprint than you would do with a normal relational database or a data warehouse or, in this particular case, even if you ran it on low-cost commodity hardware in Hadoop," Ramon Chen, RainStor's vice president of product management, tells 5 Minute Briefing.
Posted May 24, 2011
Teradata Corporation has announced the launch of the Aster Data SQL-MapReduce Developer Portal, a collaborative online developer community for SQL-MapReduce analytics, an emerging framework for processing non-relational data and ultra-fast analytics. "The new Aster Data SQL-MapReduce Developer Community helps bring MapReduce analytics to a broad audience of data scientists and quantitative analysts, helping organizations more easily leverage the power of MapReduce without sacrificing the familiarity and broad ecosystem support for standard SQL," Jon Bock, director of product marketing for Teradata, tells 5 Minute Briefing.
Posted May 10, 2011
DataStax, the commercial leader in Apache Cassandra, today released Brisk, a second-generation open source Hadoop distribution that the company says eliminates the key operational complexities with deploying and running Hadoop and Hive in production. Brisk is powered by Cassandra and offers a single platform containing a low-latency database for high-volume web and real-time applications, while providing tightly coupled Hadoop and Hive analytics.
Posted May 09, 2011
Starcounter, based in Stockholm, Sweden, has announced its patent-pending VMDBMS technology, which the company says is designed to improve database performance. According to the company, the key features of Starcounter are that it is a general-purpose transactional (OLTP), ACID-compliant, memory-centric object database with SQL support.
Posted May 03, 2011
Citrusleaf, a new database technology company, has officially launched and announced Citrusleaf 2.0, which the company describes as a different type of NoSQL database that combines the best practices of both database and distributed technology. Simultaneously with the product launch, the company, which was founded in 2009 by Brian Bulkowski, CEO, and Srini Srinivasan, CTO, also announced Series-A funding from Alsop Louie Partners, Kalpathi Investments and Draper Associates. The funding will be used to enhance products, expand the team, and support growing vertical market customers in the advertising, financial, government and healthcare sectors.
Posted April 26, 2011
Queplix Corp., a provider of data virtualization software, has introduced new Application Software Blades for Hive, HBase and Cassandra. Queplix's Application Software Blades enable the Queplix Virtual Data Manager platform to connect to many different source applications and data. According to the company, the new software blades for NoSQL databases can identify and extract key metadata and associated security information from the data stored within these databases, then bring it into the Queplix Engine to support data integration with other applications.
Posted April 12, 2011
"Big data" has emerged as an often-used catch phrase over the past year to describe exponentially growing data stores, and increasingly companies are bolstering their product lines to address the challenge. But helping companies manage and derive benefit from the onslaught of mainly unstructured data has consistently been the focus for MarkLogic Corporation, whose flagship product, MarkLogic Server, is a purpose-built database for unstructured information. The company, which has roughly 240 customers in industries, including media, government and financial services, today announced Ken Bado as its new chief executive officer and a member of the board of directors. "Unstructured data, literally and figuratively, is huge. Clearly, 80% of the data that is generated every day by all of us is unstructured. The question is: How do you deal with it?" Bado tells 5 Minute Briefing.
Posted April 05, 2011
Digital Reasoning, a provider of solutions for complex, large-scale unstructured data analytics, announced it has been issued a U.S. patent for its distributed system of intelligent software agents for discovering the meaning in text. The invention enables the extraction of meaning from text as humans do - by analyzing concepts and entities in context. According to the company, the software learns as it runs, continually comparing new text to existing knowledge. Associated entities and synonym relationships are automatically discovered and relevant documents are identified from across extremely large corpora.
Posted April 05, 2011
The relational database is primarily oriented toward the modeling of objects (entities) and relationships. Generally, the relational model works best when the number of relationships between objects is relatively small and static. It has long been a tricky problem in the RDBMS to work with dynamic, recursive or complex relationships. For instance, it's a fairly ordinary business requirement to print out all the parts that make up a product - including parts which, themselves, are made up of smaller parts. However, this "explosion of parts" is not consistently supported by all the relational databases. Oracle, SQL Server and DB2 have special, but inconsistent, syntax for these hierarchical queries, while MySQL and PostgreSQL lack specific support.
Posted April 05, 2011
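The "explosion of parts" mentioned in the relational database item above can be illustrated with a short recursive traversal. This is only a minimal sketch in Python, not any vendor's hierarchical query syntax; the bill of materials, the part names, and the `explode` function are all hypothetical and invented for illustration.

```python
from collections import defaultdict

# Hypothetical bill of materials (assumption, not from the article):
# each assembly maps to a list of (child part, quantity per parent).
BOM = {
    "bicycle": [("frame", 1), ("wheel", 2)],
    "wheel": [("rim", 1), ("spoke", 32), ("tire", 1)],
}


def explode(part, bom, quantity=1, totals=None):
    """Accumulate the total quantity of every leaf part under `part`."""
    if totals is None:
        totals = defaultdict(int)
    children = bom.get(part)
    if not children:
        # A part with no children is a leaf: count it and stop recursing.
        totals[part] += quantity
        return totals
    for child, per_parent in children:
        # Quantities multiply down the hierarchy.
        explode(child, bom, quantity * per_parent, totals)
    return totals


parts = dict(explode("bicycle", BOM))
```

The recursion is the step that a flat SQL join cannot express directly, since the depth of the hierarchy is not known in advance. The sketch assumes the hierarchy contains no cycles.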
10gen, a company that develops and supports MongoDB, has announced version 1.8 of the scalable data store. The latest version of MongoDB adds several new features, including journaling that allows for fast recovery in the case of a crash, support for covered and sparse indexes, and incremental map/reduce. In addition to these new features, 10gen says it continued to invest in MongoDB's performance and scale-out capabilities, with improvements to replication and sharding in this newest release.
Posted March 29, 2011
Membase (formerly NorthScale) and CouchOne have joined forces to create Couchbase, a provider of an end-to-end family of NoSQL database products. The merger will enable a lineup of data management capabilities built with Apache CouchDB document database technology, memcached distributed caching technology and the Membase data flow and cluster management system.
Posted March 01, 2011
Sybase, Inc., an SAP company, has announced that Sybase IQ v.15.2, its column-oriented analytics server, working with the IBM POWER 780 Model 9179-MHB server and running the Red Hat Enterprise Linux 6 Operating System (OS), achieved the best overall performance results among non-clustered systems for the TPC-H benchmark at 1 TB scale factor.
Posted February 22, 2011
Quest Software has unveiled support for Apache Hive in its two free NoSQL tools, Toad for Cloud Databases and OraOop. Additional enhancements include Eclipse support in Toad for Cloud Databases, and full support for Oracle RAC in OraOop. According to Billy Bosworth, vice president and general manager of Enterprise Database, Quest Software, Quest is adding functionality and additional platform support for Toad for Cloud Databases and OraOop almost in tandem with new demands it sees in the market. "Right now, we're seeing very strong adoption of the Hive system in conjunction with Hadoop, and we think the ability to store the output of Hive analytic queries in Oracle will benefit our customers by allowing them to integrate Hadoop data with traditional business intelligence solutions."
Posted February 08, 2011
Salesforce.com is well known as the pioneer of software as a service (SaaS) - the provision of hosted applications across the internet. Salesforce launched its SaaS CRM (Customer Relationship Management) product more than 10 years ago, and today claims over 70,000 customers. It's less widely known that Salesforce.com also has been a pioneer in platform as a service (PaaS), and is one of the first to provide a comprehensive internet-based application development stack. In 2007 - way before the current buzz over cloud development platforms such as Microsoft Azure - Salesforce launched the Force.com platform, which allowed developers to run applications on the same multi-tenant architecture that hosts the Salesforce.com CRM.
Posted February 02, 2011
There is a wealth of information, connections and relationships within the terabytes and petabytes of data being collected by organizations on distributed cloud platforms. Utilizing these complex, multi-dimensional relationships will be the key to developing systems to perform advanced relationship analysis. From predictive analytics to the next generation of business intelligence, "walking" the social and professional graphs will be critical to the success of these endeavors.
Posted February 02, 2011
DataStax (formerly Riptano) has unveiled DataStax OpsCenter for Apache Cassandra. A platform for managing, monitoring and operating enterprise Cassandra applications, DataStax OpsCenter provides Cassandra users with an advanced operations environment bundled with support for their real-time, high-volume, and low-latency applications. "Our customers are finding that anywhere that scale of data and real-time responsiveness are a challenge, Cassandra is proving to be the answer," said Matt Pfeil, CEO and co-founder, DataStax. "With DataStax OpsCenter we're giving these customers the confidence and control they need to deploy Cassandra for their most important applications."
Posted February 01, 2011
Jaspersoft today released software to support a variety of Big Data sources for business intelligence reporting. With the release of more than a dozen connectors as part of its open source "Big Data Reporting" project, as well as beta connectors for selected Big Data proprietary databases, Jaspersoft delivers native reporting for Hadoop, NoSQL and MPP analytic databases.
Posted January 25, 2011
The NoSQL acronym suggests it's the SQL language that is the key difference between traditional relational and newer non-relational data stores. However, an equally significant divergence is in the NoSQL consistency and transaction models. Indeed, some have suggested that NoSQL databases would be better described as "NoACID" databases - since they avoid the "ACID" transactions of the relational world.
Posted January 07, 2011
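The "ACID" transactions mentioned in the item above can be seen in a small example. This is a minimal sketch of atomicity only, using Python's standard `sqlite3` module as a stand-in for a relational database; the `accounts` table, the names, and the `transfer` function are hypothetical and invented for illustration.

```python
import sqlite3

# In-memory database with a hypothetical accounts table (assumption).
# The CHECK constraint forbids negative balances.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts ("
    "name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany(
    "INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 50)]
)
conn.commit()


def transfer(conn, src, dst, amount):
    """Move `amount` from src to dst; both updates apply or neither does."""
    try:
        # Used as a context manager, the connection commits on success
        # and rolls back if an exception is raised inside the block.
        with conn:
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        # The debit violated the CHECK constraint; the credit was undone.
        return False


ok = transfer(conn, "alice", "bob", 30)
failed = transfer(conn, "alice", "bob", 500)
balances = dict(conn.execute("SELECT name, balance FROM accounts"))
```

The second transfer credits one account before the debit fails, and the rollback undoes that credit. Many NoSQL stores of this period offered no multi-row guarantee of this kind, which is the gap the "NoACID" label points at.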
Karmasphere has introduced Karmasphere Analyst, productivity software for data professionals working with massive data sets. "Karmasphere is focused on providing software for developers and analysts to work with Big Data in Hadoop clusters. Over the past 12 months, we have been introducing products specifically focused on developers who are early adopters of Hadoop and are working on Big Data in Hadoop, but what we have heard time and time again from enterprise organizations is the need to unlock access to data in Hadoop clusters for enterprise analysts," Martin Hall, CEO of Karmasphere, tells 5 Minute Briefing.
Posted December 21, 2010
Because any database that does not support the SQL language is, by definition, a "NoSQL" database, some very different databases coexist under the NoSQL banner. Massively scalable data stores like Cassandra, Voldemort, and HBase sacrifice structure to achieve scale-out performance. However, the document-oriented NoSQL databases have very different architectures and objectives.
Posted November 30, 2010
Sybase, Inc., an SAP company, has announced the release of Sybase IQ 15.3 to beta customers. The new release introduces the PlexQ Distributed Query Platform, a massively parallel processing (MPP) architecture that accelerates highly complex queries by distributing work to many computers in a grid configuration. GA is planned for the first half of 2011.
Posted November 17, 2010