Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

Databricks, the Data and AI company and a provider of data lakehouse architecture, is acquiring the German startup, 8080 Labs, enabling the company to integrate UI-driven capabilities across Databricks' Lakehouse Platform—marking the company's expansion into the low-code/no-code space.

Posted November 12, 2021

NS1, a provider in application traffic intelligence and automation, is releasing its cloud-managed solution for DNS, DHCP, and IP address management (DDI), delivered through the NS1 Connect platform. NS1 Cloud-Managed DDI enables organizations to deliver core network services across their distributed network footprint with the agility of software-based deployment, the scale of cloud-native operations, and the operating efficiency of SaaS management, according to the company.

Posted November 03, 2021

Immuta's Sumit Sarkar discussed the challenges organizations face with implementing and maintaining cloud data ecosystems—particularly as more cloud data platforms emerge—during his presentation at Data Summit Connect 2021.       

Posted November 03, 2021

Provectus, a Silicon Valley artificial intelligence (AI) consultancy, is debuting enhancements to its Open-Source Data Discovery (ODD) and Observability Platform v0.2, upgrading the platform's data discovery and observability capabilities while adding new features for data quality assurance and support of new, third-party service adapters, including Amazon Athena, Amazon SageMaker Feature Store, Feast, and Great Expectations (GE).

Posted October 29, 2021

GridGain Systems, provider of enterprise-grade in-memory computing solutions powered by the Apache  Ignite distributed database, is now offering GridGain Nebula, the company's pay-as-you-go, cloud-native in-memory computing service, to all GridGain and Apache Ignite users. Previously, Nebula was available only to the world's largest enterprises.

Posted October 29, 2021

Hackolade founder and CEO Pascal Desmarets explained how polyglot persistence enables organizations to leverage the strength of multiple data stores, deal with scale more efficiently, and more during his presentation at Data Summit Connect 2021.       

Posted October 28, 2021

Nexla, the unified data operations company, today announced it has secured $12 million in a Series A funding round, enabling the company to continue to deliver on growing enterprise demand for ready-to-use data. Founded in 2016, Nexla streamlines the process of getting ready-to-use data to more applications and more users without having to use tens of different tools.

Posted October 18, 2021

New Relic, the observability company, is releasing New Relic Instant Observability (I/O), an open source ecosystem of quickstarts to empower all software engineers to instrument, dashboard, and alert their entire technology stack. New Relic I/O is introducing an ever-expanding, open ecosystem of knowledge-focused resources that codifies the collective experience of the world's observability experts and practitioners to help engineers around the world unlock the power of their data faster, according to the vendor.

Posted October 18, 2021

Domino Data Lab, provider of an Enterprise MLOps platform, announced it has received $100 million in a latest funding round along with expanding its partnership with NVIDIA to further integrate products and expand joint sales efforts to support customers' efforts to build model-driven businesses.

Posted October 12, 2021

Cube Dev, the open-source company behind Cube.js, is releasing Cube Cloud, a hosted version of the company's open source Cube.js analytics API. With Cube Cloud, companies build data applications like metrics dashboards and analytics features that consume data from their cloud data warehouse—without building or hosting any of the complex technologies required to make this possible, according to the vendor.

Posted October 07, 2021

The proliferation of data sources, types, and stores is increasing the challenge of combining data into meaningful, actionable information. As a result, the need for faster and smarter data integration capabilities is growing. In a recent survey, nearly half of DBTA subscribers indicated that real-time insights are critical to their data strategies. At the same time, to deliver actual value, people need information they can trust, so balancing governance is essential nowadays, especially with ongoing regulatory requirements.

Posted October 05, 2021

Zadara, a provider of edge cloud services, is partnering with Zenlayer, enabling the companies to offer managed storage solutions that businesses can deploy from on-premises data centers, private colocation facilities, or the cloud. The addition of Zadara's zStorage enables Zenlayer to provide backup and disaster recovery solutions on a global scale—even at the edge, closer to where data is generated and consumed.

Posted October 01, 2021

Matillion, a leading cloud data integration platform, announced it has secured $150 million in Series E funding, empowering the company to continue to grow its cloud analytics, AI, and machine learning for large global enterprises. The funding round was led by General Atlantic, a global growth equity firm, with participation from Battery Ventures, Sapphire Ventures, Scale Venture Partners, and Lightspeed Venture Partners.

Posted September 23, 2021

Nutanix, a provider of hybrid multi-cloud computing, is adding new capabilities in the Nutanix Cloud Platform that make it easier for customers to simplify data management and optimize database and big data workload performance.

Posted September 23, 2021

The Call for Speakers is now open for Data Summit 2022 which will be held at the Hyatt Regency Boston May 17-18, 2022, with pre-conference workshops on May 16, 2022. The Data Summit conference focuses on the business and technical aspects of Big Data, Data Management, DevOps, Data Management, AI, Machine Learning, and the ramifications of working in a data-driven environment.

Posted September 09, 2021

Today, organizations need data-driven insights to advance decision making at all levels and digital transformation is a key component of those efforts. Supporting data-driven insights and digital transformation takes an ever-growing range of services, products, and tools from forward-thinking companies that are working to help their customers deliver the right insights to the right people at the right time.

Posted September 08, 2021

Confluent, Inc., the platform to set data in motion, is launching the Confluent Q3 '21 Release, featuring developments that help organizations reliably share data between different environments, seamlessly integrate with business-critical applications, and cost-effectively store data needed for next-generation, digital customer experiences and data-driven backend operations.

Posted August 17, 2021

Surviving and thriving with data science and machine learning means not only having the right platforms, tools and skills, but identifying use cases and implementing processes that can deliver repeatable, scalable business value. The challenges are numerous, from selecting data sets and data platforms, to architecting and optimizing data pipelines, and model training and deployment. As a result, new solutions have emerged to deliver key capabilities in areas including visualization, self-service and real-time analytics. Along with the rise of DataOps, greater collaboration and automation have been identified as key success factors.

Posted August 05, 2021

One of the challenges of working with Hadoop environments has been maintaining the infrastruc­ture for big data projects. That's where cloud makes things easier and, increas­ingly, has served as the underlying infra­structure platform of choice for Hadoop initiatives. At the same time, not every­thing has moved to the cloud just yet for big data environments. Many IT managers expect to live in a hybrid environment. They are planning for multi-cloud data management to deliver business value and are also still relying on old-school approaches and manual tools to support their data environments.

Posted August 02, 2021

Airbyte, creators of a fast-growing open-source data integration platform, is releasing an open source data integration for data lakes, enabling AWS users to replicate data from anywhere to their Amazon Simple Storage Service (S3) account. Companies are now able to leverage Airbyte's 75-plus pre-built connectors, or build their own custom connectors within two hours using Airbyte's Connector Development Kit (CDK), in order to replicate their data to S3.

Posted July 09, 2021

We're still at the start of the 2020s, and already, things look very different from the preceding decade. For data executives and profession­als, the years ahead may mean change on a scale never seen before in the IT industry. Promising new technologies—as well as redesigned and repurposed older ones—are reshaping the data center and analytics shops in new and exciting ways. We asked industry leaders for their views on what is enhancing the ability of enterprises to compete on data.

Posted June 10, 2021

Founded by the creators of Apache Kylin, venture-backed Kyligence and is dual-headquartered in San Jose, California, and Shanghai, China. Luke Han, co-founder and CEO at Kyligence, recently explained the company's connection to the open source project, its future goals, where it fits into the global data management ecosystem, and how it plans to differentiate itself from competitors.

Posted June 08, 2021

The one-size-fits-all RDBMS has given way to an explosion of diverse data management technologies. In a  session titled "Next-Generation Databases" at Data Summit Connect 2021, Guy Harrison looked at the history of data management from the mainframe through Hadoop to blockchain and considered the utility of  new database technologies for leveraging data assets, and speculated on how these will evolve to meet tomorrow's data needs.

Posted June 02, 2021

The move to next-generation databases is driven by their ability to help companies achieve competitiveness and reach customers faster and more efficiently. These new breeds of systems can be a force for business transformation—whether it is generating new sources of revenue, enhancing customer experience, or producing data-driven insights that improve how organizations interact with customers.

Posted May 26, 2021

Rubrik, the Cloud Data Management Company, is introducing major data security features that enable organizations around the world to easily and accurately assess the impact of ransomware attacks and automate recovery operations. Rubrik's data security provides an important line of defense against these common threats and helps IT teams to answer the most pressing questions regarding their business data: What is the content of the data? What is happening to the data? Who is accessing important business information?

Posted May 18, 2021

Data lakes are one of the fastest growing trends in managing big data across various industries. However, the rise of event streaming has created a new technology category for stream processing using frameworks like Apache Flink and Kafka Streams. Nishith Agarwal, engineering manager, Uber, and Sivabalan Narayanan, senior software engineer, Uber discussed "Apache Hudi: The Streaming Data Lake Platform" during their Data Summit Connect 2021 presentation.

Posted May 12, 2021

As software infrastructure is stretched between on premise, public clouds, and hybrid clouds, keeping software in compliance is a significant challenge. Michael Corey, co-founder, COO, LicenseFortress and Don Sullivan, product line manager, business critical applications, VMware discussed current software license trends, the difference between Oracle policy and your contractual obligations, licensing Oracle on a virtualized environment, and licensing best practices, during their Data Summit Connect 2021 presentation, "Database Licensing: Best Practices and Pitfalls."

Posted May 12, 2021

Migrating a large-scale Hadoop cluster to the cloud is challenging, especially when the cluster is very active and downtime during the migration is not an option. Tony Velcich, senior director, product marketing, WANdisco, and Ken Seier, chief architect, data and AI, Insight discussed the challenges involved in such a migration during their Data Summit Connect 2021 presentation, "Considerations for Large Scale Hadoop Data Migration to the Cloud."

Posted May 11, 2021

It's time to cast your vote for the annual Database Trends and Applications Readers' Choice Awards, a competition in which the winning information management solutions, products, and services are selected by you, our readers.

Posted April 29, 2021

From machine learning and automation to hybrid and multi-cloud environments, technology trends continue to reshape the practice of database management. As a result, database professionals face new challenges and opportunities. Today, the average database team is tasked with managing more databases, bigger databases, and a greater variety of databases—from the ground to the cloud.

Posted April 27, 2021

Marvell Technology, Inc., a leader in data infrastructure semiconductor solutions, announced it has completed its acquisition of Inphi Corporation, creating a U.S. semiconductor powerhouse positioned for end-to-end technology leadership in data infrastructure.

Posted April 23, 2021

As the world of data analytics continues to evolve and reshape after a tumultuous 2020, the need for agility is rapidly driving a new era in data culture in which it is imperative to handle data immediately and at scale. While emphasis on self-service data and analytics has been top-of-mind for some time now, the shift to self-sufficiency is held back by culture, not technology. With the new year pushing more robotics process automation at all levels of the business—and for all data users—organizations are becoming more acutely aware that true enablement isn't just about tools and tech. It's about people.

Posted April 05, 2021

The world changed over the last year. Future historians will complete their theses focusing on different quarters or even specific months of 2020. But one of the most overused cliches in thinking about this period of time has been the idea that "the more things change, the more they remain the same." Let's consider sports in 2020. Major League Baseball had a 60-game season, the NBA finals were played in October, and cardboard cutouts took the place of fans in every sport. However, the Lakers won the NBA finals, the Dodgers won the World Series with the Yankees playing deep into the playoffs, and Tom Brady went to his 10th Super Bowl. The more things change …

Posted April 05, 2021

Since its emergence, many companies have made great strides with DevOps, advancing their development processes. However, for DevOps to continue transforming enterprises, it must evolve to address longstanding challenges, take advantage of new opportunities, and spread beyond its comfort zone.

Posted April 05, 2021

Data may be at the heart of all digital engagements, but most enterprises are still behind the curve when it comes to effectively identifying and managing it. That's the takeaway from the latest survey of 419 enterprise executives from BARC, which finds continuing challenges with identifying and surfacing the data assets needed to succeed in today's digital economy.

Posted April 02, 2021

Faster decision making enabled by access to role-appropriate information is the goal of organizations striving to become data-driven. At the same time, there is strong pressure on companies to ensure data quality and trustworthiness, as well as to maintain data security to avoid breaches and risk regulatory non-compliance.

Posted April 02, 2021

Spectra Logic, a leader in data storage and data management solutions, is releasing the publication of its annual "Data Storage Outlook" report, which explores how the world manages, accesses, uses, and preserves its ever-expanding data repositories.

Posted April 01, 2021

From hybrid and multicloud, to real-time analytics and AI, a strong data architecture strategy is critical to supporting an organization's goals. Greater speed, flexibility and scalability are common wish-list items, alongside smarter data governance and security capabilities. DBTA recently held a special roundtable webinar with Danny Sandwell, director of product marketing, erwin, Inc.; Paul Lacey, senior director of product marketing, Matillion; and Michael Distler, senior Director of Product Marketing, Qlik, who discussed the top trends in modern data architecture for 2021.

Posted March 26, 2021

Overcoming travel challenges, Data Summit Connect 2021, presented by DBTA and Big Data Quarterly, is a virtual event that will run May 11-12 and include provocative sessions, exhibits, and opportunities to network. In addition, preconference workshops will be held on May 10.

Posted March 22, 2021

Cloudflare, Inc., the security, performance, and reliability company helping to build a better Internet, is introducing Magic WAN with Magic Firewall along with forming new strategic partnerships with major networking and data center providers as part of Cloudflare One, its cloud-based network-as-a-service solution.

Posted March 22, 2021

Precisely, a leader in data integrity, is being acquired by Capital Group, L.P. (together with its affiliates, "Clearlake") and TA Associates.

Posted March 22, 2021

Instaclustr, delivering reliability at scale through fully managed open source data technologies, is acquiring credativ, adding a rich collection of open source software and services to Instaclustr's portfolio.

Posted March 17, 2021

Our new industrial era poses a paradox for every manufacturer. The increased revenues driven by high consumer demand often conceal the pressure felt on margins due to rising material costs and constant labor shortages. Consequently, many manufacturers seek supply-chain innovations to optimize their asset utilization, reduce production waste, minimize re-work, and produce reliable lead times. However, none of these efficiencies are possible without a modern data infrastructure.

Posted March 16, 2021

Machine learning is becoming the go-to solution for greater automation and intelligence. A recent study fielded amongst the subscribers of DBTA found that 48% currently have machine learning initiatives underway with another 20% considering adoption. At the same time, most projects are still in the early phases. DBTA recently held a roundtable webinar with Gaurav Deshpande, VP of marketing, TigerGraph; Santiago Giraldo, director of product marketing data engineering and machine learning, Cloudera; and Paige Roberts, open source relations manager, Vertica, who discussed key technologies and strategies for maximizing machine learning's impact.

Posted March 12, 2021

WANdisco, the LiveData company, is forming a partnership with Snowflake, the Data Cloud company, to automate, accelerate, and simplify the migration of on-premises Hadoop analytics workloads to Snowflake's data platform.

Posted March 12, 2021

Navisite announced it is now a Google Cloud Partner, a designation that recognizes the company as an authorized managed services provider on Google Cloud. As a Google Cloud Partner, Navisite not only demonstrates the required knowledge and expertise to successfully migrate customers to Google Cloud but also the commitment and partnership with Google Cloud to help customers maximize business growth, innovation, and profitability.

Posted March 09, 2021

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16

Sponsors