Newsletters




Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

Data management has never been so unfettered—and yet so complicated at the same time. An emerging generation of tools and platforms is helping enterprises to get more value from their data than ever. These solutions now support and automate a large swath of structural activities, from data ingestion to storage, and also enhance business-focused operations such as advanced analytics, AI, machine learning, and continuous real-time intelligence.

Posted February 08, 2022

As an industry, we've been talking about the promise of data lakes for more than a decade. It's a fantastic concept—to put an end to data silos with a single repos­itory for big data analytics. Imagine having a singular place to house all your data for analytics to support product-led growth and business insight.

Posted February 08, 2022

Most database administrators know database servers didn't initially come in a cloud or cluster. Once upon a time, DBAs had to reconfigure disk files and handle data manually. Now, with virtualization and the shift toward the cloud, the evolution of database administration yields more opportunities to automate tasks and fewer reasons for DBAs to get their hands dirty.

Posted February 08, 2022

Each year, Data Summit features industry-leading experts covering the topics that matter most for data professionals who want to stay on top of the latest technologies and strategies. The conference program is now available for review, and a variety of pass options are being offered, including special pricing for attendees who register early.

Posted February 08, 2022

Directus, a software company democratizing the future of data management, is releasing Directus 9, an Open Data Platform that can power any data driven app or digital experience. With a new codebase built on Node.js and Vue.js 3, Directus 9 achieves  higher performance than previous versions for near-instant SQL query responses, whether browsing vast datasets in the Directus app or performing deeply nested relational API requests, according to the vendor.

Posted February 02, 2022

The noted motivational speaker and author Zig Ziglar was quoted as saying, "When obstacles arise, you change your direction to reach your goal; you do not change your decision to get there." This sentiment rings truer today than ever. In the past 21 months, companies across every industry have had to alter their "go-to-market" strategies to simply survive. For the local diner, this may have initially meant relying on take-out orders, then finding ways to create outdoor seating areas, and later dealing with the weather inconveniently imposing itself on outdoor patrons. For much of corporate America, changing their company's direction to reach their goals has meant converting traditional workforces to remote employees.

Posted January 18, 2022

The Data Summit conference focuses on the business and technical aspects of Big Data, Data Management, DevOps, Data Management, AI, Machine Learning, and the ramifications of working in a data-driven environment.

Posted December 15, 2021

Organizations that use MySQL or MariaDB databases for business-critical functions may struggle with direct attached storage (DAS) limitations for these deployments. Typically these platforms can accelerate access to data, increase business agility, and deliver business breakthroughs.

Posted December 15, 2021

The importance of leveraging data quickly and effectively is a message that has come through loud and clear in recent years—and with increasing intensity since the onset of the COVID-19 pandemic. Whether it is anticipating supply chain problems, addressing customer concerns with agility, or identifying new opportunities and pouncing quickly, the ability to achieve a comprehensive view of all available information for real-time decision making has become a strong theme. To help make the process of identifying useful products and services easier, here, DBTA presents a list of Trend-Setting Products for 2022.

Posted December 08, 2021

Databricks, the Data and AI company and a provider of data lakehouse architecture, is acquiring the German startup, 8080 Labs, enabling the company to integrate UI-driven capabilities across Databricks' Lakehouse Platform—marking the company's expansion into the low-code/no-code space.

Posted November 12, 2021

NS1, a provider in application traffic intelligence and automation, is releasing its cloud-managed solution for DNS, DHCP, and IP address management (DDI), delivered through the NS1 Connect platform. NS1 Cloud-Managed DDI enables organizations to deliver core network services across their distributed network footprint with the agility of software-based deployment, the scale of cloud-native operations, and the operating efficiency of SaaS management, according to the company.

Posted November 03, 2021

Immuta's Sumit Sarkar discussed the challenges organizations face with implementing and maintaining cloud data ecosystems—particularly as more cloud data platforms emerge—during his presentation at Data Summit Connect 2021.       

Posted November 03, 2021

Provectus, a Silicon Valley artificial intelligence (AI) consultancy, is debuting enhancements to its Open-Source Data Discovery (ODD) and Observability Platform v0.2, upgrading the platform's data discovery and observability capabilities while adding new features for data quality assurance and support of new, third-party service adapters, including Amazon Athena, Amazon SageMaker Feature Store, Feast, and Great Expectations (GE).

Posted October 29, 2021

GridGain Systems, provider of enterprise-grade in-memory computing solutions powered by the Apache  Ignite distributed database, is now offering GridGain Nebula, the company's pay-as-you-go, cloud-native in-memory computing service, to all GridGain and Apache Ignite users. Previously, Nebula was available only to the world's largest enterprises.

Posted October 29, 2021

Hackolade founder and CEO Pascal Desmarets explained how polyglot persistence enables organizations to leverage the strength of multiple data stores, deal with scale more efficiently, and more during his presentation at Data Summit Connect 2021.       

Posted October 28, 2021

Nexla, the unified data operations company, today announced it has secured $12 million in a Series A funding round, enabling the company to continue to deliver on growing enterprise demand for ready-to-use data. Founded in 2016, Nexla streamlines the process of getting ready-to-use data to more applications and more users without having to use tens of different tools.

Posted October 18, 2021

New Relic, the observability company, is releasing New Relic Instant Observability (I/O), an open source ecosystem of quickstarts to empower all software engineers to instrument, dashboard, and alert their entire technology stack. New Relic I/O is introducing an ever-expanding, open ecosystem of knowledge-focused resources that codifies the collective experience of the world's observability experts and practitioners to help engineers around the world unlock the power of their data faster, according to the vendor.

Posted October 18, 2021

Domino Data Lab, provider of an Enterprise MLOps platform, announced it has received $100 million in a latest funding round along with expanding its partnership with NVIDIA to further integrate products and expand joint sales efforts to support customers' efforts to build model-driven businesses.

Posted October 12, 2021

Cube Dev, the open-source company behind Cube.js, is releasing Cube Cloud, a hosted version of the company's open source Cube.js analytics API. With Cube Cloud, companies build data applications like metrics dashboards and analytics features that consume data from their cloud data warehouse—without building or hosting any of the complex technologies required to make this possible, according to the vendor.

Posted October 07, 2021

The proliferation of data sources, types, and stores is increasing the challenge of combining data into meaningful, actionable information. As a result, the need for faster and smarter data integration capabilities is growing. In a recent survey, nearly half of DBTA subscribers indicated that real-time insights are critical to their data strategies. At the same time, to deliver actual value, people need information they can trust, so balancing governance is essential nowadays, especially with ongoing regulatory requirements.

Posted October 05, 2021

Zadara, a provider of edge cloud services, is partnering with Zenlayer, enabling the companies to offer managed storage solutions that businesses can deploy from on-premises data centers, private colocation facilities, or the cloud. The addition of Zadara's zStorage enables Zenlayer to provide backup and disaster recovery solutions on a global scale—even at the edge, closer to where data is generated and consumed.

Posted October 01, 2021

Matillion, a leading cloud data integration platform, announced it has secured $150 million in Series E funding, empowering the company to continue to grow its cloud analytics, AI, and machine learning for large global enterprises. The funding round was led by General Atlantic, a global growth equity firm, with participation from Battery Ventures, Sapphire Ventures, Scale Venture Partners, and Lightspeed Venture Partners.

Posted September 23, 2021

Nutanix, a provider of hybrid multi-cloud computing, is adding new capabilities in the Nutanix Cloud Platform that make it easier for customers to simplify data management and optimize database and big data workload performance.

Posted September 23, 2021

The Call for Speakers is now open for Data Summit 2022 which will be held at the Hyatt Regency Boston May 17-18, 2022, with pre-conference workshops on May 16, 2022. The Data Summit conference focuses on the business and technical aspects of Big Data, Data Management, DevOps, Data Management, AI, Machine Learning, and the ramifications of working in a data-driven environment.

Posted September 09, 2021

Today, organizations need data-driven insights to advance decision making at all levels and digital transformation is a key component of those efforts. Supporting data-driven insights and digital transformation takes an ever-growing range of services, products, and tools from forward-thinking companies that are working to help their customers deliver the right insights to the right people at the right time.

Posted September 08, 2021

Confluent, Inc., the platform to set data in motion, is launching the Confluent Q3 '21 Release, featuring developments that help organizations reliably share data between different environments, seamlessly integrate with business-critical applications, and cost-effectively store data needed for next-generation, digital customer experiences and data-driven backend operations.

Posted August 17, 2021

Surviving and thriving with data science and machine learning means not only having the right platforms, tools and skills, but identifying use cases and implementing processes that can deliver repeatable, scalable business value. The challenges are numerous, from selecting data sets and data platforms, to architecting and optimizing data pipelines, and model training and deployment. As a result, new solutions have emerged to deliver key capabilities in areas including visualization, self-service and real-time analytics. Along with the rise of DataOps, greater collaboration and automation have been identified as key success factors.

Posted August 05, 2021

One of the challenges of working with Hadoop environments has been maintaining the infrastruc­ture for big data projects. That's where cloud makes things easier and, increas­ingly, has served as the underlying infra­structure platform of choice for Hadoop initiatives. At the same time, not every­thing has moved to the cloud just yet for big data environments. Many IT managers expect to live in a hybrid environment. They are planning for multi-cloud data management to deliver business value and are also still relying on old-school approaches and manual tools to support their data environments.

Posted August 02, 2021

Airbyte, creators of a fast-growing open-source data integration platform, is releasing an open source data integration for data lakes, enabling AWS users to replicate data from anywhere to their Amazon Simple Storage Service (S3) account. Companies are now able to leverage Airbyte's 75-plus pre-built connectors, or build their own custom connectors within two hours using Airbyte's Connector Development Kit (CDK), in order to replicate their data to S3.

Posted July 09, 2021

We're still at the start of the 2020s, and already, things look very different from the preceding decade. For data executives and profession­als, the years ahead may mean change on a scale never seen before in the IT industry. Promising new technologies—as well as redesigned and repurposed older ones—are reshaping the data center and analytics shops in new and exciting ways. We asked industry leaders for their views on what is enhancing the ability of enterprises to compete on data.

Posted June 10, 2021

Founded by the creators of Apache Kylin, venture-backed Kyligence and is dual-headquartered in San Jose, California, and Shanghai, China. Luke Han, co-founder and CEO at Kyligence, recently explained the company's connection to the open source project, its future goals, where it fits into the global data management ecosystem, and how it plans to differentiate itself from competitors.

Posted June 08, 2021

The one-size-fits-all RDBMS has given way to an explosion of diverse data management technologies. In a  session titled "Next-Generation Databases" at Data Summit Connect 2021, Guy Harrison looked at the history of data management from the mainframe through Hadoop to blockchain and considered the utility of  new database technologies for leveraging data assets, and speculated on how these will evolve to meet tomorrow's data needs.

Posted June 02, 2021

The move to next-generation databases is driven by their ability to help companies achieve competitiveness and reach customers faster and more efficiently. These new breeds of systems can be a force for business transformation—whether it is generating new sources of revenue, enhancing customer experience, or producing data-driven insights that improve how organizations interact with customers.

Posted May 26, 2021

Rubrik, the Cloud Data Management Company, is introducing major data security features that enable organizations around the world to easily and accurately assess the impact of ransomware attacks and automate recovery operations. Rubrik's data security provides an important line of defense against these common threats and helps IT teams to answer the most pressing questions regarding their business data: What is the content of the data? What is happening to the data? Who is accessing important business information?

Posted May 18, 2021

Data lakes are one of the fastest growing trends in managing big data across various industries. However, the rise of event streaming has created a new technology category for stream processing using frameworks like Apache Flink and Kafka Streams. Nishith Agarwal, engineering manager, Uber, and Sivabalan Narayanan, senior software engineer, Uber discussed "Apache Hudi: The Streaming Data Lake Platform" during their Data Summit Connect 2021 presentation.

Posted May 12, 2021

As software infrastructure is stretched between on premise, public clouds, and hybrid clouds, keeping software in compliance is a significant challenge. Michael Corey, co-founder, COO, LicenseFortress and Don Sullivan, product line manager, business critical applications, VMware discussed current software license trends, the difference between Oracle policy and your contractual obligations, licensing Oracle on a virtualized environment, and licensing best practices, during their Data Summit Connect 2021 presentation, "Database Licensing: Best Practices and Pitfalls."

Posted May 12, 2021

Migrating a large-scale Hadoop cluster to the cloud is challenging, especially when the cluster is very active and downtime during the migration is not an option. Tony Velcich, senior director, product marketing, WANdisco, and Ken Seier, chief architect, data and AI, Insight discussed the challenges involved in such a migration during their Data Summit Connect 2021 presentation, "Considerations for Large Scale Hadoop Data Migration to the Cloud."

Posted May 11, 2021

It's time to cast your vote for the annual Database Trends and Applications Readers' Choice Awards, a competition in which the winning information management solutions, products, and services are selected by you, our readers.

Posted April 29, 2021

From machine learning and automation to hybrid and multi-cloud environments, technology trends continue to reshape the practice of database management. As a result, database professionals face new challenges and opportunities. Today, the average database team is tasked with managing more databases, bigger databases, and a greater variety of databases—from the ground to the cloud.

Posted April 27, 2021

Marvell Technology, Inc., a leader in data infrastructure semiconductor solutions, announced it has completed its acquisition of Inphi Corporation, creating a U.S. semiconductor powerhouse positioned for end-to-end technology leadership in data infrastructure.

Posted April 23, 2021

As the world of data analytics continues to evolve and reshape after a tumultuous 2020, the need for agility is rapidly driving a new era in data culture in which it is imperative to handle data immediately and at scale. While emphasis on self-service data and analytics has been top-of-mind for some time now, the shift to self-sufficiency is held back by culture, not technology. With the new year pushing more robotics process automation at all levels of the business—and for all data users—organizations are becoming more acutely aware that true enablement isn't just about tools and tech. It's about people.

Posted April 05, 2021

The world changed over the last year. Future historians will complete their theses focusing on different quarters or even specific months of 2020. But one of the most overused cliches in thinking about this period of time has been the idea that "the more things change, the more they remain the same." Let's consider sports in 2020. Major League Baseball had a 60-game season, the NBA finals were played in October, and cardboard cutouts took the place of fans in every sport. However, the Lakers won the NBA finals, the Dodgers won the World Series with the Yankees playing deep into the playoffs, and Tom Brady went to his 10th Super Bowl. The more things change …

Posted April 05, 2021

Since its emergence, many companies have made great strides with DevOps, advancing their development processes. However, for DevOps to continue transforming enterprises, it must evolve to address longstanding challenges, take advantage of new opportunities, and spread beyond its comfort zone.

Posted April 05, 2021

Data may be at the heart of all digital engagements, but most enterprises are still behind the curve when it comes to effectively identifying and managing it. That's the takeaway from the latest survey of 419 enterprise executives from BARC, which finds continuing challenges with identifying and surfacing the data assets needed to succeed in today's digital economy.

Posted April 02, 2021

Faster decision making enabled by access to role-appropriate information is the goal of organizations striving to become data-driven. At the same time, there is strong pressure on companies to ensure data quality and trustworthiness, as well as to maintain data security to avoid breaches and risk regulatory non-compliance.

Posted April 02, 2021

Spectra Logic, a leader in data storage and data management solutions, is releasing the publication of its annual "Data Storage Outlook" report, which explores how the world manages, accesses, uses, and preserves its ever-expanding data repositories.

Posted April 01, 2021

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18

Sponsors