Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

The 9th annual Data Summit conference will be held May 17-18, 2022, at the Hyatt Regency Boston. Pre-conference workshops will take place on May 16, 2022. The program is available for review and a variety of pass options are available to suit individual requirements.

Posted May 04, 2022

Each year, Data Summit features industry-leading experts covering the topics that matter most for data professionals who want to stay on top of the latest technologies and strategies. The conference program is now available for review, and a variety of pass options are being offered, including special pricing for attendees who register early.

Posted April 28, 2022

Infoworks.io announced that Infoworks Replicator 4.0 is enabling the migration of on-prem Hadoop data lakes to the cloud faster with one-third the resources required of traditional approaches.

Posted April 13, 2022

It's time to vote for the annual Database Trends and Applications Readers' Choice Awards, a competition in which the winning information management solutions, products, and services are selected by you, the readers. The voting period will be open through Wednesday, May 11. Winners will be showcased in a special section on the DBTA website and in the August 2022 issue of Database Trends and Applications magazine.

Posted March 28, 2022

Major technology trends are reshaping the DBA role at many organizations. The size and complexity of database environments continue to grow with higher data volumes, more workloads, and an increasing rate of database deployments that need to be managed. To help IT decision makers and database professionals tackle these changes, challenges, and opportunities, DBTA recently held a special roundtable webinar with Kathryn Sizemore, senior MySQL/MariaDB database administrator, Datavail, and Devin Gallagher, senior sales engineer, IDERA.

Posted March 11, 2022

At a time when business agility is as important as ever, new technology trends in DevOps, cloud, automation, and data management are increasingly merging as organizations look to streamline complex processes and shorten the time-to-value of new initiatives. This agile journey is about removing the barriers between the business and its customers, as well as its data and employees, to accelerate the delivery of business value.

Posted March 09, 2022

The costs of downtime—even for a minute—are simply too steep for today's digitally evolving enterprises to tolerate. As part of their efforts to keep expensive downtime at bay—and ensure the continued viability and availability of data—data managers are increasingly turning to strategies such as automation and cloud services. Still, they continue to have difficulties and acknowledge that keeping their data environments up-to-date is holding them back from delivering more capabilities to their organizations.

Posted March 07, 2022

As an industry, we've been talking about the promise of data lakes for more than a decade. It's a fantastic concept: to put an end to data silos with a single repository for big data analytics. Imagine having a singular place to house all your data for analytics to support product-led growth and business insight.

Posted February 24, 2022

Data management has never been so unfettered—and yet so complicated at the same time. An emerging generation of tools and platforms is helping enterprises to get more value from their data than ever. These solutions now support and automate a large swath of structural activities, from data ingestion to storage, and also enhance business-focused operations such as advanced analytics, AI, machine learning, and continuous real-time intelligence.

Posted February 22, 2022

It's time to submit nominations for the annual Database Trends and Applications Readers' Choice Awards, a competition in which the winning information management solutions, products, and services are selected by you, the users.

Posted February 09, 2022

Most database administrators know database servers didn't initially come in a cloud or cluster. Once upon a time, DBAs had to reconfigure disk files and handle data manually. Now, with virtualization and the shift toward the cloud, the evolution of database administration yields more opportunities to automate tasks and fewer reasons for DBAs to get their hands dirty.

Posted February 08, 2022

Directus, a software company democratizing the future of data management, is releasing Directus 9, an Open Data Platform that can power any data-driven app or digital experience. With a new codebase built on Node.js and Vue.js 3, Directus 9 achieves higher performance than previous versions for near-instant SQL query responses, whether browsing vast datasets in the Directus app or performing deeply nested relational API requests, according to the vendor.

Posted February 02, 2022

The noted motivational speaker and author Zig Ziglar was quoted as saying, "When obstacles arise, you change your direction to reach your goal; you do not change your decision to get there." This sentiment rings truer today than ever. In the past 21 months, companies across every industry have had to alter their "go-to-market" strategies to simply survive. For the local diner, this may have initially meant relying on take-out orders, then finding ways to create outdoor seating areas, and later dealing with the weather inconveniently imposing itself on outdoor patrons. For much of corporate America, changing their company's direction to reach their goals has meant converting traditional workforces to remote employees.

Posted January 18, 2022

The Data Summit conference focuses on the business and technical aspects of Big Data, Data Management, DevOps, AI, Machine Learning, and the ramifications of working in a data-driven environment.

Posted December 15, 2021

Organizations that use MySQL or MariaDB databases for business-critical functions may struggle with direct attached storage (DAS) limitations for these deployments. Typically these platforms can accelerate access to data, increase business agility, and deliver business breakthroughs.

Posted December 15, 2021

The importance of leveraging data quickly and effectively is a message that has come through loud and clear in recent years—and with increasing intensity since the onset of the COVID-19 pandemic. Whether it is anticipating supply chain problems, addressing customer concerns with agility, or identifying new opportunities and pouncing quickly, the ability to achieve a comprehensive view of all available information for real-time decision making has become a strong theme. To help make the process of identifying useful products and services easier, here, DBTA presents a list of Trend-Setting Products for 2022.

Posted December 08, 2021

Databricks, the Data and AI company and a provider of data lakehouse architecture, is acquiring the German startup, 8080 Labs, enabling the company to integrate UI-driven capabilities across Databricks' Lakehouse Platform—marking the company's expansion into the low-code/no-code space.

Posted November 12, 2021

NS1, a provider in application traffic intelligence and automation, is releasing its cloud-managed solution for DNS, DHCP, and IP address management (DDI), delivered through the NS1 Connect platform. NS1 Cloud-Managed DDI enables organizations to deliver core network services across their distributed network footprint with the agility of software-based deployment, the scale of cloud-native operations, and the operating efficiency of SaaS management, according to the company.

Posted November 03, 2021

Immuta's Sumit Sarkar discussed the challenges organizations face with implementing and maintaining cloud data ecosystems, particularly as more cloud data platforms emerge, during his presentation at Data Summit Connect 2021.

Posted November 03, 2021

Provectus, a Silicon Valley artificial intelligence (AI) consultancy, is debuting enhancements to its Open-Source Data Discovery (ODD) and Observability Platform v0.2, upgrading the platform's data discovery and observability capabilities while adding new features for data quality assurance and support of new, third-party service adapters, including Amazon Athena, Amazon SageMaker Feature Store, Feast, and Great Expectations (GE).

Posted October 29, 2021

GridGain Systems, provider of enterprise-grade in-memory computing solutions powered by the Apache Ignite distributed database, is now offering GridGain Nebula, the company's pay-as-you-go, cloud-native in-memory computing service, to all GridGain and Apache Ignite users. Previously, Nebula was available only to the world's largest enterprises.

Posted October 29, 2021

During his presentation at Data Summit Connect 2021, Hackolade founder and CEO Pascal Desmarets explained how polyglot persistence enables organizations to leverage the strengths of multiple data stores, deal with scale more efficiently, and more.

Posted October 28, 2021

Nexla, the unified data operations company, today announced it has secured $12 million in a Series A funding round, enabling the company to continue to deliver on growing enterprise demand for ready-to-use data. Founded in 2016, Nexla streamlines the process of getting ready-to-use data to more applications and more users without having to use tens of different tools.

Posted October 18, 2021

New Relic, the observability company, is releasing New Relic Instant Observability (I/O), an open source ecosystem of quickstarts to empower all software engineers to instrument, dashboard, and alert their entire technology stack. New Relic I/O is introducing an ever-expanding, open ecosystem of knowledge-focused resources that codifies the collective experience of the world's observability experts and practitioners to help engineers around the world unlock the power of their data faster, according to the vendor.

Posted October 18, 2021

Domino Data Lab, provider of an Enterprise MLOps platform, announced it has received $100 million in its latest funding round and is expanding its partnership with NVIDIA to further integrate products and expand joint sales efforts in support of customers building model-driven businesses.

Posted October 12, 2021

Cube Dev, the open-source company behind Cube.js, is releasing Cube Cloud, a hosted version of the company's open source Cube.js analytics API. With Cube Cloud, companies build data applications like metrics dashboards and analytics features that consume data from their cloud data warehouse—without building or hosting any of the complex technologies required to make this possible, according to the vendor.

Posted October 07, 2021

The proliferation of data sources, types, and stores is increasing the challenge of combining data into meaningful, actionable information. As a result, the need for faster and smarter data integration capabilities is growing. In a recent survey, nearly half of DBTA subscribers indicated that real-time insights are critical to their data strategies. At the same time, to deliver actual value, people need information they can trust, so balancing governance is essential nowadays, especially with ongoing regulatory requirements.

Posted October 05, 2021

Zadara, a provider of edge cloud services, is partnering with Zenlayer, enabling the companies to offer managed storage solutions that businesses can deploy from on-premises data centers, private colocation facilities, or the cloud. The addition of Zadara's zStorage enables Zenlayer to provide backup and disaster recovery solutions on a global scale—even at the edge, closer to where data is generated and consumed.

Posted October 01, 2021

Matillion, a leading cloud data integration platform, announced it has secured $150 million in Series E funding, empowering the company to continue to grow its cloud analytics, AI, and machine learning for large global enterprises. The funding round was led by General Atlantic, a global growth equity firm, with participation from Battery Ventures, Sapphire Ventures, Scale Venture Partners, and Lightspeed Venture Partners.

Posted September 23, 2021

Nutanix, a provider of hybrid multi-cloud computing, is adding new capabilities in the Nutanix Cloud Platform that make it easier for customers to simplify data management and optimize database and big data workload performance.

Posted September 23, 2021

The Call for Speakers is now open for Data Summit 2022, which will be held at the Hyatt Regency Boston May 17-18, 2022, with pre-conference workshops on May 16, 2022. The Data Summit conference focuses on the business and technical aspects of Big Data, Data Management, DevOps, AI, Machine Learning, and the ramifications of working in a data-driven environment.

Posted September 09, 2021

Today, organizations need data-driven insights to advance decision making at all levels and digital transformation is a key component of those efforts. Supporting data-driven insights and digital transformation takes an ever-growing range of services, products, and tools from forward-thinking companies that are working to help their customers deliver the right insights to the right people at the right time.

Posted September 08, 2021

Confluent, Inc., the platform to set data in motion, is launching the Confluent Q3 '21 Release, featuring developments that help organizations reliably share data between different environments, seamlessly integrate with business-critical applications, and cost-effectively store data needed for next-generation, digital customer experiences and data-driven backend operations.

Posted August 17, 2021

Surviving and thriving with data science and machine learning means not only having the right platforms, tools and skills, but identifying use cases and implementing processes that can deliver repeatable, scalable business value. The challenges are numerous, from selecting data sets and data platforms, to architecting and optimizing data pipelines, and model training and deployment. As a result, new solutions have emerged to deliver key capabilities in areas including visualization, self-service and real-time analytics. Along with the rise of DataOps, greater collaboration and automation have been identified as key success factors.

Posted August 05, 2021

One of the challenges of working with Hadoop environments has been maintaining the infrastructure for big data projects. That's where cloud makes things easier and, increasingly, has served as the underlying infrastructure platform of choice for Hadoop initiatives. At the same time, not everything has moved to the cloud just yet for big data environments. Many IT managers expect to live in a hybrid environment. They are planning for multi-cloud data management to deliver business value and are also still relying on old-school approaches and manual tools to support their data environments.

Posted August 02, 2021

Airbyte, creators of a fast-growing open-source data integration platform, is releasing an open source data integration for data lakes, enabling AWS users to replicate data from anywhere to their Amazon Simple Storage Service (S3) account. Companies are now able to leverage Airbyte's 75-plus pre-built connectors, or build their own custom connectors within two hours using Airbyte's Connector Development Kit (CDK), in order to replicate their data to S3.

Posted July 09, 2021

We're still at the start of the 2020s, and already, things look very different from the preceding decade. For data executives and professionals, the years ahead may mean change on a scale never seen before in the IT industry. Promising new technologies, as well as redesigned and repurposed older ones, are reshaping the data center and analytics shops in new and exciting ways. We asked industry leaders for their views on what is enhancing the ability of enterprises to compete on data.

Posted June 10, 2021

Founded by the creators of Apache Kylin, venture-backed Kyligence is dual-headquartered in San Jose, California, and Shanghai, China. Luke Han, co-founder and CEO at Kyligence, recently explained the company's connection to the open source project, its future goals, where it fits into the global data management ecosystem, and how it plans to differentiate itself from competitors.

Posted June 08, 2021

The one-size-fits-all RDBMS has given way to an explosion of diverse data management technologies. In a session titled "Next-Generation Databases" at Data Summit Connect 2021, Guy Harrison looked at the history of data management from the mainframe through Hadoop to blockchain, considered the utility of new database technologies for leveraging data assets, and speculated on how these will evolve to meet tomorrow's data needs.

Posted June 02, 2021
