



Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.



Hadoop Articles

Matillion, a leading cloud data integration platform, announced it has secured $150 million in Series E funding, empowering the company to continue to grow its cloud analytics, AI, and machine learning for large global enterprises. The funding round was led by General Atlantic, a global growth equity firm, with participation from Battery Ventures, Sapphire Ventures, Scale Venture Partners, and Lightspeed Venture Partners.

Posted September 23, 2021

Nutanix, a provider of hybrid multicloud computing, is providing new capabilities in the Nutanix Cloud Platform that make it easier for customers to simplify data management and optimize database and big data workload performance.

Posted September 23, 2021

The Call for Speakers is now open for Data Summit 2022, which will be held at the Hyatt Regency Boston May 17-18, 2022, with pre-conference workshops on May 16, 2022. The Data Summit conference focuses on the business and technical aspects of Big Data, DevOps, Data Management, AI, Machine Learning, and the ramifications of working in a data-driven environment.

Posted September 09, 2021

Today, organizations need data-driven insights to advance decision making at all levels and digital transformation is a key component of those efforts. Supporting data-driven insights and digital transformation takes an ever-growing range of services, products, and tools from forward-thinking companies that are working to help their customers deliver the right insights to the right people at the right time.

Posted September 08, 2021

Confluent, Inc., the platform to set data in motion, is launching the Confluent Q3 '21 Release, featuring developments that help organizations reliably share data between different environments, seamlessly integrate with business-critical applications, and cost-effectively store data needed for next-generation, digital customer experiences and data-driven backend operations.

Posted August 17, 2021

Surviving and thriving with data science and machine learning means not only having the right platforms, tools and skills, but identifying use cases and implementing processes that can deliver repeatable, scalable business value. The challenges are numerous, from selecting data sets and data platforms, to architecting and optimizing data pipelines, and model training and deployment. As a result, new solutions have emerged to deliver key capabilities in areas including visualization, self-service and real-time analytics. Along with the rise of DataOps, greater collaboration and automation have been identified as key success factors.

Posted August 05, 2021

One of the challenges of working with Hadoop environments has been maintaining the infrastructure for big data projects. That's where cloud makes things easier and, increasingly, has served as the underlying infrastructure platform of choice for Hadoop initiatives. At the same time, not everything has moved to the cloud just yet for big data environments. Many IT managers expect to live in a hybrid environment. They are planning for multi-cloud data management to deliver business value and are also still relying on old-school approaches and manual tools to support their data environments.

Posted August 02, 2021

Airbyte, creator of a fast-growing open source data integration platform, is releasing an open source data integration for data lakes, enabling AWS users to replicate data from anywhere to their Amazon Simple Storage Service (S3) account. Companies are now able to leverage Airbyte's 75-plus pre-built connectors, or build their own custom connectors within two hours using Airbyte's Connector Development Kit (CDK), in order to replicate their data to S3.

Posted July 09, 2021

We're still at the start of the 2020s, and already, things look very different from the preceding decade. For data executives and professionals, the years ahead may mean change on a scale never seen before in the IT industry. Promising new technologies, as well as redesigned and repurposed older ones, are reshaping the data center and analytics shops in new and exciting ways. We asked industry leaders for their views on what is enhancing the ability of enterprises to compete on data.

Posted June 10, 2021

Founded by the creators of Apache Kylin, venture-backed Kyligence is dual-headquartered in San Jose, California, and Shanghai, China. Luke Han, co-founder and CEO at Kyligence, recently explained the company's connection to the open source project, its future goals, where it fits into the global data management ecosystem, and how it plans to differentiate itself from competitors.

Posted June 08, 2021

The one-size-fits-all RDBMS has given way to an explosion of diverse data management technologies. In a session titled "Next-Generation Databases" at Data Summit Connect 2021, Guy Harrison looked at the history of data management from the mainframe through Hadoop to blockchain, considered the utility of new database technologies for leveraging data assets, and speculated on how these will evolve to meet tomorrow's data needs.

Posted June 02, 2021

The move to next-generation databases is driven by their ability to help companies achieve competitiveness and reach customers faster and more efficiently. These new breeds of systems can be a force for business transformation—whether it is generating new sources of revenue, enhancing customer experience, or producing data-driven insights that improve how organizations interact with customers.

Posted May 26, 2021

Rubrik, the Cloud Data Management Company, is introducing major data security features that enable organizations around the world to easily and accurately assess the impact of ransomware attacks and automate recovery operations. Rubrik's data security provides an important line of defense against these common threats and helps IT teams to answer the most pressing questions regarding their business data: What is the content of the data? What is happening to the data? Who is accessing important business information?

Posted May 18, 2021

Data lakes are one of the fastest growing trends in managing big data across various industries. However, the rise of event streaming has created a new technology category for stream processing using frameworks like Apache Flink and Kafka Streams. Nishith Agarwal, engineering manager, Uber, and Sivabalan Narayanan, senior software engineer, Uber, discussed "Apache Hudi: The Streaming Data Lake Platform" during their Data Summit Connect 2021 presentation.

Posted May 12, 2021

As software infrastructure is stretched across on-premises environments, public clouds, and hybrid clouds, keeping software in compliance is a significant challenge. Michael Corey, co-founder, COO, LicenseFortress, and Don Sullivan, product line manager, business critical applications, VMware, discussed current software license trends, the difference between Oracle policy and your contractual obligations, licensing Oracle on a virtualized environment, and licensing best practices during their Data Summit Connect 2021 presentation, "Database Licensing: Best Practices and Pitfalls."

Posted May 12, 2021

Migrating a large-scale Hadoop cluster to the cloud is challenging, especially when the cluster is very active and downtime during the migration is not an option. Tony Velcich, senior director, product marketing, WANdisco, and Ken Seier, chief architect, data and AI, Insight, discussed the challenges involved in such a migration during their Data Summit Connect 2021 presentation, "Considerations for Large Scale Hadoop Data Migration to the Cloud."

Posted May 11, 2021

It's time to cast your vote for the annual Database Trends and Applications Readers' Choice Awards, a competition in which the winning information management solutions, products, and services are selected by you, our readers.

Posted April 29, 2021

From machine learning and automation to hybrid and multi-cloud environments, technology trends continue to reshape the practice of database management. As a result, database professionals face new challenges and opportunities. Today, the average database team is tasked with managing more databases, bigger databases, and a greater variety of databases—from the ground to the cloud.

Posted April 27, 2021

Marvell Technology, Inc., a leader in data infrastructure semiconductor solutions, announced it has completed its acquisition of Inphi Corporation, creating a U.S. semiconductor powerhouse positioned for end-to-end technology leadership in data infrastructure.

Posted April 23, 2021

As the world of data analytics continues to evolve and reshape after a tumultuous 2020, the need for agility is rapidly driving a new era in data culture in which it is imperative to handle data immediately and at scale. While emphasis on self-service data and analytics has been top-of-mind for some time now, the shift to self-sufficiency is held back by culture, not technology. With the new year pushing more robotic process automation at all levels of the business, and for all data users, organizations are becoming more acutely aware that true enablement isn't just about tools and tech. It's about people.

Posted April 05, 2021

The world changed over the last year. Future historians will complete their theses focusing on different quarters or even specific months of 2020. But one of the most overused cliches in thinking about this period of time has been the idea that "the more things change, the more they remain the same." Let's consider sports in 2020. Major League Baseball had a 60-game season, the NBA finals were played in October, and cardboard cutouts took the place of fans in every sport. However, the Lakers won the NBA finals, the Dodgers won the World Series with the Yankees playing deep into the playoffs, and Tom Brady went to his 10th Super Bowl. The more things change …

Posted April 05, 2021

Since its emergence, many companies have made great strides with DevOps, advancing their development processes. However, for DevOps to continue transforming enterprises, it must evolve to address longstanding challenges, take advantage of new opportunities, and spread beyond its comfort zone.

Posted April 05, 2021

Data may be at the heart of all digital engagements, but most enterprises are still behind the curve when it comes to effectively identifying and managing it. That's the takeaway from the latest survey of 419 enterprise executives from BARC, which finds continuing challenges with identifying and surfacing the data assets needed to succeed in today's digital economy.

Posted April 02, 2021

Faster decision making enabled by access to role-appropriate information is the goal of organizations striving to become data-driven. At the same time, there is strong pressure on companies to ensure data quality and trustworthiness, as well as to maintain data security to avoid breaches and risk regulatory non-compliance.

Posted April 02, 2021

Spectra Logic, a leader in data storage and data management solutions, is releasing its annual "Data Storage Outlook" report, which explores how the world manages, accesses, uses, and preserves its ever-expanding data repositories.

Posted April 01, 2021

From hybrid and multicloud, to real-time analytics and AI, a strong data architecture strategy is critical to supporting an organization's goals. Greater speed, flexibility and scalability are common wish-list items, alongside smarter data governance and security capabilities. DBTA recently held a special roundtable webinar with Danny Sandwell, director of product marketing, erwin, Inc.; Paul Lacey, senior director of product marketing, Matillion; and Michael Distler, senior director of product marketing, Qlik, who discussed the top trends in modern data architecture for 2021.

Posted March 26, 2021

Overcoming travel challenges, Data Summit Connect 2021, presented by DBTA and Big Data Quarterly, is a virtual event that will run May 11-12 and include provocative sessions, exhibits, and opportunities to network. In addition, preconference workshops will be held on May 10.

Posted March 22, 2021

Cloudflare, Inc., the security, performance, and reliability company helping to build a better Internet, is introducing Magic WAN with Magic Firewall along with forming new strategic partnerships with major networking and data center providers as part of Cloudflare One, its cloud-based network-as-a-service solution.

Posted March 22, 2021

Precisely, a leader in data integrity, is being acquired by Clearlake Capital Group, L.P. (together with its affiliates, "Clearlake") and TA Associates.

Posted March 22, 2021

Instaclustr, delivering reliability at scale through fully managed open source data technologies, is acquiring credativ, adding a rich collection of open source software and services to Instaclustr's portfolio.

Posted March 17, 2021

Our new industrial era poses a paradox for every manufacturer. The increased revenues driven by high consumer demand often conceal the pressure felt on margins due to rising material costs and constant labor shortages. Consequently, many manufacturers seek supply-chain innovations to optimize their asset utilization, reduce production waste, minimize re-work, and produce reliable lead times. However, none of these efficiencies are possible without a modern data infrastructure.

Posted March 16, 2021

Machine learning is becoming the go-to solution for greater automation and intelligence. A recent study fielded amongst the subscribers of DBTA found that 48% currently have machine learning initiatives underway with another 20% considering adoption. At the same time, most projects are still in the early phases. DBTA recently held a roundtable webinar with Gaurav Deshpande, VP of marketing, TigerGraph; Santiago Giraldo, director of product marketing data engineering and machine learning, Cloudera; and Paige Roberts, open source relations manager, Vertica, who discussed key technologies and strategies for maximizing machine learning's impact.

Posted March 12, 2021

WANdisco, the LiveData company, is forming a partnership with Snowflake, the Data Cloud company, to automate, accelerate, and simplify the migration of on-premises Hadoop analytics workloads to Snowflake's data platform.

Posted March 12, 2021

Navisite announced it is now a Google Cloud Partner, a designation that recognizes the company as an authorized managed services provider on Google Cloud. As a Google Cloud Partner, Navisite not only demonstrates the required knowledge and expertise to successfully migrate customers to Google Cloud but also the commitment and partnership with Google Cloud to help customers maximize business growth, innovation, and profitability.

Posted March 09, 2021

Next Pathway Inc., the Automated Cloud Migration company, is offering enhanced capabilities within the SHIFT Migration Suite and Crawler360, allowing enterprises to automatically migrate from Apache Hadoop to their desired cloud solution.

Posted March 02, 2021

Qlik is offering its first-ever Academic Program Professor Ambassador Class, creating a select network of academics from the Qlik Academic Program that have demonstrated dedication and excellence in leveraging Qlik to drive data literacy with analytics in the classroom. The Professor Ambassador Program is an extension of the Qlik Academic Program, which helps universities improve the value of their offerings by teaching marketable data skills, while helping students advance their analytical and data literacy proficiency within every academic discipline with Qlik.

Posted February 11, 2021

It's time to submit nominations for the annual Database Trends and Applications Readers' Choice Awards Program. The 2021 nominating process has been extended to Wednesday, March 17, so be sure to nominate your favorite products now. Winners will be showcased in a special section on the DBTA website and in the August 2021 edition of Database Trends and Applications magazine.

Posted February 10, 2021

Starburst, the analytics anywhere company, is offering a public beta release of its first-ever fully managed platform, Starburst Galaxy. The cloud-native service empowers users with fast access to all data, and easier ways to manage it, wherever it resides, according to the vendor. Starburst Galaxy will remove the complexity of data movement and copies and accelerate data-driven decision making.

Posted February 09, 2021

CloudBolt Software, the enterprise cloud management leader, is releasing its "Winter" platform update that features enhancements to help enterprises accelerate their hybrid cloud and IT automation journeys.

Posted February 08, 2021

OpenDrives, Inc., a global provider of enterprise-grade, hyper-scalable network-attached-storage (NAS) solutions, announced it has raised up to $20 million in Series B funding, enabling the company to continue growing and accelerate product development.

Posted January 25, 2021

The ability to quickly act on information to solve problems or create value has long been the goal of many businesses. However, it was not until recently, when new technologies emerged, that the speed and scalability requirements of real-time analytics could be addressed both technically and cost-effectively by organizations on a large scale. DBTA held a roundtable webinar with Jamison Shaver, senior director, product management, Swim; Rob Hedgepeth, director, developer evangelist, MariaDB; and Rick Negrin, VP, product management, SingleStore, who discussed the key capabilities for succeeding with real-time analytics today.

Posted January 14, 2021

Cockroach Labs, the company behind CockroachDB, is receiving $160 million in new funding, enabling the company to continue its investments in product development to better support its customers. This latest funding round was led by Altimeter Capital with participation from new investors Greenoaks and Lone Pine, and existing investors Benchmark, BOND, FirstMark, GV, Index Ventures, and Tiger Global. This round brings Cockroach Labs' total funding to $355 million at a valuation of $2 billion.

Posted January 12, 2021

To fit into modern analytics ecosystems, legacy data warehouses must evolve, both architecturally and technologically, to deliver the agility, scalability, and flexibility that businesses need to thrive in today's data-driven economy. DBTA held a webinar with Clive Bearman, director of content, product and marketing strategy, Qlik; David Leichner, CMO, SQream; and Felipe Hoffa, data cloud advocate, Snowflake, who discussed the must-have capabilities for modern data warehousing.

Posted January 11, 2021

Over the past several years, open source technology adoption has steadily increased in the enterprise space. Because of the impact it can have on the business, choosing the right open source technology—specifically a database—is a critical decision that shouldn't be taken lightly.

Posted January 08, 2021

This may be the era of the data-driven enterprise, but only a handful of organizations report they are ready for it. There is a growing volume of "dark data" that remains obscure to IT managers and decision makers. This period unfolding before us will be driven by several technology initiatives, from 5G wireless and IoT to AI.

Posted December 10, 2020

Data management and integration demands continue to increase as organizations are faced with more data flowing in from a greater variety of sources than ever before. At the same time, there is the need to extract business value from, protect, and offer wider access to that data for more users. Today, being data-driven is the goal of all companies, whether long established or born digital.

Posted December 09, 2020
