Newsletters




Data Warehousing

Hardware and software that support the efficient consolidation of data from multiple sources in a Data Warehouse for Reporting and Analytics include ETL (Extract, Transform, Load), EAI (Enterprise Application Integration), CDC (Change Data Capture), Data Replication, Data Deduplication, Compression, Big Data technologies such as Hadoop and MapReduce, and Data Warehouse Appliances.



Data Warehousing Articles

Modern applications have increasingly leveraged Kubernetes as the "OS of the cloud" because of its ability to abstract the underlying cloud platform and coordinate the activities of multiple docker containers. Kubernetes does indeed radically simplify the deployment and administration of multi-service distributed applications. However, it has a significant learning curve, and maintaining a largescale Kubernetes cluster can be daunting.

Posted December 08, 2022

Readers of this column sometimes ask me questions about databases and database administration, which I welcome. And at times I will take the opportunity to answer particularly intriguing questions in print. One intriguing question I have been asked more than once is: "What metrics and measurements are useful for managing how effective your DBA group is?"

Posted December 08, 2022

Often data is categorized into very high-level groupings of structured or unstructured. Generally, structured data is considered data that conforms to an easily identifiable pattern and as part of this conforming, that data may be easily loaded into a relational database table "as is." Examples of this might be fixed-format files, or comma-separated files having an agreed upon pattern to each record within it. Unstructured data supposedly cannot be loaded "as is" into a relational table. Unstructured data is, by name, lacking an identifiable structure to make sense of the data, right? Not exactly.

Posted December 08, 2022

In the ever-shifting markets professionals work in today, companies must have the ability to remain agile and flexible throughout their business operations. To combat the uncertainties of the marketplace and customer demands for fast service, businesses have turned to open-source technology over the past decade. Open-source technologies provide IT and development teams with the agility to implement innovative tools and practices—such as DevOps and Continuous Integration and Continuous Delivery (CI/CD).

Posted December 08, 2022

Cybersecurity and threat detection continues to be top of mind moving into 2023. Data breaches and the capture of sensitive information remain concerns for organizations large and small. Here, data security leaders share their thoughts on what lies ahead as companies seek the best resources to secure data and thwart bad actors.

Posted December 07, 2022

Twelve Labs, the video search and understanding company, announced several product advances along with closing on a $12 million seed extension round, following its initial seed fundraise last spring. To help ensure that speed and quality are not compromised, Twelve Labs has chosen Oracle to provide the AI cloud infrastructure capacity required to bring its foundation AI model to market.

Posted December 07, 2022

Quest Software creates and manage software that makes the benefits of new technology real while empowering users and data, streamlining IT operations, and hardening cybersecurity from the inside out. Looking ahead to a new year several experts at Quest are offering their predictions for 2023.

Posted December 07, 2022

The call for speakers is open for the 10th annual Data Summit conference, to be held in Boston, May 10-11, 2023, with pre-conference workshops on May 9, 2023. The Data Summit conference focuses on the business and technical aspects of Data Management, Analytics, Artificial Intelligence, Machine Learning, Data Architecture, and Emerging Technologies.

Posted December 06, 2022

Airbyte, an open-source integration provider, is unveiling its expanded partnership with dbt Labs, an analytics engineering company, accompanied by a new integration to enable dbt Cloud customers to schedule and initiate dbt jobs from within Airbyte Cloud.

Posted December 06, 2022

ClickHouse, Inc, creator of an online analytical processing (OLAP) database management system, announced the general availability of their newest offering, ClickHouse Cloud. The platform promises a lightning-fast cloud-based database that simplifies and accelerates insights and analytics for modern digital enterprises.

Posted December 06, 2022

Striim, Inc., announced the availability of Striim Cloud on Amazon Web Services (AWS), giving users access to a fully-managed, unified software-as-a-service (SaaS) platform for real-time streaming data integration and analytics from both on-premise or hybrid cloud mission-critical applications.

Posted December 05, 2022

Equalum, provider of data integration and ingestion solutions, is announcing its strategic partnership with Yellowbrick Data, the hybrid multi-cloud data warehouse vendor. By streamlining the data migration process with Equalum's platform, data is moved from legacy environments to the Yellowbrick Data Warehouse—which is hosted in the user's commercial cloud environment of choice.

Posted December 02, 2022

A renewed interest in data-centric AI is driving increased model outcome accuracy and introducing the concept to new applications. Data-centric AI is gaining momentum as engineers working with AI shift their focus from models to data. Whereas engineers previously took a model-centered approach to improve the prediction outcomes and accuracy of a model, current dynamics are causing many to look to the quality of input data to improve outcomes.

Posted December 01, 2022

Global technology solutions company Unisys Corporation and multi-cloud data services company Faction announced they will jointly offer an end-to-end solution for fully-managed data protection, cyber recovery, and business continuity services in both on-premises and multi-cloud environments.

Posted December 01, 2022

Synatic, a provider of data integration and automation, announced it has secured an additional $2.5 million in a seed extension funding round, enabling the company to expand its market reach in the United States in preparation for Series A funding early in 2023.

Posted November 30, 2022

Databricks is releasing MLflow 2.0, building upon MLflow's strong platform foundation and incorporating extensive user feedback to simplify data science workflows and deliver innovative, first-class tools for MLOps. Features and improvements include extensions to MLflow Recipes (formerly MLflow Pipelines) such as AutoML, hyperparameter tuning, and classification support, as well modernized integrations with the ML ecosystem, a streamlined MLflow Tracking UI, a refresh of core APIs across MLflow's platform components, and much more.

Posted November 29, 2022

Cameron O'Rourke, senior director of product strategy at Incorta, and Eldad Chai, CEO of Satori, gathered for a DBTA webinar, "Top Trends in Data Engineering," to discuss key patterns and methods that illuminate what areas of data and analytics need some technological TLC.

Posted November 22, 2022

The provider of data integration and ingestion solutions, Equalum, is debuting its Equalum Competitive Replacement Program, re-envisioning the enterprise's CDC solution for customers of legacy solutions, including HVR, Striim, and StreamSets, both for on-premises and cloud environments.

Posted November 21, 2022

IBM is unveiling the IBM Cloud for VMware as a Service, a direct result of the ongoing, 20-year partnership between IBM and VMware that elaborates upon their workload modernization and time-to-value acceleration ventures.

Posted November 21, 2022

When designing a data center, a recognized set of principles is typically followed. Scalability, resiliency, reliability, and sustainability are all essential, but the most important common feature for data center products may be flexibility. The equipment cabinet should likewise be flexible, but this has not always been the case. There has been a rethinking of the way IT infrastructure, power, and cooling converge—and the data center cabinet is starting to adapt.

Posted November 21, 2022

Talend, a global provider of data integration and data management, it is partnering with Passerelle and Snowflake, the Data Cloud company, to provide new vertical solutions for delivering healthy data to organizations worldwide. Built on Talend Data Fabric and Snowflake's Data Cloud, Passerelle's Data Rocket provides a scalable architecture that delivers governed data ingestion, trusted stewardship, cloud-based storage, and on-demand visual analytics based on the foundation of healthy data. New Data Rocket vertical solutions will be tailored for key vertical markets, beginning with financial services.

Posted November 17, 2022

Fauna, the distributed document-relational database delivered as a cloud API, is announcing two major innovations that enhance its serverless database; Intelligent Routing and Virtual Private Fauna.

Posted November 17, 2022

TigerGraph, provider of an advanced analytics and ML platform for connected data, is unveiling updates to TigerGraph Cloud, the native parallel graph database-as-a-service: TigerGraph Insights and ML Workbench.

Posted November 17, 2022

Hammerspace is redefining unstructured data architectures with the latest performance capabilities to further empower organizations to leverage any server, storage system, and network that best benefits their decentralized workflows.

Posted November 17, 2022

The enterprise that built the serverless, streamlined data analytics platform based on open source database DuckDB, MotherDuck, is announcing a recent funding milestone of $47.5 million. The initial Series A funding, led by Andreessen Horowitz, garnered $35 million, further compounded with a $12.5 million seed round led by Redpoint—resulting in a total valuation of $175 million for the company.

Posted November 17, 2022

Qlik is launching Qlik Cloud Data Integration, its Enterprise Integration Platform as a Service (eiPaaS) offering to fuel enterprise data strategies through a real-time data integration fabric that connects all enterprise applications and data sources to the cloud.

Posted November 17, 2022

Alluxio, the developer of the open source data orchestration platform for data driven workloads such as large-scale analytics and AI/ML, is releasing version 2.9 of its Data Orchestration Platform, delivering support for a scale-out, multi-tenant architecture with a new cross-environment synchronization feature. Additional updates include enhanced manageability with significant improvement in the tooling and guidelines for deploying Alluxio on Kubernetes, and improved security and performance with a strengthened S3 API and POSIX API.

Posted November 16, 2022

Avantra, provider of an AIOps platform for SAP operations automation, is releasing Avantra 23, delivering updates that improve service quality for SAP operations with ready-to-use workflow automation templates, including SAP system refresh, automated SAP security analysis, and much more.

Posted November 16, 2022

Y42, the developers behind the Modern DataOps Cloud, is officially relaunching their product to accommodate critical needs for accessible, governed, and collaborative data operations.

Posted November 14, 2022

Apiiro, a provider of cloud-native application security, announced it has received $100 million in a significant Series B round, allowing the enterprise to accelerate the business and advance the company's mission to empower developers and application security engineers to proactively fix risks before releasing to the cloud. The funding round was led by General Catalyst with participation by Greylock and Kleiner Perkins.

Posted November 11, 2022

Last month we looked at various types of database recovery, how they work, and how DBAs need to prepare for recovery scenarios. This month, let's delve a little deeper into the issues and decisions that DBAs need to be prepared to address as they work on database recovery. The first thing that DBAs need to be aware of is the recovery time objectives, or RTOs, for the database objects in question. In an ideal world, RTOs would have been established for each object and the backup procedures would be in place to establish sufficient time for recovering to those objectives.

Posted November 10, 2022

Snyk, a provider of developer security, is offering its SnykLaunch Fall 2022, providing a number of significant innovations that extend the reach and power of the company's existing Developer Security Platform. This platform update allows more companies to maximize the benefits of DevSecOps and effective collaboration between their developer, operations, and security teams, according to the vendor.

Posted November 10, 2022

IBM is introducing new software designed to help enterprises break down data and analytics silos so they can make data-driven decisions fast and navigate unpredictable disturbances. IBM Business Analytics Enterprise is a suite of business intelligence planning, budgeting, reporting, forecasting, and dashboard capabilities that provides users with a robust view of data sources across their entire business.

Posted November 07, 2022

Cirrus Data Solutions Inc. (CDS) is announcing the availability of Cirrus Migrate Cloud on the Oracle Cloud Marketplace, following its integration with Oracle Cloud Infrastructure (OCI).

Posted November 02, 2022

Is it getting easier or more difficult to lock down data in today's digital enterprises? Industry leaders have mixed opinions on the state of that challenge. Cloud vendors promise industrial-grade security for backend applica­tions and data, while at the same time the move to cloud increases complexity.

Posted November 02, 2022

To reconcile enterprises with the growing demands of modern applications, new technologies and strategies are necessary to successfully support applications in their respective hybrid and multi-cloud environments.

Posted November 01, 2022

Pepperdata, provider of big data cloud and Kubernetes workloads performance products, is unveiling its Autonomous FinOps for Kubernetes (K8s) offering.

Posted November 01, 2022

Hazelcast Inc. is making additions to the platform's capabilities—notably in regards to joining multiple streams of live data, as well as merging them with large volumes of stored data.

Posted November 01, 2022

Red Hat, Inc., a leading provider of open source solutions, is introducing Red Hat Device Edge, a solution for flexibly deploying traditional or containerized workloads on small devices such as robots, IoT gateways, points of sale, public transport, and more. This latest product in the Red Hat edge portfolio aims to provide a future-proof platform that allows organizations' architecture to evolve as their workload strategy changes, according to the company.  

Posted October 31, 2022

Delve into the context and best practices of modern data lakehouse management and architecture, based on DBTA's webinar, "Building a Modern Lakehouse".

Posted October 27, 2022

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49

Sponsors