



Data Warehousing

Hardware and software that support the efficient consolidation of data from multiple sources in a Data Warehouse for Reporting and Analytics include ETL (Extract, Transform, Load), EAI (Enterprise Application Integration), CDC (Change Data Capture), Data Replication, Data Deduplication, Compression, Big Data technologies such as Hadoop and MapReduce, and Data Warehouse Appliances.



Data Warehousing Articles

Business and IT have, in the past, used times of crisis to adapt and transform themselves for the better. While the COVID-19 pandemic has undeniably had a devastating effect on business, and life itself, it may also be a catalyst for change, compelling organizations to rethink their long-term operations and spending amidst the short-term crisis brought on by the health emergency. Industry leaders recently discussed what "the new normal" may look like.

Posted May 27, 2020

Even the most ambitious data analytics initiatives tend to get buried by the 80/20 rule—with data analysts or scientists only able to devote 20% of their time to actual business analysis, while the rest is spent simply finding, cleansing, and organizing data. This is unsustainable, as the pressure to deliver insights in a rapid manner is increasing.

Posted May 21, 2020

DevOps, DataOps, AI, and containers all lead to one important innovation for enterprises seeking to be more data-driven—and that is greater automation. Data-driven enterprises cannot function if data resources and applications are in any way being manually administered, deployed, remediated, or upgraded.

Posted May 18, 2020

Building a star schema in which the fact table contains future-dated metrics and any of the dimensions are Type 2 creates the potential for some interesting anomalies. A Type 2 dimension tracks changes to the data items contained within it. Effectively, each dimension row contains a surrogate key, a natural key with a start and stop date, and additional descriptor columns. If any of the descriptor column values change, the existing dimension row has its stop date populated while a new row is inserted with the same natural key, a new start date, and the new descriptor values.

Posted May 13, 2020
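
The Type 2 dimension behavior described in the star schema item above can be illustrated with a minimal Python sketch. It is not taken from the article: the names `DimRow`, `apply_change`, and `row_as_of`, the sample customer data, and the convention that a stop date is exclusive and equals the next row's start date are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import Dict, List, Optional


@dataclass
class DimRow:
    """One version of a dimension member (hypothetical layout)."""
    surrogate_key: int
    natural_key: str
    start_date: date
    stop_date: Optional[date]  # None marks the current version
    descriptors: Dict[str, str]


def apply_change(rows: List[DimRow], natural_key: str,
                 new_descriptors: Dict[str, str], change_date: date) -> None:
    """Close the current row and insert a new version if descriptors changed."""
    current = next((r for r in rows
                    if r.natural_key == natural_key and r.stop_date is None), None)
    if current is not None and current.descriptors == new_descriptors:
        return  # no descriptor changed, so no new version is needed
    if current is not None:
        current.stop_date = change_date  # populate the stop date on the old row
    next_key = max((r.surrogate_key for r in rows), default=0) + 1
    rows.append(DimRow(next_key, natural_key, change_date, None,
                       dict(new_descriptors)))


def row_as_of(rows: List[DimRow], natural_key: str,
              as_of: date) -> Optional[DimRow]:
    """Return the version in effect on as_of (stop date treated as exclusive)."""
    for r in rows:
        if (r.natural_key == natural_key and r.start_date <= as_of
                and (r.stop_date is None or as_of < r.stop_date)):
            return r
    return None


# Made-up sample data: one customer whose region changes on May 1.
rows: List[DimRow] = []
apply_change(rows, "CUST-1", {"region": "East"}, date(2020, 1, 1))
apply_change(rows, "CUST-1", {"region": "West"}, date(2020, 5, 1))
```

In this sketch, a fact dated after the latest start date resolves to whichever row is current when the fact is loaded. That may be one way a future-dated fact ends up pointing at a dimension version that is later superseded, though the item above does not spell out the specific anomalies it has in mind.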

What more companies need today is a "data lab" to create ideas from data and a "data factory" to turn those ideas into products. Google, Amazon, and other data-driven giants already work like this. So should companies outside of technology. 

Posted May 13, 2020

MemSQL, the No-Limits Database for operational analytics and cloud-native applications, is receiving $50 million in new capital after signing a debt facility agreement with Hercules Capital, the largest non-bank venture debt provider with more than $2.4 billion in total assets, which served as underwriter for the financing.

Posted May 11, 2020

Cloudera, the enterprise data cloud company, is releasing an expanded set of production machine learning capabilities for MLOps, now available in Cloudera Machine Learning (CML). Organizations can manage and secure the ML lifecycle for production machine learning with CML's new MLOps features and Cloudera SDX for models.

Posted May 06, 2020

Oracle Cloud Infrastructure-Government Cloud has achieved FedRAMP High Authorization. "Government customers rely on Oracle Cloud to run their most critical workloads. With FedRAMP High and Impact Level 5 authorizations, we are able to support the highest levels of security standards for unclassified workloads across local, state, and federal government, as well as the Department of Defense," said Scott Twaddle, vice president, Regulated Markets, Oracle Cloud Infrastructure.

Posted May 06, 2020

Yellowbrick Data, providing a data warehouse for hybrid cloud, and Next Pathway Inc., the Automated Cloud Migration company, are collaborating to accelerate the migration of complex on-prem workloads to the Yellowbrick Data hybrid cloud data warehouse.

Posted May 05, 2020

Dremio has announced the free availability of the Dremio AWS Edition, a self-service data lake engine highly optimized for Amazon Web Services (AWS) and available in AWS Marketplace. "Dremio brings interactive BI and data science to Amazon S3, and with our new AWS Edition we are dramatically lowering the cost per query and making data lake insights accessible to data consumers in organizations of any size," said Tomer Shiran, chief product officer, Dremio.

Posted May 05, 2020

VAST Data, a storage company, is releasing Version 3 of its Universal Storage architecture, introducing more than 20 new features, including support for Windows and macOS applications, cloud data replication, and native encryption. These latest Universal Storage updates allow enterprises to marry all-flash performance with archive economics and scale, so that mission-critical and data-intensive enterprise production environments can consolidate their workflows and bring the power of flash storage and fast access to all of their data.

Posted May 01, 2020

Kong Inc., a cloud connectivity company, is releasing a new open source tool called Insomnia Designer, offering a collaborative API design editor. Building on Insomnia Core, which Kong acquired in 2019, the software works natively with Insomnia's testing capabilities to accelerate the development, performance and stability of REST and GraphQL services, the communications backbone of the modern applications and services people rely on each day.

Posted April 30, 2020

Sigma Computing, a provider of cloud-native analytics and business intelligence (A&BI), is extending the power of Sigma to be used throughout the cloud data analytics stack.

Posted April 28, 2020

Swarm64, a provider of database acceleration solutions for the PostgreSQL open source database, is releasing Swarm64 DA 4.0, database acceleration software that extends PostgreSQL with the ability to analyze data orders of magnitude faster than usual.

Posted April 23, 2020

To get a full appreciation for the incredible pace of change in business technology, look at the past 6 years. In 2014, IDC published a report that said that, by 2020, the digital universe would contain nearly as many digital bits as there are stars in the universe, and the data we create and copy annually would reach 44 zettabytes, or 44 trillion gigabytes. Guess what? It's 2020. And it turns out IDC was correct in assuming that we were about to endure a data deluge.

Posted April 22, 2020

Circonus, provider of a machine data intelligence platform, has announced its Spring 2020 release. The release includes a Kubernetes monitoring solution that provides health-based alerting and horizontal pod auto-scaling, cloud monitoring, GCP Marketplace availability, performance improvements, and a more comprehensive Terraform integration. 

Posted April 21, 2020

VAST Data, a storage company, has raised $100 million in Series C funding which will be used to drive global expansion and accelerate the company's next phase of growth.

Posted April 16, 2020

Pepperdata, a provider of Analytics Stack Performance (ASP) solutions, is releasing Streaming Spotlight, a new product in Pepperdata's data analytics performance suite enabling Kafka integration. The suite is purpose-built for IT operations teams, giving them a single, comprehensive view of their analytics stack, both in the cloud and on premises.

Posted April 14, 2020

Even before the IT elements of data optimization begin, aligning organizational culture around a data-driven mindset will be a major challenge. Making the case for data optimization is important.

Posted April 08, 2020

With $3.6 trillion in mergers and acquisitions completed in 2019 alone, M&A activity has been booming. However, a merger or acquisition isn't just a business decision and a business process. It's also a massive undertaking on the IT side, as you figure out how to migrate and integrate business applications and business data.

Posted April 08, 2020

The hype around DevOps and its potential to drive greater ROI across a wide range of enterprise operations increased substantially in the last decade. However, as these expectations carry into 2020, organizations will start to take a more sober approach to DevOps implementations. While DevOps was initially seen as a widespread solution to all sorts of enterprise IT issues, the implementation of DevOps approaches is now shaping up to become more strategic and focused, with much of the emphasis on how to maximize the ultimate return on investment.

Posted April 08, 2020

When it comes to DevOps, developers increasingly recognize databases to be code sets that require ongoing integration and deployment. They are "another code deployment which can and should be managed, tested, automated, and improved with the same robust, reliable methodologies applied to application code," according to the authors of a recent survey of 2,000 developers.

Posted April 08, 2020

Cutting-edge startups are constantly emerging to address new challenges and problems in ways never thought possible. Many of these young, innovative companies have fresh approaches that tap into blockchain, quantum computing, advanced analytics, AI, DevOps methodologies, containerization, and data security advancements. To shine a spotlight on some of the ways innovation in IT is being reflected today, here, DBTA presents 28 companies we think are worth watching in 2020.

Posted April 08, 2020

Neo4j, a provider of graph technology, is launching Neo4j for Graph Data Science, a data science environment built to harness the predictive power of relationships for enterprise deployments. Neo4j for Graph Data Science helps data scientists leverage highly predictive, yet largely underutilized relationships and network structures to answer unwieldy problems.

Posted April 08, 2020

Talend, a provider of cloud data integration and data integrity, is bolstering its partnership with Databricks. With the Winter '20 release of Talend Data Fabric, including Stitch Data Loader for data ingest, Talend now supports Delta Lake. The comprehensive support enables data ingestion into lakehouse environments where data warehouse management features are combined with low-cost storage.

Posted April 08, 2020

Talend is joining the fight against COVID-19 by collaborating with developers from the Singer open source community and Bytecode to create an ETL tool for COVID-19 datasets. Talend standardizes the data, augments it with metadata, then routes the results to a data warehouse or data lake: Amazon Redshift, Amazon S3, Snowflake, Microsoft Azure Synapse Analytics, Delta Lake for Databricks, or Google BigQuery.

Posted April 07, 2020

The White House has announced the launch of the COVID-19 High Performance Computing Consortium to provide COVID-19 researchers worldwide with access to the world's most powerful high performance computing resources that can significantly advance the pace of scientific discovery in the fight to stop the virus. The public-private consortium, spearheaded by the White House, the U.S. Department of Energy, and IBM, includes government, industry, and academic leaders who have volunteered free compute time and resources on their machines.

Posted April 06, 2020

Oracle has announced a new Developer Associate certification for Oracle Cloud Infrastructure. The Developer Associate certification is intended for developers who have 6 months of experience in developing and maintaining applications. With this addition, Oracle now offers five distinct certifications for architects, operators, and developers on Oracle Cloud Infrastructure.

Posted March 26, 2020

LogDNA, a provider of multi-cloud log management solutions, has introduced performance and usability updates that enable developers to more easily query, filter, and gain insight from their log data. "The complexity of developing, deploying, and scaling applications is exponentially more complicated today than even just a few months ago, and the amount of data even small teams deal with on a daily basis is becoming untenable," said Peter Cho, vice president of product management at LogDNA.

Posted March 25, 2020

Pure Storage, a data solutions provider delivering a modern data experience, is releasing its third-generation all-NVMe FlashArray//X, providing customers with higher performance. With Pure Storage's Evergreen Storage model, customers can enjoy access to continuous innovation from Pure Storage that includes these and future updates to its product and solutions suite. 

Posted March 25, 2020

Oracle last week announced strong results for fiscal 2020 Q3. Total revenues were $9.8 billion, up 2% in USD and 3% in constant currency compared to Q3 last year. Cloud services and license support revenues were $6.9 billion, up 4% in USD and 5% in constant currency. "Subscription revenues, made up of cloud services and license support revenues, grew 5% in constant currency. These consistently growing and recurring subscription revenues now account for 71% of total company revenues," said Safra Catz, Oracle CEO.

Posted March 18, 2020

Platform9 is now offering new Freedom and Growth plans for its Platform9 Managed Kubernetes (PMK) Service.

Posted March 17, 2020

Rockset, a real-time database in the cloud, is releasing Query Lambdas, enabling developers to build data applications faster than ever before. As a real-time database in the cloud, Rockset eliminates roadblocks and, with Query Lambdas, allows developers to use their own data as an API to quickly build modern data applications.

Posted March 12, 2020

DH2i, a provider of multi-platform Software Defined Perimeter (SDP) and Smart Availability software, announced that its DxEnterprise for SDP-enhanced Microsoft SQL Server Availability Groups (AGs) is now available for Linux on RHEL and Ubuntu in AWS Marketplace.

Posted March 11, 2020

GS1 US has published a new guideline titled "Applying GS1 Standards for Supply Chain Visibility in Blockchain Applications," an educational resource that can help industry enable supply chain visibility in blockchain implementations by leveraging GS1 Standards. GS1 US, a member of GS1 global, is a not-for-profit information standards organization that facilitates industry collaboration to help improve supply chain visibility and efficiency through the use of GS1 Standards, a widely used supply chain standards system.

Posted March 11, 2020

Syncsort is partnering with Databricks to support cloud initiatives for critical mainframe and IBM i data, enabling enterprises to leverage Syncsort Connect products to access, transform, and deliver mainframe data to Delta Lake.

Posted March 09, 2020

Talend, a provider of cloud data integration and data integrity, is introducing the Winter '20 release of Talend Data Fabric, unveiling the new Talend Cloud Data Inventory. Talend Cloud Data Inventory automatically calculates the Data Intelligence Score of all data across an organization and presents it in a self-service cloud app for every user.

Posted February 27, 2020

Eficode, a DevOps company, is joining the Cloud Native Computing Foundation (CNCF) and is now a Kubernetes Certified Service Provider (KCSP).

Posted February 27, 2020
