Data Modeling Articles
Brian O'Neill, founder and principal of Designing for Analytics, discussed the key design challenges data product teams face during his presentation at Data Summit Connect 2021.
Posted October 07, 2021
Marvel should have an evil villain named "Null." Nulls have always been trouble in the relational world. Certainly, nulls are used all over the place by virtually everyone. Still, that does not mean that nulls are harmless.
Posted October 05, 2021
The Open Mainframe Project recently announced record growth in contributions with more than 105.31 million lines of code written and more than 9,600 commits submitted by Open Mainframe Project communities so far this year. This is 100% more code than last year with an increased number of active participants in the 20 project and working groups. These numbers will only increase as Open Mainframe continues to be the cornerstone of governance and innovation for modernizing the mainframe and its path to IoT, Cloud and Edge Computing.
Posted October 04, 2021
IBM and Linux Foundation AI and Data (LFAI and Data) have joined together to create a "one-stop shop" for trusted data and AI artifacts in order to reduce duplication across teams when creating assets, as well as mitigate traceability, governance, risk management, lineage tracking, and metadata collection issues.
Posted October 04, 2021
Today's data-driven organizations demand capabilities that adapt to the enterprise and open new paths of innovation to business users. Achieving leadership in today's economy requires identifying and preparing for the emerging technologies and methodologies that deliver transformation.
Posted September 27, 2021
Matillion, a leading cloud data integration platform, announced it has secured $150 million in Series E funding, empowering the company to continue to grow its cloud analytics, AI, and machine learning for large global enterprises. The funding round was led by General Atlantic, a global growth equity firm, with participation from Battery Ventures, Sapphire Ventures, Scale Venture Partners, and Lightspeed Venture Partners.
Posted September 23, 2021
From data ingestion to data modeling and governance, more and more organizations are looking towards automation to accelerate and improve data management and analytics processes. And at a time when data environments continue to expand in size and complexity while IT staff sizes remain relatively flat, it's easy to understand the appeal. Business leaders are hungry for fast, actionable insights. Manual tasks are error prone and time consuming. For many enterprises, the traditional approach to data management is no longer sustainable, particularly when it comes to enabling modern data analytics strategies.
Posted September 23, 2021
Zumasys is offering a video training series on PICK MultiValue Dictionaries. The most recent virtual session features Pete Schellenbach, author of AccuTerm. Additionally, AccuTerm Enterprise is available for Independent Software Vendors (ISVs) or environments with hundreds of users.
Posted September 22, 2021
Domino Data Lab, provider of the Enterprise MLOps platform, is making upgrades to its model monitoring capabilities, allowing companies to place greater trust in the models they deploy. This and other enhancements—including Domino Model Monitor (DMM) support for AWS, GCP, and Azure— are part of Domino 4.6.
Posted September 21, 2021
Document-oriented databases are one of the fastest growing categories of NoSQL databases, and the primary reason is the flexibility of schema or logic design. Document databases make it easier for developers to store and query data in a database by using the same document-model format they use in their application code. The flexible, semi-structured, and hierarchical nature of documents and document databases allows them to evolve with applications' needs.
Posted September 16, 2021
Revelation Software's latest release of OpenInsight, version 10.1, has changes, new functionality, and improved performance throughout the product. DBTA recently held a webinar with Bob Carten, senior developer, Revelation Software and Bryan Shumsky, senior developer, Revelation Software, who discussed some of these improvements, including a new Database Tool panel, a brand new Examples application, new REST functionality, updated help and support documents, and more.
Posted September 16, 2021
Syniti, a provider of enterprise data management, is acquiring 360Science, a proven data quality leader specializing in matching, deduping, unifying, linking, and verifying contact and business data. The acquisition, which encompasses 360Science's technology and the retention of key talent in data matching and linguistics, will strengthen Syniti's expertise in helping customers tackle the complex issues surrounding data.
Posted September 16, 2021
Aerospike is delivering optimizations with its 5.7 release, boosting the real-time data platform to deliver queries on larger, more mission-critical datasets.
Posted September 15, 2021
Seth Earley identified data quality, testing, and web analysis reports as three key sources of product data metrics and described what they track and reveal during his keynote at Data Summit Connect 2021.
Posted September 15, 2021
Teradata, the connected multi-cloud data platform for enterprise analytics, announced that Vantage is now able to operationalize externally created predictive models, also known as model sharing or bring your own model (BYOM). Businesses will now be able to quickly realize a greater return on investment (ROI) in developing analytical models through increased model operationalization, expanded analytic use cases, and a streamlined approach to data-driven decision-making.
Posted September 10, 2021
The Call for Speakers is now open for Data Summit 2022 which will be held at the Hyatt Regency Boston May 17-18, 2022, with pre-conference workshops on May 16, 2022. The Data Summit conference focuses on the business and technical aspects of Big Data, Data Management, DevOps, Data Management, AI, Machine Learning, and the ramifications of working in a data-driven environment.
Posted September 09, 2021
Today, organizations need data-driven insights to advance decision making at all levels and digital transformation is a key component of those efforts. Supporting data-driven insights and digital transformation takes an ever-growing range of services, products, and tools from forward-thinking companies that are working to help their customers deliver the right insights to the right people at the right time.
Posted September 08, 2021
Riverbed is offering more critical cloud visibility and reporting capabilities in its industry-leading end-to-end visibility solutions—adding support of Azure NSG and AWS VPC flow logs.
Posted September 03, 2021
Dynatrace, in partnership with Microsoft, is offering a new integration that provides full application data transparency into applications deployed on Azure Spring Cloud.
Posted September 02, 2021
CircleCI, a leading continuous integration and continuous delivery (CI/CD) platform, is releasing CircleCI webhooks, a feature that provides software engineering teams the ability to build integrations that react to CircleCI job and workflow status notifications.
Posted September 02, 2021
Databricks, the Data and AI company, announced it received $1.6 Billion in a new round of funding to accelerate innovation and adoption of the data lakehouse. Driven by open standards, cloud adoption and the continued rise of machine learning applications, the company intends to build on its lead by investing in innovations that further simplify AI, preserve choice and flexibility across all major public clouds, and establish the lakehouse as a modern replacement to the legacy data warehouse.
Posted September 01, 2021
Oracle has announced the Verrazzano Enterprise Container Platform. It runs on top of Kubernetes—on-premise and in the cloud?and enables users to deploy container applications to any of the Kubernetes clusters where Verrazzano is installed. "Technically speaking, Oracle Verrazzano is a container deployment and management platform that embodies the core requirements and design principles of cloud-native application development," said David Cabelus, a product manager in the OCI Developer Service group, with a focus on OKE add-ons and hybrid cloud, in a recent blog post.
Posted September 01, 2021
BlueFinity is updating its Evoke low-code app development platform with an integrated and adaptable 360-degree imaging virtual tour capability. This is a multi-purpose feature that provides for the searching, browsing and inspection of any store, venue, property, or utility through an integrated feature of an app. According to the company, the results are sophisticated, full-function web and mobile apps that can be fully integrated with any database (Db2, SQL, Oracle, MultiValue, etc.).
Posted September 01, 2021
Northern Light CEO David Seuss enumerated the benefits of building smart taxonomies for organizing and leveraging vast content repositories during his presentation at Data Summit Connect 2021.
Posted August 30, 2021
Actian SVP Emma McGrattan explained how to build a cloud-based data architecture tailored to your specific business needs rather than peak workload during her presentation at Data Summit Connect 2021.
Posted August 26, 2021
Tecton, the enterprise feature store company, is adding low-latency streaming pipelines to its feature store so that organizations can quickly and reliably build real-time machine learning models.
Posted August 11, 2021
TruEra, provider of a suite of AI quality solutions, is releasing TruLens, an open source explainability software tool for machine learning models that are based on neural networks. TruLens is a library for deep neural networks that provides a uniform API for explaining Tensorflow, Pytorch, and Keras models. The software is freely available for download, and comes with documentation and a developer community to further its development and use.
Posted August 10, 2021
Surviving and thriving with data science and machine learning means not only having the right platforms, tools and skills, but identifying use cases and implementing processes that can deliver repeatable, scalable business value. The challenges are numerous, from selecting data sets and data platforms, to architecting and optimizing data pipelines, and model training and deployment. As a result, new solutions have emerged to deliver key capabilities in areas including visualization, self-service and real-time analytics. Along with the rise of DataOps, greater collaboration and automation have been identified as key success factors.
Posted August 05, 2021
Confluent is rolling out an all-new website dedicated to Apache Kafka, event streaming, and associated cloud technologies. The site is called Confluent Developer and compiles all the information users need in one place, from first steps in event streaming right through to more complex topics including: microservice architectures, data pipelines, and company-wide systems for data in motion.
Posted August 05, 2021
Lynda Partner, senior vice president for products and offerings at Pythian, spoke at Data Summit Connect earlier this year about how to accelerate value with machine learning. In this interview with BDQ, Partner continues the conversation and highlights how to select the right use cases for ML, avoid mistakes, and manage an ML project once it becomes a reality.
Posted August 04, 2021
A Distributed SQL database, such as CockroachDB, delivers effortless and elastic cloud scale while guaranteeing transactions. It is a database that reimagines the execution and storage layers while still allowing developers to still use familiar SQL syntax.
Posted August 04, 2021
Treeverse, the creator of lakeFS, the open-source technology that brings streamlined data lifecycle management and version control to data lakes, announced it has raised $23 million in funding to date, further accelerating the development and adoption of the innovative solution.
Posted August 02, 2021
NVIDIA has introduced the North American availability of the NVIDIA Base Command Platform, a hosted AI development hub that provides enterprises with instant access to computing infrastructure wherever their data resides. "As enterprise AI adoption grows, so does demand for faster access to the world-leading infrastructure offered by NVIDIA and our partners," said Manuvir Das, head of Enterprise Computing at NVIDIA.
Posted August 02, 2021
We all know that in uncertain times, a forecast underlies a company's success or failure. Forecasts keep prices low by optimizing business operations—including cash flow, production, staff, and financial management—while increasing knowledge of the market. Business forecasting gives you an essential tool for adapting to change and fostering competitive advantage.But relevant forecasting isn't easy.
Posted August 02, 2021
The meaning and interrelationships of and between data are important. If you are the designer of a database and being lobbied to allow the creation of a table without a primary key, make sure you understand how the table is to be used, and that people will not be writing queries against the structure that will potentially be multiplying data unexpectedly
Posted August 02, 2021
In the last 5 years, we've seen a blurring of the distinction between many of the upstart databases and the traditional SQL databases. NoSQL databases such as MongoDB have added features typically associated with relational databases—transactions, SQL connectors, and the like—while the SQL databases have introduced support for JSON document models. We can see that databases such as PostgreSQL and MongoDB are increasingly converging on a common set of features. However, one category of NoSQL databases seems to be bucking the convergence trend: graph databases.
Posted August 02, 2021
Software intelligence company Dynatrace is extending Smartscape, the Dynatrace platform's real-time and continuously updated topology, to bring Dynatrace's AIOps and analytics capabilities to more open source services, including OpenTelemetry, FluentD, and Prometheus.
Posted July 30, 2021
The Apache Cassandra Project is launching v4.0 of Apache Cassandra, offering more than 1,000 bug fixes, improvements and new features. As a NoSQL database, Apache Cassandra handles massive amounts of data across load-intensive applications with high availability and no single point of failure.
Posted July 28, 2021
Optum SVP Sanji Fernando explains how his organization approaches machine learning model training, evaluation, and retraining in this clip from his presentation at Data Summit Connect 2021.
Posted July 28, 2021
CognitiveScale, the enterprise AI company providing AI-powered digital systems, is releasing Cortex Fabric Version 6—a new, low code developer platform for automation, augmentation, and transformation. Cortex 6 helps enterprises create trustworthy AI applications faster, more affordably, and with business outcomes delivered through KPIs based on insights from data, models, and actions—all with minimal dependencies on underlying infrastructure, according to the vendor.
Posted July 16, 2021
More organizations are realizing the importance of responding to queries in real-time with the right data sets. At the same time, however, there is a constant need to do more with less. That's where InfluxDB Cloud comes in, a leading time series data platform. It's more than a database—InfluxDB Cloud is your all-in-one observability platform including data ingestion tools, dashboarding engine, and real time alerting capability.
Posted July 15, 2021