Newsletters




Data Modeling

Data Modeling and Database Design and Development – including popular approaches such as Agile and Waterfall design - provide the basis for the Visualization, and Management of, Business Data in support of initiatives such a Big Data Analytics, Business Intelligence, Data Governance, Data Security, and other enterprise-wide data-driven objectives.



Data Modeling Articles

Data modeling has always been a task that seems positioned in the middle of a white-water rapids with a paddle but no canoe. On one side of the data modeling rapids are the raging agilists who are demanding working software and decrying virtually all documentation. To this agilists' group, data modeling is often seen as too simple to matter. But at the same time, their implementations will miss standardization in naming or data model patterns. And results may be so far off course that major rework is unavoidable. Sadly, far too many agile practices have been set up to place things under the technical debt umbrella, when in reality those practices never allow the re-factoring closet door to be opened. Poor data models are "overcome" by creating ever more complex logic around the data in order to get to a more proper result, as developers learn what really needs to be accomplished along the way, maybe. The results may work but can be a nightmare to maintain.

Posted July 07, 2022

Kyligence is introducing Kyligence Zen, an intelligent metrics store platform that helps to align business goals and key metrics by automating data pipelines from data lakes or data warehouses to its multidimensional OLAP database to deliver metrics consistency and data trust in a cost-effective way.  

Posted June 29, 2022

Rocket Software, a global technology leader that develops enterprise software for some of the world's largest companies, is releasing the next version of its PRO/JCL, a job control language (JCL) management solution that includes a new interface through VS Code1 extension, Enhanced Directed Execution to include options for sending PROCs and PARMLIBs to remote LPARS, and improved visibility for NODE and LPAR in the primary panel. These updates help mainframe data centers achieve and operate a JCL environment that is error-free, standardized, and optimized.

Posted June 27, 2022

NVIDIA announced it is a founding member of the Linux Foundation's Open Programmable Infrastructure (OPI) project, while making its NVIDIA DOCA networking software APIs widely available to foster innovation in the data center. Businesses are embracing open data centers, which require applications and services that are easily integrated with other solutions for simplified, lower-cost and sustainable management. Moving to open NVIDIA DOCA will help develop and nurture broad and vibrant DPU ecosystems and power unprecedented data center transformation.

Posted June 27, 2022

Continuent is releasing Tungsten Clustering and Tungsten Replicator Versions 7.0.1, introducing 52 new features and improvements, along with critical bug fixes.

Posted June 22, 2022

invenioLSI, a company with decades of experience as an SAP integrator offering cloud solutions, in partnership with RISE with SAP, is now building intelligent business operations for the public sector with advanced solutions like AI and robotics. invenioLSI's focus on diversifying their global AMS and cloud portfolio offers RISE with SAP for customers interested in migrating legacy or ERP applications to SAP cloud solutions.

Posted June 22, 2022

MongoDB unveiled its developer data platform vision with a series of new capabilities at its annual conference held at the Javits Center in New York City. With these announcements, MongoDB aims to help development teams to innovate faster by addressing a wider set of use cases, servicing more of the data lifecycle, optimizing for modern architectures, and implementing more sophisticated levels of data encryption.

Posted June 08, 2022

Data modeling is the process of defining datapoints and struc­tures at a detailed or abstract level to communicate information about the data shape, content, and relationships to target audiences. Data models can be focused on a very specific universe of discourse or an entire enterprise's informational concerns. The final product for a data modeling exercise varies from a list of critical subject areas, an entity-relationship diagram (ERD) with or without details about attributes, or even a data definition language (DDL) script contain­ing all the SQL commands to build a set of physical structures within some chosen database management system (DBMS).

Posted June 02, 2022

Rocket Software is introducing the Rocket MV BASIC for Visual Studio Code 1.6.0 extension, now available on the Visual Studio Code Marketplace. Rocket MV BASIC for VS Code (MVVS) allows BASIC developers to edit, compile, and now debug their BASIC applications in Microsoft Visual Studio Code.

Posted June 02, 2022

PlanetScale, the serverless database provider powered by MySQL and Vitess, is offering a number of new innovations that accelerate delivery of "The Future Database," with new Insights providing granular performance visibility, Portals for multi-region deployment, and Connect enabling expansive analytics platform integrations.

Posted June 02, 2022

Deepnote, an early-stage startup backed by Accel and Index Ventures, is emerging from beta with version 1.0, opening up to the general availability of collaborative data science notebooks to data teams worldwide. Since the company's Series A announcement in Jan 2022, Deepnote has added many features going into the 1.0 launch. Most notably is the addition of Deepnote Workspaces, which empowers data teams to organize and surface data projects, notebooks, and apps in one place.

Posted May 31, 2022

Apollo GraphQL is offering supergraph, a network of a company's data, microservices, and digital capabilities that empowers product and engineering teams to quickly create incredible experiences for their customers. A supergraph acts as a composition layer facilitating collaboration between backend data and services, and frontend apps and devices.

Posted May 26, 2022

TigerGraph, provider of a leading graph analytics platform, is introducing the TigerGraph ML (Machine Learning) Workbench—a powerful toolkit that enables data scientists to significantly improve ML model accuracy, shorten development cycles, and deliver more value to the business.

Posted May 25, 2022

Tamr, a cloud-native data mastering solution, is offering Tamr Enrich, a set of enrichment services built natively into the data mastering process using Tamr's patented human-guided machine learning. Tamr Enrich curates and actively manages external datasets and services, enabling customers to seamlessly embed trusted, high-quality external data insights to their data mastering pipelines for richer business.   

Posted May 24, 2022

Matillion, provider of an enterprise cloud data integration platform, is releasing Matillion Data Loader 2.0, empowering enterprises to simplify data ingestion and accelerate insights with a cloud-native, no-code experience. Matillion Data Loader provides a single unified experience across batch loading and real-time, log-based change data capture (CDC) pipelines, and a consumption-based pricing model to help customers better manage data integration costs.

Posted May 24, 2022

Push Technology, a provider of real-time data streaming and messaging solutions, is releasing Diffusion 6.8, adding new features that include the Diffusion Gateway Framework, expanded data wrangling calculations and conditionals, and journal logging.

Posted May 20, 2022

Imply Data, founded by the original creators of Apache Druid, announced its $100 million Series D financing, which values the company at $1.1 billion. This investment round was led by Thoma Bravo with participation from OMERS Growth Equity, both new investors. Existing investors Bessemer Venture Funds, Andreessen Horowitz, and Khosla Ventures also participated in the financing.

Posted May 19, 2022

A common pattern in data lake and lakehouse design is structuring data into zones, with bronze, silver, and gold being typical labels. Each zone is suitable for different workloads and different consumers. For instance, machine learning algorithms typically process against bronze or silver, while analytic dashboards often query gold. This prompts the question: Which layer is best suited for applying data quality rules and actions? The answer: All of them.

Posted May 18, 2022

As the world becomes increasingly data-driven, AI/ML algorithms are being incorporated in most business applications. Historically, data in AI architectures was moved to a central location to perform both model training and inference. This centralized approach is becoming untenable due to cost, performance, and privacy reasons.

Posted May 18, 2022

Data consumers need data for BI and analytics to make business decisions. But for most organizations, their current data infrastructure isn't keeping up with demand. In a presentation at Data Summit 2022, titled "Building the Open Data Lakehouse," Mark Lyons, senior director, product management, Dremio, explained why more organizations are moving their analytics and BI to an open data lakehouse and how you can build a successful lakehouse strategy.

Posted May 18, 2022

Wednesday's Data Summit 2022 keynotes opened with Laura Sebastian-Coleman, data quality director, Prudential Financial, who discussed "Data Quality Deniers & What We Learn From Them." One of the biggest organizational obstacles to data quality management is basic pessimism about the possibility of managing the quality of data. This is due to lack of clarity—the goals and processes for data quality management have not been defined or have not been understood—and disbelief that the quality of data could be subject to control.

Posted May 18, 2022

At Data Summit 2022, Sudha Viswanathan, staff engineer, Wayfair, presented a talk titled, "Gaining Insights From Clickstream Data." Viswanathan explained that Wayfair's clickstream data refers to data that contains information about customer actions on the Wayfair site, such as what pages were viewed, the products that were clicked, what was added to the cart, which URL brought the customer to Wayfair. This helps Wayfair make data-driven decisions regarding revenue attribution for different marketing channels and improves traffic and test analysis and ad bidding.

Posted May 18, 2022

To turn data into insights and leverage the wealth of information that they are collecting, organizations need to ensure that their data is up-to-date and trustworthy. There is no magic answer. It's a combination of technology and processes. Kevin Campbell, CEO of Syniti, and Phil Fersht, CEO and chief analyst at HFS Research, discussed data value research conducted with Global 2000 C-level executives during their Data Summit 2022 presentation, "Every Problem Is a Data Problem: How Bad Data Is Killing Your Business."

Posted May 17, 2022

"Every company is a data company," Keith Alsheimer, head of marketing, Unravel Data during his Data Summit 2022 presentation. Alsheimer and Chris Santiago, VP solutions engineering, Unravel Data, discussed DataOps and how it can solve big data problems during the presentation. The annual Data Summit conference returned in-person to Boston, May 17-18, 2022, with pre-conference workshops on May 16.

Posted May 17, 2022

At Data Summit 2022, Chris Bergh, CEO and head chef of DataKitchen, shared how to build an internal business case and ways to collect small wins that illustrate the benefits of DataOps at your organization.

Posted May 17, 2022

Data projects that are completed on time, address changing requirements, and deliver value in the real world require a combination of skills and technologies, as well as the right people, according to Marilyn Moise Rousseau, corporate manager database operations, Baptist Health South Florida, who spoke at Data Summit 2022.

Posted May 17, 2022

Around 85% of analytics, big data, and AI projects will fail, despite massive investments of money. It's not new news, but it still reflects on how powerfully design affects speed, scale, and usage. At Data Summit 2022, Brian O'Neill, founder and principal, Designing for Analytics presented his session, "Technically Right, Effectively Wrong: How to Avoid Creating the ML or Analytics Application No Customer Wants to Use."

Posted May 17, 2022

Franz, a supplier of graph database technology for entity-event knowledge graph solutions, has announced AllegroGraph 7.3 with enhanced GraphQL query capabilities for distributed knowledge graphs and enterprise data fabrics. "Now when organizations need to integrate multiple systems from large legacy infrastructures and add new data to deliver a rich AI application—they can do so more quickly and easily using AllegroGraph and GraphQL APIs," said Jans Aasman, CEO of Franz.

Posted May 17, 2022

In order to develop and launch a successful enterprise data and analytics strategy that addresses the pain points of an organization it is necessary to understand the business, including the skills and roles of everyone who works with data, from business executives to business analysts to data scientists, according to Wayne Eckerson, president, Eckerson Group, a consultancy and research firm focused on data and analytics.

Posted May 16, 2022

Machine learning is revolutionizing the process of complex decision-making by enabling the analysis of bigger, more complex datasets and the delivery of faster, more accurate results. At Data Summit 2022, Charna Parkey, VP of product, Kaskada presented "The Basics of Machine Learning" during her workshop session.

Posted May 16, 2022

Knowledge graphs are a valuable tool that organizations can use to manage the vast amounts of data they collect, store, and analyze. At Data Summit 2022, Joseph Hilger, COO, Enterprise Knowledge LLC and Sara Nash, senior consultant, data and information management, Enterprise Knowledge, LLC presented an "Introduction to Knowledge Graphs" during their workshop session.

Posted May 16, 2022

Python has become the default language of solving complex data problems due to its ease of use, plethora of domain-specific software libraries, and stellar community and ecosystem. All of these things have led to the emergence of even more new and easier-to-use frameworks that enable users to scale their Python code.

Posted May 16, 2022

IBM has unveiled a major new release of IBM i—7.5—in addition to a new Technology Refresh (IBM i 7.4 TR6) and a new product called "Merlin," which is short for IBM i Modernization Engine for Lifecycle Integration. The first kinds of tools IBM is introducing with Merlin are tools that help with developing software with the "IBM i Next Gen Apps" approach. "This means that the first release of Merlin focuses on developing code, so it has an integrated development environment (IDE), using a DevOps approach (so it has wizards to help you set up a DevOps environment, or to integrate your IBM i development into an existing DevOps environment), while making it easy to publish and consume function using services. And it does all of this with a browser interface," said Steve Will, the chief architect for IBM i, responsible for strategy and planning related to the OS.

Posted May 16, 2022

The case for increased data automation is clear. "Data teams are spending significant amounts of time on service requests like infrastructure, user provisioning, and incident coordination and communication," said Tina Huang, CTO and founder of Transposit. "Teams today are often manually creating tickets, Slack channels, and Zoom meetings, plus communicating with stakeholders. Data teams must ensure internal customers using data have access to the data they need and real-time updates about interferences with that data." Other tasks ripe for automation include log parsing, correlation, permissions and access, and more.

Posted May 16, 2022

Siren, a provider of Investigative Intelligence analytics, is releasing Siren 12.1, introducing several enhancements and improvements including 360 degrees data visibility, downloadable and editable reports, and data model scalability. The latest iteration of the Siren platform pushes forward what is achievable in the investigative world, launching new capabilities which have been developed in line with rapidly changing investigators requirements to generate insights at machine speed and scale, according to the vendor.

Posted May 09, 2022

The volume, velocity and veracity of today's data deluge has put immense pressure on underlying data platforms and organizations' abilities to manage them effectively. And the pandemic has only exacerbated the problem. According to a 2021 survey, nearly half of digital architects are under high or extremely high pressure to deliver digital projects, but 61% blame legacy technology for making it difficult to complete modernization efforts. That said, databases of all types—SQL, NoSQL, or NewSQL—be they on-prem, cloud, hybrid, or edge, are struggling to navigate this new reality.

Posted May 04, 2022

The value of normalization is in understanding the data well enough to create the normalized design. Pulling out the business rules, business terms, and relationships from the mass of jumbled together raw content is critical. The business rules that result from performing the normalization exercise establish the requirements that need to be satisfied by solutions, whether they are either built or purchased. When an organization creates and maintains a normalized design for the data within the important areas of their business, they reduce work on all future systems.

Posted May 04, 2022

The 9th annual Data Summit conference will be held May 17-18, 2022, at the Hyatt Regency Boston. Pre-conference workshops will take place on May 16, 2022. The program is available for review and a variety of pass options are available to suit individual requirements.

Posted May 04, 2022

Knowledge graphs are driving industry disruption and business transformation by bringing together previously disparate data, using connections for decision support, and adding context to AI applications. DBTA recently held a webinar with Dr. Maya Natarajan, senior director, knowledge graphs, Neo4j and Dr. Jesus Barrasa, senior director, sales engineering, Neo4j who discussed why knowledge graphs are on the rise, how they work, and what the most popular use cases are for enterprises.

Posted May 03, 2022

Airbyte, creators of an open-source data integration platform, is releasing its cloud service for data movement in the U.S. "With Airbyte Cloud, we remove the headache of building and maintaining custom data infrastructure by providing a simple, economical way for enterprises to move data as needed," said Michel Tricot, co-founder and CEO, Airbyte.

Posted April 28, 2022

Ascend.io, the Data Automation Cloud, announced it has received $31 million in Series B funding, enabling the company to scale go-to-market efforts and expand into new geographies, as well as extend Ascend's Data Automation Cloud to support full multi-cloud data mesh automation.

Posted April 28, 2022

Each year, Data Summit features industry-leading experts covering the topics that matter most for data professionals who want to stay on top of the latest technologies and strategies. The conference program is now available for review, and a variety of pass options are being offered, including special pricing for attendees who register early.

Posted April 28, 2022

Digital.ai is unveiling the latest release of its industry-leading AI-Powered DevOps Platform, dubbed Ascension, empowering both private and public sector organizations to unify, secure, and generate predictive insights across the software lifecycle. The Ascension release further enables technology leaders to scale their distributed workforce and connect agile development at the portfolio, team, and individual levels.

Posted April 27, 2022

Rollbar, provider of real-time error monitoring Software as a Service, is offering new and updated software development kits (SDKs) and capabilities. These SDKs keep Rollbar current on both older, but very significant, platforms like .Net, PHP, and Laravel and also the fastest-moving platforms like Apple iOS, React, Typescript, and Flutter.

Posted April 27, 2022

Aerospike Inc. is releasing Aerospike Database 6, a significant new update of the core engine that powers Aerospike Real-time Data Platform, giving developers an environment that supports multiple programming models to build real-time applications with a predictable performance at any scale.

Posted April 27, 2022

SAP is introducing an experience-driven journey to process analytics, correlating experience data from user surveys (whether customer, supplier, or employee) with underlying IT systems to give companies the ability to understand how best to optimize their end-to-end business processes for both operational excellence and customer experience.

Posted April 27, 2022

Oracle has announced that Oracle MySQL HeatWave now supports in-database machine learning (ML) in addition to the previously available transaction processing and analytics. MySQL HeatWave ML fully automates the ML lifecycle and stores all trained models inside the MySQL database, eliminating the need to move data or the model to a machine learning tool or service. Eliminating ETL reduces application complexity, lowers cost, and improves security of both the data and the model. HeatWave ML is included with the MySQL HeatWave database cloud service in all 37 Oracle Cloud Infrastructure (OCI) regions.

Posted April 20, 2022

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30

Sponsors