Follow us on #DataSummit
View the Final Program [PDF]. Or, see the Agenda page for a grid view.
Data Summit 2025 is a unique conference that brings together IT practitioners and business stakeholders from all types of organizations. Featuring workshops, panel discussions, and provocative talks, attendees get a comprehensive educational experience designed to guide them through all of today’s key issues in data management and analysis. Whether you're fascinated by the technical potential and complexities of emerging technologies or focused on leveraging Big Data for business intelligence, analytics, and strategic decision-making, we've got you covered!
Access to all tracks including AI & Machine Learning Summit plus Generative AI Boot Camp and Data Engineer Boot Camp is included when you register for an All-Access Pass or Full Two-Day Conference Pass. Attendees may switch between tracks as they choose. Only interested in the 2-day AI & Machine Learning Summit or our 1-day Boot Camps? Standalone registration for this content is also available.
Tuesday, May 13: 9:00 a.m. - 12:00 p.m. 
 Located in Martha's Vineyard A, Lobby Level
Every organization faces unique challenges in becoming data-driven. This practical half-day session guides attendees through creating a modern data architecture that aligns with your business strategy to deliver ongoing and scalable value. Through our proven four-step methodology, attendees learn to translate business goals into architectural decisions and evaluate emerging technologies from cloud-native platforms, data lakehouses, and data fabrics to build a prioritized road map. Data leaders gain frameworks to assess which modern data stack components best serve their organization's specific needs and capabilities. Most importantly, attendees leave with actionable insights that will motivate them to transform their data infrastructure and drive business outcomes.
John O'Brien, Principal Advisor & Industry Analyst, Radiant Advisors
Tuesday, May 13: 9:00 a.m. - 12:00 p.m. 
 Located in Martha's Vineyard B, Lobby Level
Semantic layers stand out as a key approach to solving business problems for organizations grappling with the complexities of managing and understanding the meaning of their data. A semantic layer, also called context layer, is a business representation of data that allows organizations to quickly map various data definitions from multiple data sources to familiar business terms, offering a consistent and consolidated view of data. Join our workshop to gain insights into the foundations of semantic/context layers, their implementation, and the business value they provide by enhancing the utility of your data. The workshop promises an interactive experience, offering participants the opportunity to both understand the nuances of semantic/context layers and actively engage in constructing one.
Joseph Hilger, COO, Enterprise Knowledge, LLC
Sara Nash, Principal Consultant, Enterprise Knowledge LLC
Tuesday, May 13: 1:00 p.m. - 4:00 p.m. 
 Located in Martha's Vineyard A, Lobby Level
Data teams are deluged with user requests for datasets, metrics, or reports. They need help finding, accessing, validating, or fixing data. For many data leaders, the solution is simple: Empower users to service their own data and analytics needs. But what is easy to say is challenging to do. This workshop provides practical, time-tested approaches to democratizing data and creating an insights-driven culture. It shows how data teams can eliminate data bottlenecks by transforming themselves from order takers to strategic business partners who proactively anticipate business needs. However, the path to self-service nirvana is not for the faint-hearted: It requires developing a deep knowledge of business user needs and then overhauling the team's operating model, data architecture, data governance, data delivery, and support networks to meet those needs.
Wayne Eckerson, President, Eckerson Group
Tuesday, May 13: 1:00 p.m. - 4:00 p.m.
Tuesday, May 13: 1:00 p.m. - 4:00 p.m. 
 Located in Martha's Vineyard B, Lobby Level
Engage in a workshop designed to bridge theory and practice as you learn how to build your first GenAI-driven chatbot with your enterprise data. This session guides you through connecting and unifying disparate data sources to empower real-time, context-aware conversations. Learn how to quickly integrate high-quality, distributed data into your chatbot—ensuring robust governance and security. This educational, step-by-step workshop equips you with the practical skills and tools to prototype a powerful AI chatbot. Whether you’re a seasoned developer or just starting your AI journey, you’ll leave with a tangible solution and the insights to transform data chaos into strategic, conversational excellence.
Kevin Bohan, Director of Product Marketing, Denodo
Romeel Sheth, Data Engineer, Customer Success, Denodo
Wednesday, May 14: 8:00 a.m. - 8:45 a.m.
Wednesday, May 14: 8:45 a.m. - 9:30 a.m. 
 Located in Grand Ballroom B
In the factory-driven Industrial Revolution, we began to view and measure work as a process. Now, in the AI Revolution, we will need to adopt a different model, where we view and measure work as a story. Building on the neuroscience that makes us wired for story patterns, storytelling uses “story” as a communication strategy, while story thinking uses “story” as an operational strategy. The volume, velocity, and variety of data will be connected to processes but also to the organization’s overall narrative intelligence. Lewis discusses the implications of data visualization through the lens of story visualization, which requires understanding human beliefs and commitments, and provides examples for leadership, change, innovation, healthcare, and organizational design.
John Lewis, CKO, Explanation Age LLC and Author, Story Thinking: Transforming Organizations for the Fourth Industrial Revolution
Wednesday, May 14: 9:30 a.m. - 9:45 a.m. 
 Located in Grand Ballroom B
As organizations race to implement AI initiatives, many discover that their fragmented data infrastructure is holding them back. Learn how unifying enterprise data in real time with CrateDB not only simplifies your architecture but also creates the foundation for successful AI deployment. Pullepu explores practical approaches to breaking down data silos and building a unified data foundation that's ready for today's AI demands and tomorrow's innovations.
Shiva Pullepu, Vice President, AI and Industry Solutions, CrateDB
Wednesday, May 14: 9:45 a.m. - 10:00 a.m. 
 Located in Grand Ballroom B
No, GenAI didn't "kill" business intelligence (BI). Rather, it's transforming it drastically. Companies that adapt by incorporating AI capabilities will thrive, while those that remain static will struggle to remain relevant. AI is fundamentally changing how we become data-driven as we move from passive dashboards to proactive, conversational insights. A larger audience can access insights democratizing data for nontechnical users. And changing data infrastructure requires that embedding intelligence into workflows, agents, and applications is a critical shift in where analytics happen. The emerging generative capabilities in machine learning and AI will reshape analytics and the way companies move forward in their journey to become data-driven.
Sami Akbay, VP, Product Management, insightsoftware
Wednesday, May 14: 10:00 a.m. - 10:45 a.m.
Modern Data Strategy Essentials Today is your guide to the key principles data-driven companies are applying to achieve success in our increasingly complex world of data sources, types, applications, requirements, and user expectations. Attend this track to learn how to align technology, people, and processes with the complete data journey and the capabilities that support your current and future needs.
Designed for chief information officers, chief data officers, digital transformation leaders, IT business liaisons, enterprise architects, data architects, data engineers, and data management and analytics professionals.
Wednesday, May 14: 10:45 a.m. - 11:45 a.m. 
 Located in Grand Ballroom B
Data is the cornerstone to becoming insights-driven.
Data observability and reliability engineering are rapidly emerging as foundational pillars in modern data engineering and MLOps. Ensuring that data pipelines are robust, trustworthy, and capable of supporting critical business operations is imperative. Chatterjee explains that data observability goes beyond traditional monitoring by offering a holistic, proactive approach to identifying and resolving issues before they impact downstream analytics or machine learning models. Data reliability leverages observability tools and practices to maintain highest standards of data quality and system uptime, often borrowing principles from Site Reliability Engineering.
Nilanjan Chatterjee, Sr. Staff Data Architect, AMD
Traditional MDM (master data management) falls short in today's AI-driven world. Smith introduces Syncari's Agentic MDM, which transforms MDM into an intelligent, scalable data foundation. Learn how to enable real-time AI integrations, adaptive governance, and data trust to drive business impact.
Jack Smith, Principal Solutions Engineer, Syncari
Wednesday, May 14: 12:00 p.m. - 12:45 p.m. 
 Located in Grand Ballroom B
To excel at maximizing your data resources, you need to have a workable strategy in place.
Cooney provides a set of real-world recommendations for the successful development of enterprise data strategy and practice in your organization. Based on more than 20 years of working with data technologies and with more than 10 years in the cloud and AI, he shares some key observations from that experience that will help you stay grounded in the flux of today's data and AI market. The five secrets will be revealed in this session.
Pete Cooney, Enterprise Lead Data Architect, Jackson National Life
The popularity of terms like DevOps and platform engineering to describe high-functioning technical organizations is trending. Expanding on the notion that databases are core to a DevOps philosophy, Vacca tells how you can embrace this mindset to reduce barriers between the business and the engineers responsible for maintaining and supporting the data layer, then suggests how and why reality sometimes falls short of the best intentions.
Phil Vacca, Practice Leader, PostgreSQL Services, Datavail
Wednesday, May 14: 12:45 p.m. - 2:00 p.m.
Wednesday, May 14: 2:00 p.m. - 2:45 p.m. 
 Located in Grand Ballroom B
Having data is one thing, but having a resilient strategy to manage it in an AI world is a decisive factor for success.
In today’s fast-paced digital landscape, a modern organization’s ability to effectively manage its data is a decisive factor in its success. As organizations strive to harness the power of AI and advanced analytics, maintaining high-quality, reliable data has never been more critical. Vasudevan explains how organizations can build resilient data strategies that improve data quality, visibility, and trust while reducing development time, enhancing decision making, and fostering collaboration.
Bharath Vasudevan, VP of Product, Quest Software
Wednesday, May 14: 2:45 p.m. - 3:15 p.m.
Wednesday, May 14: 3:15 p.m. - 4:00 p.m. 
 Located in Grand Ballroom B
People and the culture of organizations affect how decisions are made.
Data is the fuel for GenAI—but what happens when that fuel is scattered across disconnected systems, buried in legacy applications, or locked behind security constraints? Bohan explores how leading enterprises have broken down these barriers to unlock real-time, context-aware insights for their AI-driven applications. Discover practical strategies that can help you unleash the power of your data—so your AI initiatives deliver real business impact, not just theoretical promise.
Kevin Bohan, Director of Product Marketing, Denodo
This presentation has been cancelled due to unforeseen circumstances.
Suzannah Hicks, AI Program Architect & Strategist, Hummingbird Healthcare
Wednesday, May 14: 4:15 p.m. - 5:00 p.m. 
 Located in Grand Ballroom B
Understanding the people involved in decision making around implementing AI within an organization leads to partnership building.
Wilson Chase, a data science expert with 25-plus years of experience applying critical thinking in deconstructing data, challenges the numbers you trust. We live in a world obsessed with data, but what if the metrics we rely on don’t actually measure what we think they do? Wilson Chase exposes common data delusions—misleading metrics that create false confidence and drive bad decisions. From training completions that don’t measure learning to click rates that don’t predict conversions, she explores real-world examples of flawed measurement. This session helps you walk away with a sharper eye for identifying deceptive metrics and practical strategies to ensure your data tells the truth. Because in the end, good decisions don’t come from more data—they come from the right data.
Chantel Wilson Chase, Director, Quality Analytics and Reporting, Alexion, AstraZeneca Rare Disease Business Unit
Wednesday, May 14: 5:00 p.m. - 6:00 p.m.
What’s Next in Data and Analytics Architecture drills down on shifting trends and emerging best practices that are helping companies achieve more flexible, modular, and distributed data infrastructures to support modernization and innovation. Attend this track to gain a deeper understanding of the new technologies and strategies driving greater speed and scale, and improved governance and security, at organizations hungry for fast, actionable insights.
Designed for chief information officers, chief data officers, enterprise architects, data architects, data engineers, data scientists, and data management and analytics professionals.
Wednesday, May 14: 10:45 a.m. - 11:45 a.m. 
 Located in Dedham
When considering the many aspects of data architecting, a few best practices stand out.
Architectural best practices to build and scale the most effective GenAI and agentic AI applications, optimizing on cost and performance, lead to an unparalleled customer experience. Allowing enterprises to cost-effectively deliver personalized and dynamic online customer care applications that captivate their final end users involves leveraging a serverless approach and orchestrating agentic AI.
Vandana Saini, Senior WorldWide Generative AI Specialist Solutions Architect, AWS
This session delineates best practices for simplifying data processes, improving data quality, and ensuring consistency across reporting platforms. Learn how to standardize data governance frameworks, automate manual tasks, and create a centralized data catalog for better accessibility and control. Discover strategies for effective collaboration between data stewards and business teams to maximize the value of data assets. Whether you're working with SQL databases, Power BI, or other reporting tools, this presentation provides actionable steps to optimize your data management practices, reduce redundancy, and improve data reliability.
Yashasvi Singh, Sr Data Analyst, Navy Federal Credit Union
Wednesday, May 14: 12:00 p.m. - 12:45 p.m. 
 Located in Dedham
This session has been cancelled.
Wednesday, May 14: 12:45 p.m. - 2:00 p.m.
Wednesday, May 14: 2:00 p.m. - 2:45 p.m. 
 Located in Dedham
AI technologies are having an immense impact on organizations, including cloud databases.
Learn about the latest trends in database cloud modernizations, including the cloud-native and vector databases offered by AWS and Azure. Learn essential tips to help you pick the right database for your modern application and analytics needs, including use cases and comparing IaaS and PaaS databases, vector databases, and GenAI.
Michael Agarwal, Director & Practice Lead of Cloud Databases & SRE Services, Datavail
Wednesday, May 14: 2:45 p.m. - 3:15 p.m.
Wednesday, May 14: 3:15 p.m. - 4:00 p.m. 
 Located in Dedham
Agentic AI is front and center in many data areas, including data engineering.
AI is reshaping how data teams work—but most teams still rely on manual processes and tools that do not allow them to leverage AI to its full potential. Knapp explores the concept of agentic data engineering—an AI-native approach in which intelligent agents help build, orchestrate, and maintain pipelines. Attendees get a look under the hood at how these systems work, how they’re already boosting team efficiency, and why it’s time to rethink the way we develop data pipelines.
Sean Knapp, Founder & CEO, Ascend.io
Wednesday, May 14: 4:15 p.m. - 5:00 p.m. 
 Located in Dedham
Data warehouses are an integral part of modern data architectures.
In discussing future-proofing data warehousing, Vayyala considers the impact of data modeling, medallion architecture, and integration. Building an enterprise data warehouse is more than just a technical challenge; it’s a strategic investment in a company’s future. Implementing a robust, scalable, and flexible EDW ensures that an organization has the data it needs to make informed, data-driven decisions.
Rajesh Vayyala, Data Architect
Cusack discusses the role of Kubernetes in enabling elastic data warehousing across public clouds and on-prem environments. Kubernetes provides the portability layer that results in the same cloud-like enterprise data warehouse user experience everywhere, eliminating the proliferation of different tech stacks and reducing ecosystem complexity.
Mark Cusack, CTO, Yellowbrick
Wednesday, May 14: 5:00 p.m. - 6:00 p.m.
By identifying patterns in vast sums of data and creating human-like content at lightning-fast speeds, GenAI applications have emerged as a powerful tool for automating and optimizing a wide variety of tasks. Although adoption is still in the early stages, many organizations are currently testing and deploying GenAI applications in pursuit of greater efficiency and productivity. At the same time, succeeding with GenAI requires overcoming a range of challenges—from legacy infrastructure and skills shortages to governance and security risks, data quality issues, and trust and transparency concerns. Attend this boot camp to dive into the key technologies and emerging best practices.
Designed for chief information officers, chief data officers, data architects, data engineers, data scientists, and AI engineers and developers.
Wednesday, May 14: 10:45 a.m. - 11:45 a.m. 
 Located in Plymouth
Knowledge graphs are key to unlocking the power of retrieval-augmented generation.
AI’s "disillusionment" phase isn’t an AI problem—it’s a data problem, one that knowledge graphs can solve. They guide AI with precision and context, ensuring a clear path toward trustworthy AI. They prevent wrong turns by organizing and linking data in semantically contextual ways and ensure models don’t just process data, but do it accurately, reliably, and contextually with relevance to limit hallucinations. Pal discusses how knowledge graphs help improve data quality, mitigate AI risks, reduce costs, and prepare enterprises to be AI-ready to reap ROAI (Return on AI Investments).
Sumit Pal, Strategic Technology Director, Graphwise.ai
Atanas Kiryakov, President & Founder, Graphwise.ai
Wednesday, May 14: 12:00 p.m. - 12:45 p.m. 
 Located in Plymouth
Supercharging customer experiences is one aspect of GenAI that holds real promise.
Gudla looks at two innovative approaches designed to improve grocery search results by enhancing both relevance and discoverability, with a focus on the development and application of a new product relevance classification model, alongside the strategic integration of LLMs to improve discoverability of novel products. By leveraging the precise categorization capabilities of the ESCI model and the contextual understanding provided by LLMs, Instacart could anticipate and meet consumer needs more effectively. This ultimately led to increased engagement and incremental revenue.
Vinesh Gudla, Staff Machine Learning Engineer, Instacart
Wednesday, May 14: 12:45 p.m. - 2:00 p.m.
Wednesday, May 14: 2:00 p.m. - 2:45 p.m. 
 Located in Plymouth
Important components in gaining trust in GenAI models and implementations involves compute orchestration and RAG.
As AI projects blossom, organizations must balance compute costs and performance. During this presentation, we introduce and explain the concept of compute orchestration, which allows deployment of any model on any environment, using any hardware accelerator. A unified control plane allows you to orchestrate all your AI workloads, optimizing your compute efficiency automatically based on inference.
Alfredo Ramos, Chief Product & Technology Officer, Clarifai
Wednesday, May 14: 2:45 p.m. - 3:15 p.m.
Wednesday, May 14: 3:15 p.m. - 4:00 p.m. 
 Located in Plymouth
The possibilities inherent in introducing GenAI into organizations are exciting but may not address every issue.
GenAI is an exciting and useful technology that is adding value to many enterprise applications. Compelling as it is, GenAI is not always the correct solution for analyzing unstructured data. Sometimes other forms of AI and ML are better-suited to the job. For example, GenAI is great for summarizing the findings of a collection of research documents, but non-generative AI can surface and recommend other documents related to topics of interest. Seuss describes and demonstrates how AI in all its various forms can be combined to analyze unstructured data.
David Seuss, Founder & CEO, Northern Light
Wednesday, May 14: 4:15 p.m. - 5:00 p.m. 
 Located in Plymouth
It's tempting to think that GenAI will sell itself, but making the business case for it is still required.
In the modern business landscape, AI and data strategies can no longer operate in isolation. To drive meaningful outcomes, organizations must align these critical components within a unified framework tied to overarching business objectives. Crolene explores the necessity of integrating AI and data strategies, emphasizing the importance of high-quality data, scalable architectures, and robust governance. He outlines three essential steps: recognizing that AI requires the right data to succeed, prioritizing data quality and architecture, and establishing strong governance practices. He provides specific case examples highlighting the importance of a solid foundation and strategy.
David Crolene, VP, Data Analytics & AI, EXL Service
From Smart Grids to Financial Frauds, preventing disasters is better than reconciliation after things go wrong. Combining streaming data with contextual and adaptive decision intelligence can unlock value from your data before it gets stale—and it gets stale fast, within 10s of milliseconds. Milliseconds matter.
Dheeraj Remella, Chief Product Officer, Volt Active Data
Wednesday, May 14: 5:00 p.m. - 6:00 p.m.
AI and related technologies, such as machine learning, neural networks, and text analytics, have created new and powerful opportunities for businesses. Innovative uses of language models integrated with generative AI hold enormous promise for positive change within enterprises. At the same time, ethical considerations and the widely known tendency of generative AI to fabricate information must be top of mind. The AI & Machine Learning Summit is a 2-day immersion into the possibilities inherent in an AI-driven future, offering the opportunity to harness AI & ML’s transformative potential.
Designed for chief information officers, chief data officers, data scientists, data engineers, enterprise architects, data analytics directors and managers, application developers, and tech-savvy business leaders.
Wednesday, May 14: 10:45 a.m. - 11:45 a.m. 
 Located in Duxbury
Companies collect data at a great rate, but it takes AI to enhance how users interact with and derive value from it.
Sharing insights from his journey at Crunchbase, Gautam outlines how its new AI-powered tools are designed to anticipate user needs. He is a strong advocate of the jobs-to-be-done framework, believes that AI solutions should solve real customer problems, and that staying agile via a flexible AI architecture is essential. Join Gautam as he describes Crunchbase's approach.
Megh Gautam, Chief Product Officer, Crunchbase
DevOps, along with good data, is the foundation for MLOps and building responsible, repeatable AI applications. Karam focuses on data management and compliance for fine-tuning, retrieval-augmented generation (RAG), and best practices for extending existing application logic into generative AI (without turning everything into a chatbot!).
Steve Karam, Principal Product Manager, AI, SaaS, and Growth, Perforce Delphix
Wednesday, May 14: 12:00 p.m. - 12:45 p.m. 
 Located in Duxbury
As AI technologies rapidly advance, it's crucial to prioritize responsible AI development.
Imagine a world where AI systems perpetuate biases, exacerbate inequalities, and undermine trust. Well, you can stop "imagining," because it will happen if we are not careful while developing such systems. AI technology is full of biases, unfairness, and socio-technical problems that can have unexpected results if not properly understood. Join Gupta on a journey to explore the nuances of fairness constraints that can make or break an AI system's integrity. She invites discussion on real-world scenarios of unfair AI and in-depth learning of what fairness constraints are and how to apply them using open source Python libraries.
Parul Gupta, Engineer
Wednesday, May 14: 12:45 p.m. - 2:00 p.m.
Wednesday, May 14: 2:00 p.m. - 2:45 p.m. 
 Located in Duxbury
Geek out over unstructured data, vector databases, LLMs, LangChain, and RAG.
Spann explores how Apache NiFi can be used to integrate open source LLMs to implement scalable and efficient RAG pipelines. He shows how any kind of data including semistructured, structured and unstructured data from a variety of sources and types can be processed, queried, and used to feed large language models for smart, contextually aware answers. Look for his example utilizing Cortex AI, LLAMA, Apache NiFi, Apache Iceberg, Snowflake, open source tools, libraries, and Notebooks.
Timothy Spann, Senior Solutions Engineer, Snowflake
Wednesday, May 14: 2:45 p.m. - 3:15 p.m.
Wednesday, May 14: 3:15 p.m. - 4:00 p.m. 
 Located in Duxbury
The impact generative AI I and agentic AI will have on our personal and business lives is likely to be significant.
The rise of generative and agentic AI has been spectacular, but ethical, practical, and legal issues can arise. Kumar discusses some of these issues and what is being done today to solve these. He presents a framework that can be used today for responsible AI in the Agentic and Generative age along with possible future directions.
Kshitij KK Kumar, CEO and Chief Hat, Data-Hat AI and Former CDO, Haleon (GSK Consumer Health)
Wednesday, May 14: 4:15 p.m. - 5:00 p.m. 
 Located in Duxbury
Assess the maturity of AI governance capabilities and explore the potential impact of future AI regulations.
AI governance remains one of the biggest hurdles to realizing the full potential of AI and ML, even in advanced organizations. While governance frameworks exist, they fail to connect high-level principles with the practical actions needed to manage risk effectively across the AI lifecycle. The result is stalled projects, increased risks, and missed opportunities for impact. Huinker discusses how firms can bridge the governance gap, assess their governance maturity, and build scalable capabilities that ensure trust and compliance, as well as accelerated AI adoption.
Tony Huinker, VP of Field Engineering, Domino Data Lab
Wednesday, May 14: 5:00 p.m. - 6:00 p.m.
Thursday, May 15: 8:00 a.m. - 9:00 a.m.
Thursday, May 15: 9:00 a.m. - 9:45 a.m. 
 Located in Grand Ballroom B
Transforming AI hype into business outcomes is the objective of getting your business AI-ready. Based on Welsch’s AI Leadership Handbook: A Practical Guide to Turning Technology Hype Into Business Outcomes, he draws on more than 60 interviews he conducted with AI leaders and experts to offer strategic insights into AI implementations with a nine-step approach. Gain practical knowledge on fostering innovation, driving human-AI collaboration, and leading AI initiatives. This AI leadership keynote talk covers strategy, leadership, culture, and security, equipping you with tools to boost AI literacy and achieve measurable business success.
Andreas Welsch, Founder & Chief AI Strategist, Intelligence Briefing
Thursday, May 15: 9:45 a.m. - 10:00 a.m. 
 Located in Grand Ballroom B
This presentation explores Incorta's tech stack, showcasing how it revolutionizes analytics across industries. Highlighting successful innovations with AI and machine learning, it demonstrates Incorta's role in enhancing data trust, reducing costs, and optimizing operations. Learn why Incorta is an essential guide for businesses leveraging advanced analytics and AI for strategic advantage.
Ebrahim Alareqi, Principal Machine Learning Engineer, Incorta
Thursday, May 15: 10:00 a.m. - 10:45 a.m.
Emerging Technologies and Trends in Data and Analytics takes you through the most exciting developments reshaping the industry and helping businesses close the data value gap, from the rise of data fabric solutions and edge analytics to the spread of XOps and real-time capabilities. Attend this track to dive into innovative new technologies and practices to meet growing challenges and opportunities.
Designed for chief information officers, chief data officers, digital transformation leaders, enterprise architects, data architects, data engineers, data scientists, and data management and analytics professionals.
Thursday, May 15: 10:45 a.m. - 11:30 a.m. 
 Located in Grand Ballroom B
The many real-world situations that benefit from using a robust, scalable database.
Our Crate DB speaker, in a fast-paced presentation, showcases two real-world use cases. First up is California Wildfire Detection, which uses geospatial analytics to enable instant wildfire monitoring and rapid emergency response, turning raw data into life-saving insights. Next, watch how semantic search transforms static PDFs into interactive conversations, making knowledge more accessible and actionable. Whether you're monitoring critical situations or unlocking hidden insights in unstructured data, learn about how this scalable, high-performance database makes it possible.
Shiva Pullepu, Vice President, AI and Industry Solutions, CrateDB
Thursday, May 15: 11:45 a.m. - 12:30 p.m. 
 Located in Grand Ballroom B
Advanced techniques driving modern recommender systems include graph neural networks to uncover patterns in user-time relationships.
In today’s world, recommender systems shape what we watch, buy, read, and listen to, seamlessly tailoring experiences to match our unique preferences. This talk takes you behind the curtain of these algorithms, focusing on platforms like Netflix, YouTube, Amazon, and Spotify, to uncover how these systems work, from handling billions of datapoints to making personalized recommendations in real time—and how they can become smarter, fairer, and more impactful.
Disha Lamba, Data Scientist
Thursday, May 15: 12:30 p.m. - 1:45 p.m.
Thursday, May 15: 1:45 p.m. - 2:30 p.m. 
 Located in Grand Ballroom B
A data integration hub (DIH) can simplify development of multiple front-end applications, providing auditability, simplicity, low latency, and low infrastructure cost.
Systems of record (SORs) are scattered across large enterprises, each individually fit for a specific purpose. If you want to use that data to digitally transform business, you need to access all your data to drive applications and analytics. A data integration hub (DIH) isn’t another database. It’s an architectural concept that fits in between SORs and front-end applications. Necessary data is provided at real-time speed, and long-term data is reconciled across sources and persisted dependably, regardless of source format. Come to this talk to see some real-world implementations in financial, telecom, transportation, and logistics industries of a DIH. Learn the concepts, tips, tricks, and gotchas.
Paige Roberts, Technical Evangelist
Thursday, May 15: 2:45 p.m. - 3:30 p.m. 
 Located in Grand Ballroom B
You can't be an insights-driven enterprise without good data governance.
Challenging conventional thinking about data management in the age of AI, McGrattan emphasizes that success with GenAI requires more than just large datasets—it demands a strategic approach to data quality, trust, and governance. Strong data governance ensures access control, tracks usage, and aligns with business processes. However, well-meaning data governance initiatives frequently fail. Gain practical guidance from this experienced data management professional.
Emma McGrattan, CTO, Actian
Navigating the Data and Cloud Future explores the growing array of cloud types and services being adopted by enterprises, accompanying opportunities and challenges, and how enterprises are rethinking traditional data management technologies and practices to truly unlock the value of cloud data and analytics in the real world. Attend this track to dive into key solutions and strategies to overcoming hot-button issues, from migration mistakes to licensing and FinOps to performance, security, governance, and integration tips.
Designed for chief information officers, chief data officers, digital transformation leaders, IT managers and directors, enterprise architects, data architects, cloud architects and engineers, and data management professionals.
Thursday, May 15: 10:45 a.m. - 11:30 a.m. 
 Located in Dedham
Find actionable strategies for harnessing Guidewire to build scalable, efficient, and user-friendly applications.
In today’s competitive landscape, data is abundant—but truly actionable insights often remain elusive. To stand out and become an indispensable asset to their organizations, data producers must evolve into strategic business partners. In this highly interactive session, attendees learn practical strategies for deeply understanding user and business needs, effectively bridging the gap between data creation and impactful data consumption. Gain the skills and insights necessary to deliver measurable business value, transform your role from data provider to trusted advisor, and become the irreplaceable data partner your organization needs.
Marianne Kroha, CEO & Founder, Coeur Data
Thursday, May 15: 11:45 a.m. - 12:30 p.m. 
 Located in Dedham
The rise of the proprietary cloud data warehouse helped modernize data warehousing by providing scalability, convenience, and, most importantly, flexibility and openness.
Once data became available in the cloud, it was possible to use it for more use cases, including user-facing analytics, dashboarding, observability, machine learning, and so on. This led to recurrent performance challenges, a degraded user experience, significant runaway costs, and also vendor lock-in. Steinkamp discusses the role open source technologies (open source real-time analytical databases such as Druid, Pinot, and ClickHouse) and open data lake standards (Iceberg, Hudi, Delta Lake) play in transforming the modern data stack and helping organizations move away from a monolithic cloud data warehouse.
Zoe Steinkamp, Senior Developer Advocate, Clickhouse
Thursday, May 15: 12:30 p.m. - 1:45 p.m.
Thursday, May 15: 1:45 p.m. - 2:30 p.m. 
 Located in Dedham
Data security has become a top priority as organizations increasingly migrate their operations to the cloud.
With the exponential growth of data stored and processed in cloud environments, the stakes for securing sensitive information are higher than ever. GenAI is emerging as a transformative force in cloud data security, offering innovative solutions to combat threats such as malware, ransomware, and phishing. However, this revolutionary technology comes with its own set of challenges. The dual-edged nature of GenAI in cloud data security provides unprecedented capabilities to detect and mitigate security threats through advanced pattern recognition and automated threat response but also raises concerns about data privacy, ethical usage, and its potential misuse.
Hardik Ruparel, Software Engineer-3, Nutanix and Founder, EasyReferrals
Thursday, May 15: 2:45 p.m. - 3:30 p.m. 
 Located in Dedham
Data in the cloud has become commonplace but at what cost?
Join this panel discussion as we consider the advantages and drawbacks of placing data in the cloud. Is it, in fact, the most cost-effective solution? What about privacy and confidentiality? What migration issues exist?
As the builders and keepers of the data systems and pipelines that fuel insights, data engineers are expected to wear many hats, and their role continues to grow in importance. With many organizations focused on accelerating their AI and analytics capabilities, there is an enormous demand for secure, trusted, easily accessible data. For data engineers, this means new user requirements, new workloads, and more challenges. Data infrastructure complexity, data silos, governance, and security all top the list. Still, the world of data engineering is evolving fast—from cloud data platforms and tools for ingesting, processing, integrating, and analyzing data to data catalogs, active metadata, and data observability. Attend this boot camp to dive into the latest technologies and strategies for success.
Designed for data engineers and anyone interested in data engineering.
Thursday, May 15: 10:45 a.m. - 11:30 a.m. 
 Located in Plymouth
As data engineering evolves and generative AI gains traction, exploring all possibilities is important.
Data engineering is evolving fast, and GenAI isn’t just a buzzword anymore. It is quietly reshaping how we build, maintain, and think about data systems. Everyone’s talking about ChatGPT, but what if GenAI could do more than write text? What if it could reason through your pipelines, assist with your logic, and even challenge how you build? This isn’t a future vision, it’s already happening. Let's explore what changes when you stop seeing GenAI as just a tool and start using it as a true collaborator.
Kishan Kumar Isayamudhan, Senior Software Engineer, Chime
Thursday, May 15: 11:45 a.m. - 12:30 p.m. 
 Located in Plymouth
Learn how companies can scale their data strategies, fuel advanced workloads, and centralize sensitive information without compromising trust.
Data is the lifeblood of modern enterprises, but with every petabyte collected, the stakes grow higher. Whether it’s customer, patient, or financial data, organizations are under mounting pressure to protect sensitive datasets from exposure while navigating an increasingly complex regulatory landscape. Yet many businesses still rely on outdated approaches that not only stifle innovation but also increase vulnerabilities. Van de Weil unpacks lessons learned from Fivetran’s experience working with global enterprises, sharing actionable insights on bridging cloud and on-prem environments, ensuring airtight data governance, and unleashing the full power of your data—without losing control.
Mark Van de Wiel, Field CTO, Fivetran
Thursday, May 15: 12:30 p.m. - 1:45 p.m.
Thursday, May 15: 1:45 p.m. - 2:30 p.m. 
 Located in Plymouth
AI can streamline your approach to data.
AI is not just transforming data pipelines for applications—it’s also streamlining the process of building these pipelines. AI-assisted tools can automate much of the tedious work traditionally done by data engineers. Join this session to learn about the opportunities to accelerate your data team efficiency and reliability.
Nick Nowlan, AMER Solutions Engineering Leader, Rivery
Thursday, May 15: 2:45 p.m. - 3:30 p.m. 
 Located in Plymouth
Serverless data engineering refers to designing and managing data workflows using mostly cloud computing resources based on certain events.
In a serverless paradigm, developers focus on creating and running data pipelines without managing the underlying server infrastructure. Instead, the cloud provider dynamically allocates resources and handles scaling, availability, and maintenance. Serverless data engineering enables agile, scalable, and cost-effective solutions for modern data workflows. By offloading infrastructure management to cloud providers, organizations can innovate faster and focus more on delivering insights and value from their data.
Jerry Locke, Snowflake Practice Leader, Perficient
AI and related technologies, such as machine learning, neural networks, and text analytics, have created new and powerful opportunities for businesses. Innovative uses of language models integrated with generative AI hold enormous promise for positive change within enterprises. At the same time, ethical considerations and the widely known tendency of generative AI to fabricate information must be top of mind. The AI & Machine Learning Summit is a 2-day immersion into the possibilities inherent in an AI-driven future, offering the opportunity to harness AI & ML’s transformative potential.
Designed for chief information officers, chief data officers, data scientists, data engineers, enterprise architects, data analytics directors and managers, application developers, and tech-savvy business leaders.
Thursday, May 15: 10:45 a.m. - 11:30 a.m. 
 Located in Duxbury
A semantic layer provides GenAI with a programmatic framework to make organizational context, content, and domain knowledge machine readable.
Enterprise AI’s business potential cannot be overstated: By employing standards-based semantic components such as metadata, business glossaries, taxonomy/ontology, and graph solutions, a semantic layer arms organizations with a framework to aggregate and connect siloed data and unstructured content, explicitly provide business context for data, and serve as the layer for explainable GenAI solutions. Tesfaye and Majumder present case studies explaining semantic layer technical architectures and exploring the components that enable enterprise scale data transformation efforts.
Lulit Tesfaye, Partner & VP, Enterprise Knowledge, LLC
Urmi Majumder, Principal Data Architecture Consultant, Enterprise Knowledge, LLC
Thursday, May 15: 11:45 a.m. - 12:30 p.m. 
 Located in Duxbury
LLMs can help create structure in unstructured information, increasing findability.
The goal of search is to quickly and easily find the information we need when we need it. For decades, making search work has meant using techniques such as indexing to impose structure on rapidly growing unstructured data sources. Powerful though that approach is, as internal data sources become ever larger and more diverse, traditional methods of structuring the unstructured are falling short. LLMs provide a new approach to creating structure where no structure exists, dramatically changing the way we approach search. Probstein shows how to use LLMs to sharply improve document retrieval and shares notes from a case study on commercial contracts.
Sid Probstein, CEO, SWIRL
Thursday, May 15: 12:30 p.m. - 1:45 p.m.
Thursday, May 15: 1:45 p.m. - 2:30 p.m. 
 Located in Duxbury
A quick look at building a data project.
Asnani covers every step of the process of building a customer churn prediction pipeline—from data preprocessing and feature engineering to tracking experiments, building ML pipelines, and training high-performing classification models. The entire workflow is managed within MLFlow, allowing developers to build, track, and deploy pipelines seamlessly. It uses the Streamlit interface to show predictions as a real-time visualization of churn predictions. This session offers a practical and approachable way to implement customer churn prediction for both beginners and experienced data practitioners.
Priyanka Asnani, Senior ML Engineer, Fidelity Investments
Thursday, May 15: 2:45 p.m. - 3:30 p.m. 
 Located in Duxbury
AI, robotic process automation (RPA), and machine learning (ML) can transform government operations.
With efficiency, cost, and service enhancements being demanded of the federal government, the adoption of AI, robotic process automation (RPA), and machine learning (ML) is emerging as a great shift. These technologies can foster innovation and alter the processes and roles of various government agencies. AI-driven systems offer new data analysis possibilities that allow agencies to speed up decision making. These technologies are enhancing processes which include claim processing, records management, and a range of other activities contacted by the citizens, thereby improving the speed and quality of delivery of services to citizens.
Hariharan Pappil Kothandapani, AVP, Lead Data Science & Analytics Researcher, Federal Home Loan Bank of Chicago
Thursday, May 15: 3:45 p.m. - 4:15 p.m. 
 Located in Grand Ballroom B
Discover the future of conference engagement with an innovative idea that uses AI to record, transcribe, and build an interactive model around presentation content. Experience a live demo of the AI-powered chatbot used at Data Summit, designed to foster dynamic conversations by asking follow-up questions and providing insightful answers. You can interact with the bot to explore topics, dive deeper into sessions, and learn in a whole new way. This groundbreaking approach extends the value of conversations, making knowledge accessible and engaging.
Brian Pichman, Director, Strategic Innovation, Evolve Project
Thursday, May 15: 4:15 p.m. - 5:00 p.m. 
 Located in Grand Ballroom B
Moving beyond speculation to data, this keynote presents analysis and insights from our comprehensive Q1 2025 market study spanning 200-plus organizations. We examine how companies are actually implementing modern enterprise data architectures to support analytics and AI initiatives, revealing current adoption rates, investment patterns, and expected outcomes. Building on our 2023 study's foundation, which tracked early investments in modern data architectures, we survey the evolution of data platforms by adding vector databases, knowledge graphs, and semantic layers. The session cuts through market hype to present evidence-based results and insights on which architectural patterns—from data fabric to data lakehouse—deliver measurable value and how organizations successfully balance AI innovation with enterprise data management and governance requirements.
John O'Brien, Principal Advisor & Industry Analyst, Radiant Advisors