Conference Program

Data Summit 2019 is a unique conference that brings together IT practitioners and business stakeholders from all types of organizations. Featuring workshops, panel discussions, and provocative talks attendees get a comprehensive educational experience designed to guide them through all of today’s key issues in data management and analysis. Whether your interests lie in the technical possibilities and challenges of new and emerging technologies or using Big Data for business intelligence, analytics, and other business strategies, we have something for you! Access to all tracks plus the Cognitive Computing & AI Summit, Data Lake Boot Camp, and DataOps Boot Camp is included when you register for an All-Access Pass or Full Two-Day Conference Pass. Attendees may switch between tracks as they choose. Only interested in the two-day Summit or our one-day Boot Camps? Stand-alone registration for these events is also available.

Final Program Download the Final Program [PDF]
 

Monday, May 20

Preconference Workshops

 

W1. Machine Learning Best Practices

09:00 AM2019-05-202019-05-20

Monday, May 20: 9:00 a.m. - 12:00 p.m.

Machine learning (ML) is on the rise at businesses hungry for greater automation and intelligence with use cases spreading across industries. At the same time, most projects are still in the early phases. From selecting datasets and data platforms to architecting and optimizing data pipelines, there are many success factors to keep in mind. The advantages that ML offers organizations—the ability to automatically build models that can analyze huge volumes of data and deliver lightning-fast results—have also led to a growth in the availability of both commercial and open source frameworks, libraries and toolkits for engineers. Attend this workshop for a hands-on course in the enabling technologies, techniques, and applications you need to know to succeed in today’s environments.

Speaker:

, Assistant Professor of Business Analytics, Feliciano School of Business, Montclair State University

 

W2. Data Ops 101

09:00 AM2019-05-202019-05-20

Monday, May 20: 9:00 a.m. - 12:00 p.m.

DataOps has emerged as an agile methodology to improve the speed and accuracy of analytics through new data management practices and processes, from data quality and integration to model deployment and management. By leveraging automation, data democratization, and greater collaboration among data scientists, engineers, and other technologists, DataOps can help organizations improve the time-to-value of their data. Attend this workshop to hear about the key supporting technologies, real-world strategies, and success stories and how to get started on your DataOps journey.

Speaker:

, Head of Product, Tamr

 

W3. Data Science Best Practices

01:30 PM2019-05-202019-05-20

Monday, May 20: 1:30 p.m. - 4:30 p.m.

Data science, the ability to sift through massive amounts of data to discover hidden patterns and predict future trends, may be the "sexiest" job of the 21st century, but it requires an understanding of many different elements of data analysis. Extracting actionable knowledge from all your data to make decisions and predictions requires a number of skills, from statistics and programming to data visualization and business domain expertise. Attend this workshop for a deep dive into the fundamentals of data exploration, mining, and preparation, applying the principles of statistical modeling and data visualization in real-world applications.

Speakers:

, Principal Data Consultants, Caserta

, Principal Data Consultants, Caserta

, Caserta

 

W4. Cognitive Computing 101

01:30 PM2019-05-202019-05-20

Monday, May 20: 1:30 p.m. - 4:30 p.m.

A new era of cognitive computing is unfolding, and its impact is already being felt across industries, from preventative maintenance at manufacturing plants and patient diagnosis at hospitals to the rise of sophisticated chatbots ready to assist us across the connected world. The goal of cognitive computing is straightforward: to simulate human thought processes in a computerized model. However, building cognitive systems and applications that can perform specific, humanlike tasks in an intelligent way is far from easy. Attend this workshop to get a full understanding of how cognitive computing works, popular use cases, and best practices IT leaders and practitioners can apply today.

Speakers:

, President, Synthexis and Cognitive Computing Consortium

, Executive Director, Cognitive Computing Consortium

Tuesday, May 21

Keynotes

 

Welcome & Keynote - Big Data, Technological Disruption, and the 800-Pound Gorilla in the Corner

08:45 AM2019-05-212019-05-21

Tuesday, May 21: 8:45 a.m. - 9:30 a.m.

Stonebraker focuses on the current market for Big Data products, specifically those that deal with one or more of “the 3 V’s.” On the one hand, the Volume problem for business intelligence applications is pretty well solved by data warehouse vendors. However, upcoming data science tasks are poorly supported at present. On the other hand, there is rapid technological progress, so we need to stay tuned. In the Velocity arena, recent “new SQL” and stream processing products are doing a good job, albeit with some storm clouds on the horizon. The Variety space has a collection of mature products, along with considerable innovation from startups. He identifies opportunities, particularly those enabled by possible disruption from new technology. And then there’s that 800-pound gorilla in the corner.

Speaker:

, Adjunct Professor, MIT, & Co-Founder/CTO, Tamr

 

Sponsored Keynote presented by Oracle - Information as Competitive Advantage

09:30 AM2019-05-212019-05-21

Tuesday, May 21: 9:30 a.m. - 9:45 a.m.

While organizations have dramatically more data readily available, few are leveraging this data for true competitive advantage. McKinsey found that data-centric companies are driving a 9-fold increase in customer loyalty and an almost 20-fold increase in customer profitability. As data and analytics leaders, you have the opportunity to drive organizational focus on identifying, curating, and leveraging valuable data to support better strategic decision making. Levitt shares information management frameworks and anectotes that highlight the value of thinking outside box, discusses critical success factors, and recommends specific actions to improve your organization's competitive advantage.

Speaker:

, Business Strategist, Oracle

 

Sponsored Keynote presented by Pythian Group Inc. - A Big Data Reality Check: A View from the Front Lines of Data Analytics

09:45 AM2019-05-212019-05-21

Tuesday, May 21: 9:45 a.m. - 10:00 a.m.

Lynda Partner, VP Analytics at Pythian and her team of big data and analytics professionals work to solve the toughest data challenges for their clients. As veterans in the continually-evolving big data space, her team has been helping clients break down their data silos by bringing together data from disparate sources, and enabling use cases from BI to ML. When it comes to big data they’ve seen it all. In this session, you will hear how data professionals like you are dealing with the most challenging issues with data and keeping up with innovations. Learn through real-world examples how you too can be ready to address tomorrow’s opportunities with ever-advancing cloud analytics technologies and emerging practices, — in other words, what’s coming and how can you maneuver today to take advantage.

Speaker:

, VP, Products and Offerings, The Pythian Group

 

Tuesday, May 21

Track A: Moving to a Modern Data Architecture

Moderator:
John O'Brien, Principal Advisor & Industry Analyst, Radiant Advisors
 

A101. Building a Modern Data Architecture

10:45 AM2019-05-212019-05-21

Tuesday, May 21: 10:45 a.m. - 11:45 a.m.

There is a conflict going in the IT industry pitting Big Data against “legacy” data architectures. However, traditional architecture and technologies and newer Big Data approaches each offer advantages.

Designing a Data Architecture for Modern Business Intelligence & Analytics

10:45 a.m. - 11:45 a.m.

Find out what you need to know regarding structured, unstructured, and semi-structured data; hybrid integration and data engineering; and different analytical uses. Learn about the technologies to use for different projects, including relational, columnar, and in-memory. Topics include the must-have underlying foundational concepts for every project; high-level architectural design to pull it all together; the best and pragmatic practices to ensure success; how to avoid deadly data and integration silos; and how to prevent data swamps, data shadow systems, and spreadmarts.

Speaker:

, Managing Partner, Athena IT Solutions

Enabling the Real-Time Enterprise—Data Lakes, Streaming, & the Cloud

10:45 a.m. - 11:45 a.m.

To support a modern data architecture and approach to analytics, data integration strategies now support on-prem, cloud, and hybrid deployments. Meanwhile, streaming architectures featuring change data capture (CDC) technology are rapidly being embraced to process data in motion. This session discusses the new requirements and best practices to be successful in enabling a real-time enterprise, whether in a data lake, via streaming technology, or in the cloud.

Speaker:

, Senior Director of Marketing, Attunity, Inc.

 

A102. The New World of Database Technologies

12:00 PM2019-05-212019-05-21

Tuesday, May 21: 12:00 p.m. - 12:45 p.m.

There are game-changing technologies emerging in data management. But to win at the new world of Big Data, you have to know the changing rules.

Database Trends 2020

12:00 p.m. - 12:45 p.m.

The world of data management and administration is rapidly changing as organizations digitally transform. Mullins examines how database management systems are changing and adapting to modern IT needs. Understanding the trends occurring now and on the horizon is critical to being prepared for the rapidly changing data landscape. This presentation looks at cloud, analytics, NoSQL, IoT, in-memory, and DevOps and examines what is happening with DBAs and their roles within modern organizations. Mullins backs up the trends with references and links where appropriate.

Speaker:

, President & Principal Consultant, Mullins Consulting, Inc. and IBM Gold Consultant

 

A103. Understanding Cloud Licensing

02:00 PM2019-05-212019-05-21

Tuesday, May 21: 2:00 p.m. - 2:45 p.m.

It was hard enough to manage IT infrastructures when everything was on-premise only. But today, with combined on-premise deployments, SaaS, and hybrid cloud scenarios, there is uncertainty about the proper way to license software in these very complex environments.

Straight Talk on the Cloud License Landscape

2:00 p.m. - 2:45 p.m.

Keeping software in compliance is a more significant challenge today than ever before. Sorting through all the FUD (fear, uncertainty, doubt) and getting straight answers from the vendors on the proper way to license software in this complicated world is nearly impossible. Making matters worse is the fact that many software vendors have turned to software license audits as an easy way to generate additional revenues. This session covers current software licensing trends, important lessons learned from the real world, and the steps every organization should take now to avoid becoming a victim of a software license audit whose real purpose is to generate revenue.

Speakers:

, Co-Founder, LicenseFortress

, Product Line Manager, Business Critical Applications, Broadcom (VMware)

 

A104. Overcoming Big Data Integration Challenges

03:15 PM2019-05-212019-05-21

Tuesday, May 21: 3:15 p.m. - 4:00 p.m.

Data is flowing into organizations from a previously unimaginable array of sources and at unprecedented speed and volume. This means that the challenges of cleaning, deduplicating, and integrating data are increasing.

Dismantling Data Silos Through Cloud Integration

3:15 p.m. - 4:00 p.m.

A cloud-native data platform may be the best way for organizations to cost-effectively deliver on the promise of better insights and more intelligent systems through data. Danil Zburivsky covers how a cloud integration approach can lead to better data governance and more accurate analysis and ensure consistency of data across systems, as well as the best practices for cloud data integration and how a cloud data platform breaks down data silos within the organization. The presentation also looks at how one client successfully took its global sales data to the cloud to uncover new opportunities.

Speaker:

, Director of Engineering, Kick Analytics-as-a-Service, The Pythian Group

Diving Under the Hood of Actian Avalanche, a Gen III Cloud Data Warehouse

3:15 p.m. - 4:00 p.m.

From the perspective of an experienced engineering thought leader, Paul Wolmering, VP Worldwide Sales Engineering, Actian Corporation, will deliver a deep dive into Actian’s newly launched Gen III cloud data warehouse. Learn about key considerations for building a fully managed, multi-cloud data warehouse with federated query capabilities, that’s built for hybrid data. Understand key success factors for migrating to columnar analytics to gain actionable insights from an operational data warehouse. Learn what it takes to deliver insights from real-time data economically and at scale with hybrid data regardless of location, in the cloud, on-premises or both.

Speaker:

, VP Worldwide Sales Engineering, Actian Corporation

 

A105. Securing the Internet of Things

04:15 PM2019-05-212019-05-21

Tuesday, May 21: 4:15 p.m. - 5:00 p.m.

There are significant benefits offered by IoT, but also new threats and dangers. Do we really understand the challenges posed by all these connected “things”?

The Dark Side of the Internet of Things

4:15 p.m. - 5:00 p.m.

With the Internet of Things (IoT), essentially everything becomes a computer. This means that everything can be hacked—including cars, home appliances, medical devices, and more. This presentation provides examples of IoT hacks and the consequences of not getting security right as we move forward in the world of smart and connected machines.

Speaker:

, Distinguished Engineer, IT Security Architect, IBM

 

Tuesday, May 21

Track B: Competing on Analytics

Moderator:
Lindy Ryan, Professor & Research Faculty, Montclair State University; Rutgers University
 

B101. Taking Your Analytics to the Next Level

10:45 AM2019-05-212019-05-21

Tuesday, May 21: 10:45 a.m. - 11:45 a.m.

AI and Big Data offer seemingly unlimited potential for organizations to better understand their customers, make more informed decisions, and address challenges with greater agility. It’s important to understand the choices available to achieve the best outcomes.

Applied Analytics: From BI to AI

10:45 a.m. - 11:45 a.m.

The intersection of AI and Big Data provides the ability to deliver more targeted, timely, relevant insight in a pervasive and intuitive manner. However, delivering that simplicity requires an analytics and data ecosystem that is markedly more complicated than 10 years ago. To that end, effectively deploying analytics from BI to AI is a now an exercise in portfolio management—complete with discrete customer segments, diverse data environments, development methods, and a wide spectrum of deployment options. This session puts the diverse—and growing—landscape of analytics capabilities from BI to AI into context.

Speaker:

, Strategic Advisor, SAS

How to Build Data Science Teams that Deliver Business Value

10:45 a.m. - 11:45 a.m.

In spite of the buzz around AI, organizations are struggling to build data science teams that deliver value on the ground. This talk presents the three distinctive phases of growth for data science teams, highlighting potential challenges and suggesting a standard framework of guidelines to successfully navigate this evolution. Vastly different approaches are needed in each stage of maturity to tackle aspects such as strategic direction, project framework, the mix of skills, hiring strategies, and fostering of a data culture.

Speaker:

, Co-Founder & Head of Analytics, Gramener Inc.

 

B102. Data Science Best Practices

12:00 PM2019-05-212019-05-21

Tuesday, May 21: 12:00 p.m. - 12:45 p.m.

Emerging technologies such as AI, IoT, and machine learning are changing what is knowable about customers. At the same time, the frequency of data misuse is leading government entities and individuals to demand higher standards of accountability.

Ethics, Data Ownership, & Privacy in Data Science

12:00 p.m. - 12:45 p.m.

This presentation explores the issues around modernizing security and governance, as well as what it means to deliver transparency and what users actually expect. It also covers the need to manage accountability within systems of multiple decision-makers; why it is necessary to build fairness into the system to overcome bias, discrimination, and enable diversity; and the need to address expectations of privacy and appropriate use of data.

Speaker:

, Business Solutions Manager, SAS Best Practices, SAS Institute

Accelerating Analytics in a New Era of Data

12:00 p.m. - 12:45 p.m.

Due to exponentially growing data stores, organizations today are facing slowdowns and bottlenecks at peak processing times, with queries taking hours or days. Some complex queries simply cannot be executed. Data often requires tedious and time-consuming preparation before queries can be run. This session will demonstrate how the power of GPUs can help conquer these challenges, enabling data professionals to rapidly analyze more data on more dimensions, for previously unobtainable business insights.

Speaker:

, CMO, SQream

 

B103. Analytics in Action

02:00 PM2019-05-212019-05-21

Tuesday, May 21: 2:00 p.m. - 2:45 p.m.

Organizations in all industries are under pressure to take advantage of Big Data and newer data sources for real-time decision making in mission-critical environments. New technologies provide opportunities to gain insight into the future.

Fannie Mae’s Journey to a Data-Driven Organization

2:00 p.m. - 2:45 p.m.

How does an organization evolve from an application-centric to a data-driven enterprise? This presentation covers how Fannie Mae embarked on a major transformation journey to modernize its data infrastructure, transitioning from legacy data platforms to more integrated and scalable architecture to capitalize on the growing opportunities of the analytics economy and generate substantial business value, internally and externally.

Speaker:

, Director, Development, Fannie Mae

Riding the Waves of Big Data Disruption: Machine Learning, Cloud Analytics, IoT, and More

2:00 p.m. - 2:45 p.m.

As Big Data grows and evolves, your enterprise faces both challenges and market-disrupting opportunities to analyze and manage larger data volumes for business value. But with seemingly endless commerical, open source, and "as-a-service" offerings hitting the market each week. How do you choose the right mix of technologies and avoid creating an accidental architecture that will limit you from future innovaation? How are organizations actually achieving true bottom-line benefits from their Big Data initiatives? Learn how to adopt an effective and agile approach to Big Data analytics.

Speaker:

, Open Source Relations Manager, Vertica

 

B104. Delivering Trusted Data

03:15 PM2019-05-212019-05-21

Tuesday, May 21: 3:15 p.m. - 4:00 p.m.

With the vast quantities of data flowing into organizations, the job of cleansing and validating data is only becoming more difficult. In order to gain the kind of insights and outcomes that organizations seek, new processes and technologies must be deployed.

Flipping the 80/20 Rule of Data Prep and Analysis

3:15 p.m. - 4:00 p.m.

The (IRS) Compliance Data Warehouse (CDW) is an analytical data warehouse used for research purposes. It empowers researchers to spend more time on analytics and less on data wrangling. To ensure all data is loaded properly, consistent, well-thought-out validation steps must be included in the ETL process. This presentation offers a case study of accomplishments and lessons learned (since FY 2016), including the data quality issues identified by CDW users (data stewards), and takeaways for attendees on how to improve decision making.

Speaker:

, Senior Operations Research Analyst, IRS-RAAS (Research, Applied Analytics, and Statistics)

 

B105. Everyday Chaos

04:15 PM2019-05-212019-05-21

Tuesday, May 21: 4:15 p.m. - 5:00 p.m.

Best-selling author David Weinberger previews his new book on everyday chaos.

How Machine Learning Is Changing the Future as a Fact and as an Idea

4:15 p.m. - 5:00 p.m.

Ultimately, machine learning’s most important effect may not be in the benefits its use brings, but how it is implicitly transforming our understanding of how the world works and our most basic strategies for dealing with the future. From Newton on through the Computer Age, we have assumed that the universe is ruled by a relative handful of laws that are the same everywhere and that are simple enough for us to understand. But machine learning shows us a world of motes of data in networks so dense with connections and so delicately balanced, we sometimes can’t understand them. This sort of model of the world is changing not only our strategies, but our moral sense, our ideas about meaning, and even what makes humans special.

Speakers:

, Harvard's Berkman Klein Center for Internet & Society and Author, Everyday Chaos, Everything is Miscellaneous, Too Big to Know, Cluetrain Manifesto (co-author)

, Co-founder, Cognitive Computing Consortium

 

Tuesday, May 21

Track C: Data Lake Boot Camp

Moderator:
Julie Langenkamp, Director, Editorial & Content Strategy, Radiant Advisors
 

C101. Building a Data Lake for the Enterprise

10:45 AM2019-05-212019-05-21

Tuesday, May 21: 10:45 a.m. - 11:45 a.m.

A new data platform approach is needed to extend the data warehouse and address the vast quantity and variety of data flowing into organizations, much of it unstructured.

The Data Warehouse Is Dead

10:45 a.m. - 11:45 a.m.

The data warehouse is experiencing pressure from increasing data volumes, more users, and tight budgets—a triple threat to its ongoing existence and value. In addition, new data types are coming into play. This increased pressure means the old-school data warehouse may not be delivering insights at the speed of business. There are a number of alternatives to meet modern analytics infrastructure needs. This presentation outlines in detail why a modern data platform is required to deliver on new analytics demands.

Speaker:

, VP, Products and Offerings, The Pythian Group

Exploiting Enterprise Data for Transformational Projects

10:45 a.m. - 11:45 a.m.

In a data fabric, the data discovery and integration layer maps all enterprise data in its original business context so that users can find and blend data from diverse siloed sources into analytic-ready data sets on an on-demand basis. Join Cambridge Semantics CTO Sean Martin to hear how companies are using data discovery and integration solutions to exploit enterprise data fortransformational analytic and machine learning projects.

Speaker:

, CTO, Cambridge Semantics

 

C102. Data Discovery in Data Lakes

12:00 PM2019-05-212019-05-21

Tuesday, May 21: 12:00 p.m. - 12:45 p.m.

With the abundance of data stored in data lakes, finding the relevant information is increasingly challenging, particularly in light of the many formats in which the data apears.

Data Discovery, Selection, and Provisioning

12:00 p.m. - 12:45 p.m.

With the realization of the power of data lakes, more and more organizational data in various formats and standards are being made available there. Given this plethora of information, it is becoming increasingly daunting for users to search for the data of interest to them with the use of conventional data analytical tools. A combination of Data Discovery tools, making use of semantic search and concept search, brings in the right blend of capability, enabling "comparison shopping" between seemingly similar datasets, and allowing end users to evaluate the best fit while facilitating the discovery and reuse of all available information and data assets, both internal and external.

Speaker:

, Associate Director-Digital Capability Management, R&D Data Lakes and Integrations, Bristol-Myers Squibb

 

C103. PANEL: Data Lakes: Challenges and Opportunities

02:00 PM2019-05-212019-05-21

Tuesday, May 21: 2:00 p.m. - 2:45 p.m.

As part of our deep dive into data lakes, our panel of experts contemplates success factors, failure avoidance, and new developments. Join us for an invigorating discussion.

Speakers:

, Managing Partner, Athena IT Solutions

, CTO, Cambridge Semantics

 

C104. Data Lakes in Action

03:15 PM2019-05-212019-05-21

Tuesday, May 21: 3:15 p.m. - 4:00 p.m.

Data lakes are highly appealing as they provide the capacity to support all types of data and maintain it in its original format for future purposes. Before diving in, it’s important to be aware of the components of a successful data lake implementation.

Uber’s Hadoop Data Ingestion and Dispersal Framework

3:15 p.m. - 4:00 p.m.

Marmaray, Uber’s general-purpose Apache Hadoop data ingestion and dispersal framework and library, was open-sourced in 2018. Marmaray was envisioned, designed, and ultimately released in late 2017 to fulfill the need for a flexible, universal dispersal platform that would complete the Hadoop ecosystem by providing the means to transfer Hadoop data out to any online data store. Before Marmaray, each team was building its own ad hoc dispersal systems, which resulted in duplicated efforts and an inefficient use of engineering resources.

Speaker:

, Software Engineer, Uber

Building a Data Lake in Two Weeks Without Writing Code

3:15 p.m. - 4:00 p.m.

Rafael shares how an ad-tech company moved from a DW to a fully functional data lake that processes over 400,000 events per second without writing one line of code.

Speaker:

, CEO & Co-Founder, UpSolver

 

C105. Frameworks for the Future

04:15 PM2019-05-212019-05-21

Tuesday, May 21: 4:15 p.m. - 5:00 p.m.

New frameworks and platforms are enabling organizations to improve time-consuming processes and meet enterprise requirements for high performance.

Tackling Data Ingestion Challenges at LinkedIn With Apache Gobblin

4:15 p.m. - 5:00 p.m.

Apache Gobblin is a distributed data integration framework for both streaming and batch data ecosystems. This presentation covers how Gobblin powers several data processing pipelines at LinkedIn and use cases such as ingestion of more than 300 billion events for thousands of Kafka topics on a daily basis, metadata and storage management for several petabytes of data on HDFS, and near real-time processing of thousands of enterprise customer jobs. It also looks at the key Gobblin features that help LinkedIn build and run these data pipelines at extreme scale.

Speaker:

, SR Site Reliability Engineer, LinkedIn

 

Tuesday, May 21

Track CS: Cognitive Computing & AI Summit

 

CS101. The Rise of Artificial Intelligence

10:45 AM2019-05-212019-05-21

Tuesday, May 21: 10:45 a.m. - 11:45 a.m.

Artificial intelligence (AI) had the potential to completely revolutionize how we do business and increasingly affects people’s daily lives.

No AI Without IA

10:45 a.m. - 11:45 a.m.

Depending on who you talk to, AI will either enable massive productivity gains from your employees or replace them entirely. Hype aside, AI is coming, and companies need to understand how to harness it. Despite the promise of “plug and play” technology, real AI requires varying degrees of information architecture (IA), knowledge engineering, product and content architecture, and high-quality data sources to be effective. 

Speaker:

, CEO, Earley Information Science and Author, The AI Powered Enterprise

 

CS102. Machine Learning in the Real World

12:00 PM2019-05-212019-05-21

Tuesday, May 21: 12:00 p.m. - 12:45 p.m.

The power of machine learning is particularly evident when used to predict events in the real world.

Machine Learning in the Real World

12:00 p.m. - 12:45 p.m.

Machine learning (ML) has become top of mind for many businesses. Wilde shares his experience and insights about ML in the real world.

 

CS103. AI in Action

02:00 PM2019-05-212019-05-21

Tuesday, May 21: 2:00 p.m. - 2:45 p.m.

The customer experience (CX) is prime territory for employing AI technologies.

AI-Driven CX

2:00 p.m. - 2:45 p.m.

Let’s take a look at enhanced customer experiences for hospitality consumers delivered via virtual concierge services that provide resort guests with personal and contextual hospitality services. Built upon mobile use and location data, bot services help customers with a variety of services, ranging from in-room amenities and dinner reservations to event tickets. Don Spaulding provides an overview of the technical solution and powerful results of this virtualized solution.

Speaker:

, Business Principal, Experience Innovation, Verizon

 

CS104. AI Success Factors

03:15 PM2019-05-212019-05-21

Tuesday, May 21: 3:15 p.m. - 4:00 p.m.

The combination of Big Data with AI technologies creates both challenges and opportunities. Identifying the factors that turn challenges to opportunities leads to success.

AI On the Edge

3:15 p.m. - 4:00 p.m.

With edge computing becoming a thing, AI on-the-edge is quickly following suit. It unlocks a whole new world of possibilities, including predicting customer needs before they even know them. But edge AI seems like it’s only a game for the most cutting-edge companies like Apple, Amazon, or Tesla, to name a few. Traditional enterprises aren’t really embracing it out of fear it may cost too much or due to uncertainty about the potential ROI. To tap into this opportunity, organizations don’t need to choose a risky “all in” approach; a small iterative approach reduces the risk while ensuring your edge AI projects aligns with your overall business strategy. Join this session to learn how to apply a Minimal Viable Prediction (MVP) approach to your next edge AI project.

Speakers:

, Chairman, EastBanc Technologies

, Data Scientist, PhD, EastBanc Technologies

 

CS105. Exploring Machine Learning

04:15 PM2019-05-212019-05-21

Tuesday, May 21: 4:15 p.m. - 5:00 p.m.

Everyone’s talking about machine learning, but we hear much less about how to put it into practice.

Exploring Machine Learning on the Google Cloud Platform

4:15 p.m. - 5:00 p.m.

Only 10 years ago, you needed access to extensive academic and computing resources to make use of machine learning (ML). Fast-forward to today, and we’ve seen revolutionary changes in the hardware and software that are making ML accessible for any developer or data scientist. Whether you’re completely new to ML or you’ve already trained and deployed your own model from scratch, Google Cloud Platform has a variety of tools to help you start using ML right now. Sara Robinson starts with the basics: how to use a pre-trained ML model with one REST API call. Then she explains how to use your own dataset to customize a pre-trained model with transfer learning, how to build your own model from scratch with TensorFlow, and how to train and serve it in the cloud with GCP.

Speaker:

, Developer Advocate, Google

Tuesday, May 21

Evening Events

 

Networking Reception in the Data Solutions Showcase

05:00 PM2019-05-212019-05-21

Tuesday, May 21: 5:00 p.m. - 6:00 p.m.

Wednesday, May 22

Keynotes

 

Data & Donuts Presentation Sponsored by Actian Corporation

08:00 AM2019-05-222019-05-22

Wednesday, May 22: 8:00 a.m. - 8:45 a.m.

Learn about next-generation cloud data warehousing and what it takes to deliver insights from real-time data economically and at scale with hybrid data regardless of location, in the cloud, on-premises or both, while you enjoy Boston's finest deep fried pastries!

Speaker:

, VP Worldwide Sales Engineering, Actian Corporation

 

Opening Keynote - Digital Transformation Is Business Transformation: How to Incorporate AI Technology Into a 130-Year-Old Company

09:00 AM2019-05-222019-05-22

Wednesday, May 22: 9:00 a.m. - 9:45 a.m.

Elsevier is a 130 year old company with a long legacy of providing quality content that serves professionals in their fields. Its digital transforming is aimed at transforming that content into actionable intelligence for users. However, digital transformation is not just a matter of updating legacy systems and workflows, it also requires the capacity to transform the entire business; from pricing models to company culture. In this talk, Deus explores Elsevier’s journey to transform its business so it is able to incorporate advanced technologies like machine learning and AI technology to better serve customers.

Speaker:

, Technology Research Director, Elsevier, Inc.

 

Sponsored Keynote presented by Gemini Data - The Evolution of Big Data Analytics

09:45 AM2019-05-222019-05-22

Wednesday, May 22: 9:45 a.m. - 10:00 a.m.

We find ourselves continuously copying, transforming, and aggregating data into various large scale, complex, proprietary systems for the purpose of Digital Transformation in order to gain some competitive edge. Unfortunately, this process ends up exacerbating the problem by creating more silos, complexity and making the process of extracting knowledge more difficult overall. Deyette reviews where we are, how we got here, and what must come next with regard to leveraging Big Data Analytics.

Speaker:

, Chief Customer Officer, Gemini Data, Inc.

 

Wednesday, May 22

Track A: Building the Data-Driven Future

Moderator:
John O'Brien, Principal Advisor & Industry Analyst, Radiant Advisors
 

A201. Winning With a Modern Data Strategy

10:45 AM2019-05-222019-05-22

Wednesday, May 22: 10:45 a.m. - 11:30 a.m.

Today, there are more opportunities than ever for organizations to achieve new levels of success by capitalizing on data and analytics. Despite this fact, many organizations struggle to build, grow, and reinforce their analytics capabilities.

ADAPT+C: A Framework for Building a Data-Driven Organization

10:45 a.m. - 11:30 a.m.

Understanding that no two organizations are the same, and there are no “one-size-fits-all” solutions, Booz Allen uses a framework known as ADAPT+C to address an organization’s most critical analytics problems with a “best-fit” approach. This presentation covers the six components to ADAPT+C and how this framework can be applied to organizations of all types and sizes to help guide them in their data-driven journeys.

Speaker:

, Director, Booz Allen Hamilton

Quest's Solutions for Database Management and Replication

10:45 a.m. - 11:30 a.m.

The role of a Database Administrator has changed significantly over the past few years. The continued adoption of open source databases, cloud and/or hybrid architectures, and non-relational data sources have added significant complexity to the modern data center, all while organizations strive to become more agile through DevOps. Hall focuses on these trends, how they've impacted DBAs, and best of breed solutions being used by enterprises today.

Speaker:

, Senior Solutions Architect, Quest Software

 

A202. Supporting Modern Applications

11:45 AM2019-05-222019-05-22

Wednesday, May 22: 11:45 a.m. - 12:30 p.m.

Modern data architectures address the need of business agility required in the modern age. However, moving from a traditional data architecture to a modern architecture has many challenges.

Modern Data Platforms & Application Architecture

11:45 a.m. - 12:30 p.m.

This presentation helps to define a modern data architecture and takes an in-depth look at how to build an end-to-end modern data platform from the plethora of choices and tools available, including when and why to use cloud implementations. It also focuses on application architecture and discusses some real-world examples of click stream analysis, customer 360, fraud detection, and scoring, in addition to data platform optimizations for business applications.

Powering the Data-Driven Enterprisse

11:45 a.m. - 12:30 p.m.

We continue to hear that "data is the new oil." How are data driven enterprises capturing, managing, and using this data as a competitive advantage? What challenges do they face with legacy technologies that have not been optimized to handle these data management activities? Learn about these challenges, what the optimal solution would look like, the benefits a true data management platform can provide, and how Actifio can help.

Speaker:

, Director of Product Marketing, Actifio

 

A203. Designing for Speed & Scalability

02:00 PM2019-05-222019-05-22

Wednesday, May 22: 2:00 p.m. - 2:45 p.m.

There are new technologies that contribute to speed and scale of a modern data platform. But as data size and complexity increase with Big Data, data quality and data integration issues must still be addressed.

Designing a Fast, Scalable Data Platform

2:00 p.m. - 2:45 p.m.

Similar to many other mission-critical data management situations, clinical trials are fraught with missteps and data quality issues. This presentation focuses on the architecture of a modern, cloud-based, real-time data integration and analytics platform that ingests any type of clinical data (structured, unstructured, binary, lab values, etc.) at scale from any data sources. Attention is paid to assembling the architectural building blocks of a modern data platform, including the benefits of a “serverless” data pipeline, cloud architecture and deployment, microservices, continuous integration and deployment pipeline, platform scalability, data governance, and master data management.

Speaker:

, VP & Global Chief Data Officer, ERT

 

A204. The Rise of Knowledge Graphs

03:00 PM2019-05-222019-05-22

Wednesday, May 22: 3:00 p.m. - 3:45 p.m.

The ability for knowledge graphs to amass information and relationships and connect facts is showing potential for a range of use cases.

From Structured Text to Knowledge Graphs: Creating RDF Triples From Published Scholarly Data

3:00 p.m. - 3:45 p.m.

For publishers, content is the most important data asset. However, journal articles (and other content) are expressed in formats ideally suited for transmittal and display—not as data for analysis. By leveraging the structure inherent in scholarly published XML content, we can create structured data in RDF for analysis. Further, we can link out to external resources as linked data to further enrich the content-as-data assets and provide a rich analytical environment.

Speaker:

, Information Architect, Factor

 

Wednesday, May 22

Track B: Digital Transformation

Moderator:
Lindy Ryan, Professor & Research Faculty, Montclair State University; Rutgers University
 

B201. Achieving a 360-Degree Customer View

10:45 AM2019-05-222019-05-22

Wednesday, May 22: 10:45 a.m. - 11:30 a.m.

The holy grail of marketers is to attain a 360-degree view of customers in order to increase brand loyalty and understand preferences and purchases for a more personalized approach. Technologies are available to help make that goal a reality.

User Behavior Data Analysis in R & Python

10:45 a.m. - 11:30 a.m.

Brand and marketing teams often have questions about how to segment and classify users based on various attributes. This is used to identify cohorts and consequently run campaigns with strategic plans. This presentation shows how a data analyst can collect relative data and give sense to analysis output.

Speaker:

, Data Analyst/Scientist, Square Enix

CloverDX: Enabling Robust Integration Strategies

10:45 a.m. - 11:30 a.m.

If you take a step back, a 360-degree view begins with a successful integration strategy--a really good one! In this talk, Benedetti shares two different stories, one in B2B and another in B2C, on how CloverDX helped enable customers to achieve a 360-degree view without too much of an investment and left them empowered to bring in more data as they grow in the future.

Speaker:

, Global Solutions Director, CloverDx

 

B202. Digital Transformation in the Real World

11:45 AM2019-05-222019-05-22

Wednesday, May 22: 11:45 a.m. - 12:30 p.m.

The demand to become a data-driven business with a competitive edge in the rapidly changing economy is greater now than ever. With the constant explosion of data within the organizations and across the industry, companies are increasingly moving toward leveraging data for business decisions and analytics.

Chasing Real-time Decision-Making with No-SQL at Scale

11:45 a.m. - 12:30 p.m.

Wayfair, one of the world's leading home furnishing platforms, has undergone immense and rapid growth, analyzing data all along the way. The successful creation of a retail holiday pushed its systems to the limit. "Way Day," as it's affectionately known, was full of highs and lows, and showed the team it needed to transform not only to the cloud but also to a high performance, low latency database. Wayfair lives and breathes on data-driven decision making that impacts the entire customer experience. Data science has been the core driver of Wayfair's success. The Aerospke NoSQL database enables a highly scalable, fault tolerant, and performance driven environment for customer intelligence, product recommendations, and real-time marketing.

Speaker:

, NoSQL Data Architect, Wayfair

 

B203. Tapping Into New Data Sources for Business Value

02:00 PM2019-05-222019-05-22

Wednesday, May 22: 2:00 p.m. - 2:45 p.m.

With the rise of Big Data, IoT, and AI, useful sources of data are emerging and new opportunities are being created.

Mind the Gap: How Location Data Connects Consumers’ Online & Offline Journeys

2:00 p.m. - 2:45 p.m.

Location data is critical to closing the gap between the online and offline world. By leveraging location data, marketers can create impactful moments to influence the behavior of the consumer. This session reveals how location data is the cookie for the real world and how it empowers marketers to run unique campaigns to drive real results in a highly measured way.

Speaker:

, EVP, Strategic Partnerships, GasBuddy

 

B204. Emerging Applications for Blockchain

03:00 PM2019-05-222019-05-22

Wednesday, May 22: 3:00 p.m. - 3:45 p.m.

Blockchain, the distributed ledger technology, is expected to impact a diverse range of industries, from agriculture to accounting to healthcare. From guaranteeing the authenticity of products to safeguarding transactions, blockchain holds wide-ranging potential.

Blockchain-Based Anti-Counterfeit & Identity Solutions

3:00 p.m. - 3:45 p.m.

Blockchain technology has emerged as an innovative and easy-to-adopt approach to improve anti-counterfeit measures in different industries and delivers a significant positive social impact. It offers a transparent environment where it is impossible to duplicate products and there is no need to rely on trust alone. This presentation shows how to use a blockchain with smart contracts to track products at every step of the production and sales process and make this information available to anyone.

Speaker:

, Principal Consultant, Infosys

 

Wednesday, May 22

Track C: DataOps Boot Camp

Moderator:
Julie Langenkamp, Director, Editorial & Content Strategy, Radiant Advisors
 

C201. Succeeding With DataOps Today

10:45 AM2019-05-222019-05-22

Wednesday, May 22: 10:45 a.m. - 11:30 a.m.

DataOps is an emerging set of practices, processes, and technologies for building and enhancing data and analytics pipelines to better meet the needs of the business.

Five Key Requirements for DataOps Success

10:45 a.m. - 11:30 a.m.

Join this presentation to explore the five key steps necessary to be successful with DataOps, including the process and cultural shift required. The discussion also covers the benefits of enabling DataOps’ success, such as improved productivity, streamlined and automated processes, increased output, and higher collaboration across teams. Learn how to better manage data flow across the data lifecycle—from ingestion to provisioning to analytics; derive tips from use cases involving data lakes, cloud, and data warehousing for better business insights; and increase collaboration, productivity, and business value.

Speaker:

, Senior Director of Marketing, Attunity, Inc.

How to Succeed with DataOps Today

10:45 a.m. - 11:30 a.m.

The list of failed Big Data projects is long. They leave end users, data analysts, and data scientists frustrated with long lead times for changes. Bergh illustrates how to make changes to Big Data, models, and visualizations quickly, with high quality, using the tools teams love. Synthesizing techniques from DevOps, Demming, and direct experience shows you how to succeed with DataOps today.

Speaker:

, CEO, Head Chef, DataKitchen

 

C202. The Rise of Containers

11:45 AM2019-05-222019-05-22

Wednesday, May 22: 11:45 a.m. - 12:30 p.m.

The use of containers—which enable applications, data, dependencies, and runtimes to be housed within a portable environment to support greater flexibility—is becoming more widespread.

Understanding Database Containerization

11:45 a.m. - 12:30 p.m.

Container usage is now being adopted by organizations of all sizes, from small startups to companies with huge, established microservices platforms. This presentation is aimed at helping practitioners navigate the minefield of database containerization and avoid some of the major pitfalls that can occur. It covers considerations such as container configuration and homogeneous versus heterogeneous node types; data resilience, resources, and storage; cluster upscale, downscale, and upgrade; and data locality and networking.

Speakers:

, Director, Platform Strategy & Innovation, InterSystems

, Product Specialist, InterSystems

 

C203. Operationalizing Big Data Workloads

02:00 PM2019-05-222019-05-22

Wednesday, May 22: 2:00 p.m. - 2:45 p.m.

Transformational Analytic Outcomes Through DataOps

2:00 p.m. - 2:45 p.m.

We’re at a unique point in time when fundamental changes in data management in the enterprise, the emergence of new technologies, and the volume and value of enterprise data combine into a massive opportunity to create next-generation data engineering pathways. Marinelli highlights the converging factors that affect why DataOps is gaining traction in the marketplace today, how it’s different from earlier generations of data curation, and what the key components are to implementing DataOps.

Speaker:

, Head of Product, Tamr

 

C204. Unlocking the Power of Data Wrangling

03:00 PM2019-05-222019-05-22

Wednesday, May 22: 3:00 p.m. - 3:45 p.m.

Data wrangling skills and capabilities are critical and a true business imperative. Organizations that are unable to derive true business insights from data will be left behind in the ever-changing and fast-moving digital economy.

AI for Good: AI, Bad Guys, & Messy Data

3:00 p.m. - 3:45 p.m.

All the information that is needed to find and stop bad actors from entering our financial system already exists and is available to you today; it’s just buried in terabits of messy, unstructured data all over the internet. For those performing investigations and evaluating risk, this needle in a haystack-of-needles problem is huge and growing: Unstructured data already dominates the web (growing exponentially year over year), and the traditional technology these departments use cannot keep up. Recent developments in natural language processing technology (NLP), the field of AI that focuses on human language, have, for the first time, made it possible for automated systems to find and deliver identity-relevant intelligence hidden in unstructured textual data. These innovations unlock a new world of actionable insight, providing much-needed ammunition in the fight against fraud, money-laundering, financial crime, and terrorism.

Speaker:

, VP, Engineering, Basis Technology

 

Wednesday, May 22

Track CS: Cognitive Computing & AI Summit

 

CS201. Machine Learning Best Practices

10:45 AM2019-05-222019-05-22

Wednesday, May 22: 10:45 a.m. - 11:30 a.m.

There are many ways to implement ML projects, but not all of them work out as planned.

How to Implement a Machine Learning Algorithm in Lean Production

10:45 a.m. - 11:30 a.m.

EnCata’s work is organized according to lean principles, which mean reducing delivery time and eliminating time and money losses, connected with knowledge transfer and materials logistics. We used machine learning to create a neural network as a means of monitoring the working tool in production.

Speaker:

, CEO, EnCata

 

CS202. Diving Into Deep Learning

11:45 AM2019-05-222019-05-22

Wednesday, May 22: 11:45 a.m. - 12:30 p.m.

Deep learning brings many opportunities that businesses can use to augment current processes.

Using Deep Learning for Recommender Systems

11:45 a.m. - 12:30 p.m.

Using ML-based deep neural networks for prescriptive analytics via recommendations, followed by predictive analytics using the results of recommendation is the focus of Lakshman Bulusu’s presentation. He explains the pragmatics of using R and provides an industry use case as a proof of concept about how business can leverage this to augment BI at an enterprise level. He includes a demonstration of how AI Meets BI via integration with a BI platform using Oracle/Tableau.

Speaker:

, Consultant & VP of Research, Matlen Silver, Qteria

 

CS203. AI Use Cases Today

02:00 PM2019-05-222019-05-22

Wednesday, May 22: 2:00 p.m. - 2:45 p.m.

Think big, start small, and just get going is an excellent approach to take full advantage of AI’s transformational technologies.

Solving Business Problems in Government, Financial Services

2:00 p.m. - 2:45 p.m.

AI is already having a significant impact for the U.S. government, including Defense and Intelligence Community use cases, and is also providing game-changing capabilities for global enterprises in a range of industries, including financial services, life sciences, and technology. Guarino provides real-world examples of how AI is driving measurable benefits in a range of industry sectors. She discusses the importance of Explainable AI to regulated industries like financial services and healthcare, where being able to justify the reasoning behind algorithmic decisions is essential.

Speaker:

, COO, Kyndi

 

CS204. PANEL: Ethical Considerations of AI Technologies

03:00 PM2019-05-222019-05-22

Wednesday, May 22: 3:00 p.m. - 3:45 p.m.

As we move into a new computing era, technology has outstripped society in its ability to upend labor markets and industries, as well as personal lives.  This panel discussion by researchers and industry experts, and moderated by the Cognitive Computing Consortium, presents a number of ethical and legal issues that computing advances have raised.  Areas covered include: security, privacy, decision making by humans vs. machines, self driving vehicles, effects on labor markets and industries.  Presenters will avoid hype and discuss practical approaches to these issues.

Speakers:

, President, Synthexis and Cognitive Computing Consortium

, Executive Director, Cognitive Computing Consortium

, CEO, Indico Data

, COO, Co-Founder, Basis Technology

Wednesday, May 22

Keynotes

 

CLOSING KEYNOTE - Bring It Home: How to Advance Your Analytics Strategies

04:00 PM2019-05-222019-05-22

Wednesday, May 22: 4:00 p.m. - 5:00 p.m.

Companies are not lacking in technology options; in most cases, more advanced technologies exist than can be absorbed into the organization all at once. As data and analytics leaders, you need to determine how to make the biggest business impacts with advances in AI and ML for analytics, enable self-service with governance, and support BI and data engineering processes with the data lake. To establish this solid enterprise foundation, it is essential to recommit to data management principles and prioritize platform technologies and ecosystems accordingly. In his closing keynote, O’Brien shares a clear path forward based on the evolving concepts in data and analytics within the context of an enterprise-scale data strategy.

Speaker:

, Principal Advisor & Industry Analyst, Radiant Advisors

Don’t Miss These Special Events

BROUGHT TO YOU BY