Preconference Morning Workshops

 

Workshop 1. Introduction to Hadoop

Monday, May 15: 9:00 a.m. - 12:00 p.m.

Hadoop has forever changed the economics and dynamics of large-scale computing, and its use among enterprises looking to augment traditional data warehouses continues to grow. Join this workshop to explore the basics of Hadoop, including the Hadoop Distributed File System, MapReduce, and the budding ecosystem of Hadoop software projects. Learn best practices for installing and configuring Hadoop in your environment, managing its performance, and developing Big Data applications. Please Note: This workshop is hands-on. Attendees are responsible for bringing their own laptop.

Speaker:

, Data Engineer, MapR

 

Workshop 2. Introduction to Spark

Monday, May 15: 9:00 a.m. - 12:00 p.m.

Built for large-scale data processing, Spark is on the rise as one of the most active open source projects in the marketplace today. By enabling organizations to perform in-memory analytics on big datasets, Spark offers substantial performance benefits, as well as versatility as a multi-use platform for data scientists. This workshop provides an overview of its key features, ecosystem, and architecture, as well as how to run Spark, connect data sources, and start building applications. Please Note: This workshop is hands-on. Attendees are responsible for bringing their own laptop.

Speaker:

, Platform Engineering Manager, Handy

 

Preconference Afternoon Workshops

 

Workshop 3. Introduction to Data Science

Monday, May 15: 1:30 p.m. - 4:30 p.m.

Data science, the ability to sift through massive amounts of data to discover hidden patterns and predict future trends, may be the "sexiest" job of the 21st century, but it requires an understanding of many different elements of data analysis. This workshop dives into the fundamentals of data exploration, mining, and preparation, applying the principles of statistical modeling and data visualization in real-world applications. Please Note: This workshop is hands-on. Attendees are responsible for bringing their own laptop.

Speakers:

, CEO and Founder, Caserta Concepts

, Senior Data Architect, Caserta Concepts

 

Workshop 4. Introduction to Machine Learning

Monday, May 15: 1:30 p.m. - 4:30 p.m.

From recommender systems to disease diagnosis, machine learning is revolutionizing the process of complex decision-making by enabling the analysis of bigger, more complex datasets and the delivery of faster, more accurate results. This workshop examines the statistical and algorithmic principles for developing scalable, real-world machine learning pipelines and applications. Please Note: This workshop is hands-on. Attendees are responsible for bringing their own laptop.

Speaker:

, Data Scientist, Spotright

Keynotes

 

WELCOME & KEYNOTE: The Human Side of the Data Revolution

Tuesday, May 16: 9:00 a.m. - 9:45 a.m.

For more than a decade “data” has been at or near the top of the enterprise agenda. A robust ecosystem system has emerged around all aspects of data—collection, management, storage, exploitation and disposition. And yet, more than 66% of Global 2000 senior executives are dissatisfied with their data investments/capabilities. This is not a technology problem. This is not a technique problem. This is a people problem. Futurist Thornton May, in a highly interactive session, shares research results of his multi-institution examination of the human side of the data revolution.

Speaker:

, CEO, FutureScapes Advisors, Inc.

 

SPONSORED KEYNOTE: Architecting Business Disruption with Machine Learning

Tuesday, May 16: 9:45 a.m. - 10:00 a.m.

"Data" is the new differentiator and companies who can successfully adapt their businesses based on insights gleaned from data will have a significant advantage. In this session, Rob Thomas will provide a brief overview of Machine Learning and the use case patterns that clients are using to disrupt their industry.

Speaker:

, General Manager, IBM Analytics

 

OPENING KEYNOTE: Information Governance Meets Big Data Analytics

Wednesday, May 17: 8:45 a.m. - 9:30 a.m.

Big Data presents challenges and opportunities for a wide range of activities, enterprises, and individuals. From complex data architectures, cloud storage, data analysis, and overall database management to the legal implications, information governance underpins Big Data analytics. The innovative uses of Big Data are exciting, but questions about data retention policies, securing your data (both big and small), ransomware, and compliance with legal systems in the U.S. and other countries shouldn’t be ignored.

Speakers:

, Associate General Counsel, ZL Technologies

, Chief Data Scientist & Chair of the Information Governance Group, Drinker Biddle & Reath

 

SPONSORED KEYNOTE

Wednesday, May 17: 9:30 a.m. - 9:45 a.m.

 

Data Capital in the 21st Century

Wednesday, May 17: 9:45 a.m. - 10:00 a.m.

Only a small fraction of global firms increase productivity year after year, according to the Organization of Economic Cooperation and Development (OECD). Creating and using unique stocks of data capital is one of the key tactics these firms use to widen their lead. Come learn how two new ideas – data trade and data liquidity – can help all companies, not just superstars, take advantage of the data revolution, and hear examples of firms already putting these ideas into practice.

Speaker:

, Oracle Big Data Strategist, Oracle

 

CLOSING KEYNOTE: Architecting Data for the Modern Enterprise

Wednesday, May 17: 4:00 p.m. - 5:00 p.m.

The “Big Data era” has ushered in an avalanche of new technologies and approaches for delivering information and insights to business users. What is the role of the cloud in your analytical environment? How can you make your migration as seamless as possible? This closing keynote, delivered by a prominent consultant who has helped many global enterprises adopt Big Data, provides the inside scoop you need to supplement your data warehousing environment with data intelligence—the amalgamation of Big Data and business intelligence.

Speaker:

, CEO and Founder, Caserta Concepts

 

Track A: Moving to a Modern Data Architecture

 

A101. The Rise of the Internet of Things

Tuesday, May 16: 10:45 a.m. - 11:45 a.m.

IoT represents the next wave of data on which companies are focused for new business opportunities as well as insights into their existing customers, products, and operations. What are the implications for the enterprise in terms of issues, such as data management, governance, and security?

IoT Adoption & Maturity Today

There is plenty of buzz today about the Internet of Things. But what’s real and what’s hypothetical or just the marketing machine behind the technology’s potential? Radiant Advisors, DBTA, and Unisphere Research recently partnered on an IoT Market Survey to assess the state of IoT adoption and maturity. This presentation shares results of the survey and key insights, including overall market maturity, key drivers and expectations, perceived technology gaps, primary challenges and success factors, and the importance of privacy and security considerations.

Speaker:

, Founder and Principal, Radiant Advisors

Data Governance in IoT

IoT changes the data game. Again. The ability to create, collect, and exchange data with connected devices, sensors, actuators, and essentially anything in which a computing system can be embedded offers a new level of insight and intelligence into every industry. It also introduces a new level of risk and liability which companies cannot ignore. Anne Buff looks at the governance challenges IoT presents and suggests actions for you to take now to protect your business. Learn the key success factors in providing a governed IoT environment that enables discovery and innovation while protecting the business and its most valuable assets.

Speaker:

, Business Solutions Manager, SAS Best Practices, SAS Institute

 

A102. Succeeding With Big Data in the Real World

Tuesday, May 16: 12:00 p.m. - 12:45 p.m.

Efficiently collecting and securely storing large quantities of data is a challenging enough, but using it for greater insights and competitive advantage, even more so. Organizations need to know the tools and strategies that can help put Big Data to work.

Building a Data Lake for the Real World

We’ve all been promised the “Shangri-La” that is data lakes: More data means more insights—synergy! But has it really panned out? Data lakes are actually more similar to the early days of the Internet than they are a panacea for Big Data woes, filled with pristine, useful information. The data may have been reliable when first published but may not be now. Yet, as with many stale webpages, there is no way to tell, and the business continues to rely on wrong data to make decisions. Zillow faced the same problem and decided to change it. This session covers the tools the company built and the key tenets to help you ensure your lake rejuvenates your organization.

Speaker:

, Director, Analytics Engineering, Zillow Group

Continuous Data Integration: From Data Lakes to Cloud Data Integration, Keeping it Real-Time

How do you drive the most value from your data? There is growing attention on the need to process and analyze data in motion. As your business looks to improve value generated from various data sources, the finding is likely that processing historical data is not enough nor is relying solely on real-time streaming solutions. Learn how continuous data integration can drive value for your business. Gain insights from HVR on how leading global organizations have implemented this approach and the impact on their business.

Speaker:

, VP Field Engineering, HVR

 

A103. Database Management Today

Tuesday, May 16: 2:00 p.m. - 2:45 p.m.

Having access to data that can be trusted is essential for effective decision making and ultimately for business success

Ensuring Data Integrity in Operational Databases Systems

Just because a DBMS supports ACID does not guarantee that your data will be correct. There are many other parameters and programming decisions that will impact your data, and, all too often, nobody is paying any attention to them. This session looks at the difference between ACID and BASE transaction models and considers other issues that can impact data integrity and accuracy.

Speaker:

, President & Principal Consultant, Mullins Consulting, Inc. and IBM Gold Consultant

Monitoring Health & Performance of Big Data Environments

In many organizations, distributed data architectures like Hadoop and in-memory databases like Redis have become the backbone of an increasingly important data analytics engine that informs critical business applications. But an analytics application is only as good as the data it depends on, and many organizations don’t have a good grasp on the health of their Hadoop environment until there’s a problem. Are you relying on open-source tools to monitor an enterprise-class analytics cluster? You’re certainly not alone. But, you might sleep a little better if you took a tip or two from this presentation.

Speaker:

, Product Manager, SelectStar

 

A104. Going Beyond Relational

Tuesday, May 16: 3:15 p.m. - 4:00 p.m.

Databases continue to be a central part of solutions architectures, but different systems offer different attributes. In this fast-paced Big Data world, understanding the array of data management choices is essential.

Crafting Your Database Strategy for the Future

Big Data and Hadoop, in particular, have heightened the importance of making prudent database decisions in days and weeks where months and years were previously allotted. Couple this compressed decision window with an order of magnitude more choices, and even a consummate architect or database professional has got a serious challenge. Targeted at architects, administrators, and DBAs, this presentation covers a taxonomy of databases, unique advantages and disadvantages of each, usage, tidbits, positioning, strategy, and sundry items such as a “cheat sheet” to help architects navigate the maze to help make both strategic and tactical decisions with ninja-like precision.

Speaker:

, Enterprise Technologist, Big Data, ISG, DellEMC

Surviving and Thriving in a Hybrid Database World

With the proliferation of data, databases, and data types, it may seem that you're drowning in a sea of data. When hybrid is not just an idea, but a reality within your organization, you must not only survive, but also thrive, because your business depends on data. During this session, senior solutions architect, Russ Tuttle gives you practical advice on how to maintain productivity and reduce risks while implementing and utilizing more database platforms. He shows how to make your hybrid environment work for you by sharing best practices for proactive database management, how to simplify administration, and ways to minimize the learning curve for new and additional database platforms.

Speaker:

, Senior Solutions Architect, Quest

 

A105. Supporting Real-Time Applications

Tuesday, May 16: 4:15 p.m. - 5:00 p.m.

BI and analytics are at the top of corporate agendas. Competition is intense, and, more than ever, organizations require fast access to insights about their customers, markets, and internal operations to make better decisions—often, in real time.

Architectures for Real-Time Applications

Enterprises face challenges powering real-time business analytics and systems of engagement (SOEs). Analytic applications and SOEs need to be fast and consistent, but traditional database approaches, including RDBMS and first-generation NoSQL solutions, can be complex, a challenge to maintain, and costly. Companies should aim to simplify traditional systems and architectures while also reducing vendors. One way to do this is by embracing an emerging hybrid memory architecture, which removes an entire caching layer from your front-end application. This talk discusses real-world examples of implementing this pattern to improve application agility and reduce operational database spend.

Speaker:

, CTO & Co-Founder, Aerospike

 

A201. The Future of Data Warehousing

Wednesday, May 17: 10:45 a.m. - 11:30 a.m.

The enterprise data warehouse is evolving. Cloud services, virtualization, the data lake, and other emerging technologies have a role to play. Find out how.

The Journey to the Logical Data Warehouse

In addition to the ongoing blizzard of technological changes and related opportunities facing large organizations, including Big Data, NoSQL, IoT, and cloud, many leading financial services organizations are grappling with additional issues that are relevant to the financial services arena. This presentation discusses whether the logical data warehouse is an appropriate approach to capitalize on the recent advances and looks at additional data management strategies, such as blockchain and FIBO (Financial Industry Business Ontology).

Speaker:

, CEO, ADRM Software, Inc.

Customer Case Studies in Bi-Modal Logical data Warehouse Architecture Using Data Virtualization

Data Warehouse is no longer the single source of the truth. The traditional data warehouse has reached its limit. With the advent of big data, it is no longer feasible to store all of the data in the data warehouse because of cost, and in some cases like the unstructured data, it is no longer possible to store such data in the data warehouse. Data warehouse has evolved into bi-modal logical data warehouse architecture where it is now necessary to combine the derived, structured data from the data warehouse with unstructured data from Hadoop, Spark, NoSQL, and other enterprise systems. This session will showcase case studies of award-winning customers, whose logical data warehouse architectures demonstrate the future of the data warehouse.

Speaker:

, Chief Marketing Officer, Denodo Technologies

 

A202. The Evolution of Data Lakes

Wednesday, May 17: 11:45 a.m. - 12:30 p.m.

With the growth of Big Data, there is renewed appreciation for data governance principles. Understanding data lineage and provenance and having policies and processes in place are key requirements for ensuring that data is trustworthy and protected.

Search the Enterprise Data Lake—Watch Us Do It!

People talk a lot about building enterprise “data lakes” or data hubs to knock down data silos and democratize data access to different types of users. These are abstract topics. Maybe it’s time to stop talking and see how this can be done in practice. Search Technologies is currently processing and searching data from a data lake for a large life sciences customer. Come see, step by step, how this is accomplished. Paul Nelson takes disparate data sources such as document files and data tables, indexes and searches them and shows attendees morph lines for content processing and indexing, as well as search and visualizations to provide business insight into this data lake. He does so using Cloudera CDH, including Cloudera Search and Hue.

Speaker:

, Vice President & Chief Architect, Search Technologies

 

A203. Overcoming the Big Data Quality Challenge

Wednesday, May 17: 2:00 p.m. - 2:45 p.m.

As organizations face greater pressure to compete on analytics, the need for data quality becomes paramount. In order to realize the promise of Big Data for improved decision making, data stores must be trustworthy.

Data Quality in the Age of Big Data

As an enterprise moves from historical business intelligence reporting to predictive analysis founded on Big Data, data quality becomes increasingly important for successful outcomes. By observation, using conventional BI tools, the consumer of the information can drill down through the report to the detail when the aggregate number does not “feel” right, but this will not be possible with Big Data due to the sheer volume and complexity of the data. This presentation looks at the elements of data quality required to make effective use of algorithm-based data analysis and predictive analytics. Using the Data Management Maturity Model from the CMMI Institute, the presentation reviews the capabilities of an effectual data quality program and considers the competencies required to support data analysis and predictive analytics with Big Data.

Speaker:

, VP, Architecture Strategies, Sandhill Consultants

 

A204. The In-Memory Revolution

Wednesday, May 17: 3:00 p.m. - 3:45 p.m.

A recent survey by Unisphere Research of data executives and professionals found that a majority of respondents, 57%, had strong demand for real-time information within their organizations. With the ability to bring faster insights to users and applications, in-memory technologies hold the potential to bring analytics to new levels

Mastering In-Memory Data Technology

You're an architect or a developer with years of experience creating amazing applications. And now you've just been assigned to a low-latency project! Because you're an expert, you're expected to master a new in-memory data technology and be productive from Day 1. This session covers what you need to know to be successful on your low-latency in-memory project. Attendees learn about the modern in-memory technology landscape; distributed data topologies; reliable, scalable, and durable in-memory data—the techniques for really big in-memory data. And, most important, no prior knowledge of in-memory data grids is required!

Speaker:

, Senior Solutions Architect, Hazelcast

 

Track B: Analytics and Applications

 

B101. Becoming a Data-Driven Enterprise

Tuesday, May 16: 10:45 a.m. - 11:45 a.m.

Marc Andreesen’s famous pronouncement that software is eating the world continues to ring true. Cutting-edge approaches are helping organizations define success in ways that couldn’t be imagined just a few years ago.

Simple Strategies for Faster Knowledge Discovery With Big Data

In today’s analytics-driven world, the challenge for business success is not so much who can mine actionable nuggets of information from a mountain of data but who can get to it faster. To achieve this edge, Uber created an intelligent data infrastructure layer that actively monitors data usage patterns across its Hadoop cluster and recommends strategies for faster data processing. This presentation provides insights into Uber’s intelligent data infrastructure layer and how to achieve knowledge discovery at speed using simple data organization strategies, as well as the pitfalls to avoid while implementing them.

Speakers:

, Senior Data Scientist, Uber

, Staff Software, Engineer, Uber

The Five Key Pillars of Big Data Analytics

Data, big and small, surrounds us at every point and from every perspective. It can open new, powerful insights on virtually everything. What can it inform you and your business opportunities? This interactive program details the five key pillars of Big Data analytics—context, data, methods, technologies, and resources, along with their orchestration in technical architectures and formal data science processes to successfully uncover actionable answers found in multiple large datasets and data streams.

Speaker:

, Chief Data Scientist/Principal Engineer, RMS, Lockheed Martin

 

B102. Driving Innovation Through Big Data

Tuesday, May 16: 12:00 p.m. - 12:45 p.m.

For both government agencies and businesses, unforeseen opportunities for innovation and differentiation as well as fresh approaches to problem-solving can emerge from massive quantities of information—when combined with the right Big Data technologies.

Data as Rocket Fuel for Accelerating Innovation

Big Data and analytics exist at the core of 21st-century discovery and decision making in an increasingly data-driven world. Learn how leaders in this space are leveraging Big Data assets to enable collaborative problem-solving, transform interactions across industry and government, and accelerate innovation and how Rhode Island is exploring new models for driving Big Data partnerships forward.

Speaker:

, Chief Innovation Officer, State of Rhode Island & Rhode Island Commerce Corp.

Building Data Lakes for Real-time Analytics and IoT: Best Practices from the Fortune 100

Attunity, the #1 independent provider of real-time data streaming, will share its experiences in enabling universal data ingest for Fortune 100 Companies’ large-scale data lakes in support of real-time analytics and IoT initiatives. Join us to examine case studies from some of the world’s largest automotive, manufacturing, food processing and retail corporations.

Speaker:

, Senior Director and Technology Evangelist, Attunity, Inc.

 

B103. Big Data Analytics in Action

Tuesday, May 16: 2:00 p.m. - 2:45 p.m.

Big Data has major implications for areas such as marketing and CRM. Forward-thinking companies in a range of fields are using Big Data to engage more effectively with their customer base.

Big Data for Line-of-Business Content

The majority of business content is locked in disparate silos of unstructured data across many types of systems, including Collaboration, ECM, CRM, ERP, PLM, and archiving systems. This talk by BA Insight covers the techniques we've used—applying NLP, MapReduce, and search—to extract content from these systems, add structure to it, correlate it, use it to derive deep insight, and publish the resulting insights and explorable data via enterprise-wide portals. We cover four specific real-world examples and leave attendees with some specific steps they can take as well as inspiration about what they can achieve.

Speaker:

, CTO, BA Insight

Streaming Real-Time Data from On-Premise Databases to the Cloud

Having the capacity to move data seamlessly between on-premise to the cloud often requires significant internal effort and resources. However, the ability to move and integrate data seamlessly across heterogeneous systems in real-time is becoming mission critical in today’s ever growing competitive environments. We invite you to see how you can mitigate this challenge. In this HVR quick how-to, we will show you how easy it can be to implement a download that will capture real-time changes from relational databases and efficiently stream them in real-time to multiple destinations in the cloud.

Speaker:

, VP Field Engineering, HVR

 

B104. Beating the Barriers to Better Analytics

Tuesday, May 16: 3:15 p.m. - 4:00 p.m.

With the advent of Big Data, new approaches to data architecture are needed to ingest data faster, store it more effectively, and put it to use for better decision making.

Overcoming Scale Barriers Through Approximate Query Processing

The days of “adding another node” are numbered, and the increase in complexity requires an evolutionary approach. In this session, Don DeLoach explains approximate query processing (AQP) and shows how high value approximations provide equivalent insight to exact queries while ensuring SLAs and reducing resource utilization. Using statistical metadata offers an opportunity to overcome the mounting scale barriers and increased complexity by rethinking the strategy and reaching previously unattainable opportunities.

Speaker:

, President & CEO, Infobright

Breaking the Cloud Barrier for Data Analytics

Explore the challenges and benefits of leveraging the cloud for data analytics. Jeschonek discusses why on-demand cloud computing infrastructure and its near-infinite processing capabilities make cloud a viable option for data modeling and analytics. Learn the criteria for choosing the right cloud strategy for your projects, the key benefits anr use cases for cloud analytics, and how to deal with the data gravity issue when considering cloud analytics.

Speaker:

, Director Cloud Solutions, Avere Systems

 

B105. Supercharging Your Marketing With Big Data

Tuesday, May 16: 4:15 p.m. - 5:00 p.m.

Using robust technologies to track and analyze a buyer’s journey, organizations are using Big Data to supercharge their marketing efforts.

Using Machine Learning & Predictive Analytics to Drive Marketing Intelligence

Being data-driven is more than collecting data and reporting on past success, it’s about dynamically measuring data in real time and using it to predict future revenue impact. In this session, Nadim Hossain discusses how the current data explosion and emergence of digital intelligence are impacting marketing. He calls it “marketing intelligence (MI),” which is all about using data to connect marketing to sales and revenue goals. MI goes beyond predictive lead scoring to leverage historical performance, machine learning, and predictive data analytics to automate direct-attribution and forecasting. Insights from MI allow sales and marketing to know exactly which efforts are impacting revenue, enabling them to make real-time, strategic decisions.

Speaker:

, Head of Product Strategy, SiSense

 

B201. Maturing Your Data Quality & Analytics

Wednesday, May 17: 10:45 a.m. - 11:30 a.m.

High-quality analytics requires high-quality data. To achieve business goals from traditional data management stacks, emerging technology environments, or a combination thereof, data quality is still a foundational requirement.

Merging the Data Quality & Data Analytics Lifecycles

The Tab Group created a lifecycle curve for data quality. Many of its nodes have come to fruition. However, many of these nodes have coincided with similar points on an inverse curve that represents the lifecycle of data analytics. The simultaneous tracing of these symbiotic nodes on both curves provides opportunities and challenges toward accelerating maturation for data quality and analytics.

Speakers:

, Data Integrity Officer, Wells Fargo Bank

, Data Integrity Manager, Wells Fargo Bank

Cognitive Search & Analytics: What It Is and Why You Should Care

If you are a believer in the data-driven organization (or even just curious) and have ever wondered what could happen if you cleverly combined the power of data collection, indexing, text mining, search, and machine learning into a unified platform and applied it within the enterprise, this talk is for you! Come learn about the state of cognitive search and analytics technology and how it is enabling great companies across a wide swath of industries to amplify mission-critical expertise within their business in a surprisingly short amount of time.

Speaker:

, Senior Product Marketing Manager, Sinequa

 

B202. Enabling the Real-Time Enterprise

Wednesday, May 17: 11:45 a.m. - 12:30 p.m.

With the growing dependence on data analytics, experts agree the realtime delivery of data insights can be invaluable to business agility.

The Value of Interactive Analytics for Real-Time Insights

In a world awash with data, stale spreadsheets and lagging queries cannot provide the rapid insights required to succeed in today’s data-driven marketplace. There is an entire layer of insights buried underneath these rudimentary graphs that could prove invaluable if users had access to them. Enter a new generation of analytics: Interactive. This presentation covers the value of interactive analytics and dives into the issues that analytics pose for today’s data-driven leaders.

Four Ways to Scale Interactive Analytics and BI for Real-Time Insights

We live in an on-demand world of Twitter, DVRs, Snapchat, and more. Consumers get what they want, when they want it. Why should analytics for organizations or customers be any different? Yet, “Big Data” and “Fast Data” are at odds with traditional data architectures, requiring a new approach to providing real-time insights and interactive analytics. Apache Hadoop, Spark, Kafka, Kudu, and others are modernizing the data platform. The final step is to provide real-time, self-service analytics and interactive visualization for end users to get the real-time insights they need to make more informed decisions and test new hypotheses.

Speaker:

, CTO and Co-Founder, Arcadia Data

 

B203. Unlocking the Power of Data Science

Wednesday, May 17: 2:00 p.m. - 2:45 p.m.

Famously tagged as the sexiest job of the 21st century in the Harvard Business Review, the reality at many organizations is that data scientists, as well as analysts and other users, are struggling to find the information they need when they need it. New approaches are available to unlock the power of data science.

The “Amazonification” Approach to Data

Savvy businesses are recognizing the value of taking an “Amazon-ified” approach to data. Mimicking the sort the one-stop-shop approach that users enjoy and are accustomed to with Amazon, a sophisticated data catalog that is tied to data governance offers businesses the ability to deliver new levels of efficiency for users to easily “shop” for the data they need, all from one central location. This can be done instantaneously for quicker business operations and decisions, without needing to go through a technical intermediary. This presentation covers how a data catalog can enable data citizens to uncover insights and drive innovation that solves critical business challenges.

Speaker:

, CTO, Collibra

 

B204. The Future of Analytics

Wednesday, May 17: 3:00 p.m. - 3:45 p.m.

As organizations seek to use data for competitive advantage, providing information when it is needed in the right context is a challenge. Creating a data-driven culture means embracing new approaches to deliver value for decision makers.

Data Socialization: Revolutionizing Self-Service Data Preparation & Analytics

Self-service analytics has many benefits, but it’s also caused the data landscape within many companies to become like the Wild West. Data and analytics outcomes aren’t being shared and reused for the greater good; rather, users are starting every analytics project from scratch without the benefit of repeatable data modeling. This presentation explains how data socialization is revolutionizing self-service data preparation and the analytics experience. This transformative new capability integrates traditional self-service data preparation benefits with key attributes common to social media platforms. The powerful combination enables data scientists, business analysts, and even novice business users to search for, share, and reuse prepared, managed data to achieve true enterprise collaboration and agility.

Speaker:

, Chief Product Officer, Datawatch

 

Track H: Hadoop Day

 

H101. Unleashing the Power of Hadoop

Tuesday, May 16: 10:45 a.m. - 11:45 a.m.

Hadoop is here to stay, but so are a host of other approaches. To be effective, they must all work together in the enterprise.

Accelerating Big Data Implementations Through Hadoop Interoperability

During the last 10 years, Apache Hadoop has proven to be a popular platform among developers who require a technology that can power large, complex applications. For customers, partners, and application ISVs who write on top of Hadoop, there is still one huge issue that remains—interoperability. Steve Jones and John Mertic take a closer look at how Apache Hadoop can become more interoperable to accelerate Big Data implementations.

Speakers:

, Director, Program Management, ODPi

, Global VP, Big Data, Capgemini

SQL on Hadoop & Big Data Systems

SQL has been with us for more than 40 years and Hadoop, about 10. Even though when Hadoop was born there was no SQL interface to it, it has become imperative that SQL on Hadoop solutions are brought to the market. This talk provides an overview of SQL on Hadoop, including low latency SQL on Hadoop for analytic workloads, and how SQL engines are innovating

Speaker:

, Big Data and Data Science Architect, Independent Consultant and Author - Book - Apress - "SQL on Big Data - Technology, Architecture and Innovation"

 

H102. Harnessing Big Data With Spark

Tuesday, May 16: 12:00 p.m. - 12:45 p.m.

Open source platforms and frameworks such as Apache Spark have paved the way for commodity-priced processing on a massive scale.

Build Machine-Learning Algorithms Powered by Spark

One of the most exciting use cases of Apache Spark is the development of Self Service and interactive Predictive Analytic platforms. We can now integrate model generation and prediction of machine learning with data visualization capabilities that are powered by distributed processing capabilities of Apache Spark. In this presentation we would explore this capability to see how you can 'see' your data in full color.

Speaker:

, Lead Big Data Architect, Transamerica Data Engineering

 

H103. The Streaming Future of Big Data

Tuesday, May 16: 2:00 p.m. - 2:45 p.m.

Real-time utilization of streaming data requires a modern architecture that can scale. Learn about the technologies that can help.

Event-Driven Microservices With Streams & Docker

This presentation covers how to build a multiple location, event-driven architecture that uses streaming data to interconnect Docker-hosted microservices that allow implementation of scalable, redundant, and highly available services across multiple data centers. Using Docker containers and single-purpose microservices, this presentation demonstrates how these services are interconnected with event-driven streams and how this architecture can be deployed

Speaker:

, Senior Field Enablement Engineer, MapR Technologies

IBM Corporation
 

H104. Building an Enterprise Data Lake

Tuesday, May 16: 3:15 p.m. - 4:00 p.m.

The concept of an enterprise data lake is enticing. Find what’s needed and the technologies available to help build a data lake for the enterprise.

Open Source, Code-Free Data Pipelines

An enterprise data lake typically requires substantial effort to ingest, process store, secure, and manage data from a variety of sources. Cask Data Application Platform (CDAP) is an open source solution, which offers a self-service user interface for creating data lakes and simplifies the building and managing of production data pipelines on Spark, Spark Streaming, MapReduce and Tigon. This talk discusses how to achieve broad, self-service access to Hadoop while maintaining the controls and monitors necessary within the enterprise.

Speaker:

, CEO & Founder, Cask

 

H105. Integrating Hadoop Into Your BI Environment

Tuesday, May 16: 4:15 p.m. - 5:00 p.m.

A recent Unisphere Research survey on data management found that Apache Hadoop is gaining significant traction. About 40% of respondents now have a Hadoop installation.

The Do’s & Don’ts for Success With BI on Big Data

Think Hadoop is not in your future? According to a recent survey, 97% of organizations working with Hadoop anticipate onboarding analytics and BI workloads to Hadoop. When this happens, the companies which have disregarded the Big Data opportunity may be left behind. The good news is that onboarding your business intelligence workloads to Hadoop is not as complicated as it used to be. If you understand some key concepts, the transition can be simpler and more successful—allowing you to recycle current skill sets while avoiding either a rip-and-replace of your technical stack or elimination of business analysts to hire data scientists.

Speaker:

, VP, Product, AtScale

 

Track D: Data Security Forum

 

D101. The Emerging Threat Landscape

Tuesday, May 16: 10:45 a.m. - 11:45 a.m.

Data breaches can have wide-reaching implications for a range of organizations spanning healthcare, financial services, government, and manufacturing— and more. Find out how the threat landscape has changed and what you need to do to defend your organization.

Securing Your Data From Emerging Cyber Threats

The relentless pace of IT innovation is unfortunately matched by the speed of innovation in the ranks of hackers, cyber thieves, and other bad actors in the digital world. This talk looks at some of the emerging cyber threats likely facing enterprises in the near future.

Speaker:

, President, Pearl Knowledge Solutions

 

D102. Managing Data Vulnerabilities

Tuesday, May 16: 12:00 p.m. - 12:45 p.m.

Managing data vulnerabilities both internally and externally to prevent data breaches from happening is a key concern in the enterprise today.

Data Breaches: Protecting Your Enterprise’s Reputation

Exposure of private and sensitive data is making headlines. Affected organizations suffer severe costs and damage to their reputations. This presentation reviews ways to facilitate forensic analysis when data has been compromised and methods for prevention to ensure that vulnerability and loss do not happen (or happen again). Learn how to identify the “who, when, and where” of your private and sensitive data. It’s not just about ensuring data security; it’s about trust and traceability and the appropriate level of data governance. All data assets do not need to be managed to the same degree. Learn how you can focus on your critical data elements.

Speaker:

, VP, Architecture Strategies, Sandhill Consultants

 

D103. Security in the IoT World

Tuesday, May 16: 2:00 p.m. - 2:45 p.m.

We all want devices to be smarter. The benefits of IoT across industries, such as healthcare, manufacturing, oil and gas, and others, are gaining greater appreciation, but the security implications of all these connected things are also becoming better understood.

Things Gone Wild: Assessing IoT Risks

How long can we continue to place trust in the everyday devices we rely on? In an age of growing connectedness for everything from manufacturing robots to toothbrushes, the Internet of Things has the potential to morph from a helpful productivity enhancer into a cover for malicious infiltration of your home and office. Learn how makers can build secure “things” as well as the security controls that operators can implement. Milan Patel presents a simple model for assessing threats to the IoT ecosystem relevant to your industry and products. Security practitioners can learn how to be effective early adopters without being victims of “things.”

Speaker:

, Security Offerings Manager, IBM

 

D104. Database Security Today

Tuesday, May 16: 3:15 p.m. - 4:00 p.m.

With data increasingly seen as an important asset whose value is enhanced when it is made more widely accessible to internal users and outside partners, the risk of cyberattacks is increasing at the hands of bad actors as well as from unintentional lapses by trusted employees

Core Strategies for Improving the Security of Your Databases

This session presents the threat mechanisms that commonly result in data breaches and lays out the core strategies for improving the security of your databases, including encrypting data at rest and in transit, key management, user and service authentication, secure data sharing, protecting against injection attacks, and configuration changes. The specific challenges and tactics for implementing these strategies vary between relational databases and NoSQL. It looks at the tools, processes, and practices you can put in place regardless of where you are storing your data, including tools that check for vulnerabilities in the code that accesses your data and blocks attacks. It also surveys leading database products and identifies product-specific tools provided by leading database vendors.

Speaker:

, VP, Engineering, EC Wise, Inc. and CTO (acting), Click, Deal, Buy!; VP, Engineering (acting) Pivot Payables

 

D105. Building Security Into Your Data Architecture

Tuesday, May 16: 4:15 p.m. - 5:00 p.m.

The challenges to data security keep coming. Find out how to build security into your data architecture.

IBM Corporation
 

D201. Security in the Hadoop World

Wednesday, May 17: 10:45 a.m. - 11:30 a.m.

Awareness of data security has increased at many organizations due to the number of recent high-profile breaches. Understanding how to ramp up security is critical in this era of Hadoop and Big Data.

Delivering Secure BI in the Age of Hadoop

No industry is immune to a security breach. A recent report claimed that the most common cyber incidents are actually data breaches, “dwarfing rates of all other cyber events.” For enterprises the world over, data security, both internal and external, is of paramount importance. However, IT departments are faced with a difficult challenge. They need to ensure that internal business users have access to the data they need to make decisions and do their jobs and they also have to enforce appropriate controls to prevent unauthorized users from viewing or modifying sensitive data.

Speaker:

, VP, Product, AtScale

 

D202. Security in the NoSQL World

Wednesday, May 17: 11:45 a.m. - 12:30 p.m.

With the rise of Big Data and NoSQL technologies, what are the implications for data protection and security?

Preventing Injection Attacks Against NoSQL Databases

The terms “SQL injection” and “cross-site scripting attack” are well known to the application security community. With NoSQL databases, pure SQL injection attacks are no longer possible, but different types of injection attacks, occurring at all levels of the application stack, more than make up for them. This session looks at several strategies an attacker might use to perform parameter and even command injection in applications running against a NoSQL database such as MongoDB. It then looks at various strategies for eliminating these vulnerabilities, which mainly involve input validation and sanitization. In this context, the Open Web Application Security Project (OWASP) is discussed, along with some of the work being done under the OWASP banner in this area. Finally, methodologies are discussed for testing your applications to verify that they are (relatively) hardened against these types of attacks.

Speaker:

, VP, Engineering, EC Wise, Inc. and CTO (acting), Click, Deal, Buy!; VP, Engineering (acting) Pivot Payables

 

D203. Data-Driven Security

Wednesday, May 17: 2:00 p.m. - 2:45 p.m.

The opportunities for cyberattacks, mistakes by trusted insiders, and fraud are broad—and constantly growing. Today, organizations must adopt a data-driven security stance.

Preventing Millions of Dollars in Fraud Losses Through Data Sharing

When we think of fraud today, our minds go first to stolen credit card credentials, but fraudsters have developed new and innovative approaches to monetizing data about companies and individuals. Among these efforts has been synthetic small business fraud, in which a small business may appear on any given day with no records of incorporation, no recorded credit history among bureaus, and no employees to vouch for its legitimacy. XOR Data Exchange has formulated its Small Business Risk Exchange to help service providers determine both legitimacy and ability to pay for services, even with little or no recorded credit history or employees. And it’s all driven by data.

Speaker:

, Chief Operating Officer, Exchange Services, XOR Data Exchange

 

D204. The Great Cloud Security Debate

Wednesday, May 17: 3:00 p.m. - 3:45 p.m.

From private, to public, to hybrid, the adoption of all cloud types is growing, driven by the need for organizations to increase the efficiency and availability of their databases and applications, while reducing costs and complexity. However, security remains a contentious issue, especially in discussions around the public cloud. While some IT leaders do not trust moving any sensitive data off-premises, others believe that public cloud providers can provide better security than they can achieve on their own. The fact is it all depends on your requirements and how the public cloud provider can meet them. Join us for a special, no-holds-barred panel that explores the public cloud landscape, pitfalls to avoid and successful strategies for ensuring the protection of your data.

Speakers:

, Dir. Cloud Computing Evangelist, Spectrum Enterprise Navisite

, System Engineer Database Specialist, VMWare

 

Track V: Virtualization Day

 

V201. Database Management & The Cloud

Wednesday, May 17: 10:45 a.m. - 11:30 a.m.

Many IT departments are seeking ways to automate routine tasks and free up resources. Cloud represents a new way of approaching database management, potentially relieving enterprise data shops of many administrative burdens.

Virtual Database Management & the Cloud

Cloud is everywhere, and the database administrator is caught in the middle. How do you know where to start and what to migrate first? How do you perform consolidation planning and ensure data security? This session builds on a full demonstration, along with tips and tricks to show how DBAs can secure their environments to allow a self-service option so they no longer are the constraints to projects.

Speaker:

, Technical Intelligence Manager for the Office of CTO, Delphix

Architectures for Real-Time Analytic Applications

BI and analytics are at the top of corporate agendas. Competition is intense, and, more than ever, organizations require fast access to insights about their customers, markets, and internal operations to make better decisions—often, in real time. Enterprises face challenges powering real-time business analytics and systems of engagement (SOEs). Analytic applications and SOEs need to be fast and consistent, but traditional database approaches, including RDBMS and first-generation NoSQL solutions, can be complex, a challenge to maintain, and costly. Companies should aim to simplify traditional systems and architectures while also reducing vendors. One way to do this is by embracing an emerging hybrid memory architecture, which removes an entire caching layer from your front-end application. This talk discusses real-world examples of implementing this pattern to improve application agility and reduce operational database spend.

Speaker:

, CTO & Co-Founder, Aerospike

 

V202. The Cloud Landscape Today

Wednesday, May 17: 11:45 a.m. - 12:30 p.m.

Cloud solutions and services continue to be embraced for competitive advantage due to a variety of reasons. It is important to understand how the environment works today.

Dirty Little Secrets of the Cloud

After 10 years of existence, the cloud industry would have you believe it is a mature, stable industry that is safe and secure to take your business to. Yet, the secret the insiders in the know is caveat emptor: Let the buyer beware. This presentation covers the state of the cloud today, lessons learned from the early adopters, the new role of cloud keeper, and what to know in advance to create a successful cloud deployment. It ends with a checklist you can use ensure you choose a cloud provider that can meet your needs.

Speakers:

, Dir. Cloud Computing Evangelist, Spectrum Enterprise Navisite

, System Engineer Database Specialist, VMWare

 

V203. Increasing Agility Through Data Virtualization

Wednesday, May 17: 2:00 p.m. - 2:45 p.m.

Data virtualization is an enabling technology that can help an organization address the need for a comprehensive view of its data, avoid the need to replace physical systems, and allow the business and IT sides to work closer together in a more agile fashion.

Enabling the Data Marketplace Using Data Virtualization

The data as a service (DaaS) approach enables provisioning of both internal and external assets that can readily be used in the decision-making process. DaaS enables transformation and delivery of the raw data into meaningful data assets for multiple business initiatives in an organization. Attend this presentation to learn how Guardian Life has successfully created a data marketplace using data virtualization to deliver real-time trusted data assets. Learn how this approach reduces risks from data silos that contribute to wrong decisions and serves the right data with the right context at the right time, enabling monetization of data assets by serving necessary data to both internal and external stakeholders.

Speaker:

, Assistant VP, Enterprise Data & Insights Office, Guardian Life Insurance

 

V204. When Big Data Meets Cloud

Wednesday, May 17: 3:00 p.m. - 3:45 p.m.

Virtualization and cloud can provide an approach to abstract data from its sources of origin so that all data environments are opened up to access, including RDBMSs, NoSQL database systems, storage systems, and newer Big Data systems such as Hadoop and Spark.

A Practical Journey to Big Data and the Oracle Cloud

Cloud adoption is the No. 1 challenge enterprise customers face today. What will your journey look like? What path should you take? How do you get started? This session provides three practical journeys to the cloud that can solve real technical challenges organizations face today. Whether your challenge is delivering self-service BI, a new data lake, or data science analytics, these easy-to-learn tutorials are designed to provide you with hands-on experience with Oracle Cloud services.

Speaker:

, Enterprise Architect, Oracle

Don’t Miss These Special Events

BROUGHT TO YOU BY

ASSOCIATION PARTNER