Tuesday, May 12, 2015

Track A — Analytics and Applications
Track B — Moving to a Modern Data Architecture
Track C — IOUG Track: Big Data in the Cloud
Hadoop Day

CONTINENTAL BREAKFAST

8:00 a.m. - 9:00 a.m.

WELCOME & KEYNOTE - Understanding the Data Value Chain

9:00 a.m. - 9:45 a.m.

Creating value from data requires a new mind-set. It’s hard to escape silos, whether they are technical or conceptual. To exploit fully the opportunity of Big Data tools and architectures, we need a new way of thinking that frames data as a raw material of business. The answer is to focus not on the functional components (what you do to data) but on business outcomes and how they can be achieved (what you do with data). This new approach can be cultivated through looking at the data value chain.

MODERATOR: Marydee Ojala, Editor-in-Chief, KMWorld Magazine Editor, Enterprise AI World

Edd Dumbill, VP Strategy, Silicon Valley Data Science

Agile "AppStore" (SBA) Creation on a Rich Search Index

9:45 a.m. - 10:00 a.m.

The bad reputation of enterprise search will change as more powerful technology allows the extension of search to many enterprise data sources. Once enterprises have done the groundwork of indexing all or most of their data, they can do things they had never thought of before, such as easily and rapidly developing Search Based Applications (SBAs) to meet user needs. This talk will present several of the SBAs that populate the AstraZeneca "AppStore" on top of the Sinequa platform.

Hans-Josef Jeanrond, VP Marketing, Sinequa

Rob Hernandez, Data Analytics Lead, CTO Office, AstraZeneca

COFFEE BREAK in the Data Solutions Showcase

10:00 a.m. - 10:45 a.m.

Track A — Analytics and Applications

Moderator: Joe McKendrick, Principal Researcher, Unisphere Research

A101: Moving From Big Data to Big Value

10:45 a.m. - 11:45 a.m.

There is no denying that today, creating competitive advantage means embracing Big Data. Learn the differences between the new and traditional methods for getting the most from your data, and walk through an organizational maturity and readiness model that can be applied to help determine how your organization can adopt the latest Big Data approaches for the greatest value.

Embracing Big Data to Gain Competitive Advantage

Brian Squibb, Enterprise Cloud Platform Engineer, Google

In the Trenches With Big Data

Paul Nelson, Innovation Lead, Accenture

A102: Big Data Methodology

12:00 p.m. - 12:45 p.m.

Tony Shan thinks about Big Data holistically, with a methodical framework that comprises the scientific and engineering steps in accelerating Big Data solutions: Problem, Diagnosis, Facts, Analysis, Hypothesis, Solution, Prototype, and Implementation. He then drills down to the key techniques and methods commonly used in these steps, such as root cause examination, process mapping, force field investigation, benchmarking, interview, brainstorming, focus group, Pareto chart, SWOT, impact evaluation, gap analysis, POC, and cost-benefit study. Best practices and lessons learned from the real-world Big Data projects are revealed in this session.

Tony Shan, Chief Architect, Wipro

Data Governance and Data Modeling

Casey Gwozdz, Principal Consultant, CA ERwin Products, CA Technologies

ATTENDEE LUNCH in the Data Solutions Showcase

12:45 p.m. - 2:00 p.m.

A103: Succeeding With Big Data Analytics

2:00 p.m. - 2:45 p.m.

Opening the door to Big Data analytics may seem like an impossible task, but it’s not if you hold the right keys. Learn from 1010data’s Afshin Goodarzi how building millions of predictive models in minutes is now computationally possible, more accurate, and operationally deployable. Then join Sisense’s Eldad Farkash to hear about two Big Data projects that resulted in increased profitability, reduced customer churn, real-time analytics for nontechnical employees, and a handle on massive, exponentially growing datasets.

Big Data Mandated Analytics and Predictive Modeling

Afshin Goodarzi, Vice President, Chief Analyst, 1010data

Analytics Without the Headaches

Eldad Farkash, Founder & CTO (Chief Technology Officer), Sisense

COFFEE BREAK in the Data Solutions Showcase

2:45 p.m. - 3:15 p.m.

A104: Winning Customers Through Business Intelligence

3:15 p.m. - 4:00 p.m.

Mark Kryska, business intelligence leader at Kelly Services, Inc., has spent the last year working in its Business Intelligence Competency Center. Bringing a mathematical mind-set to business intelligence projects led him to technological solutions to winning customers at Kelly Services. Drawing on a customer case study, Xplenty explains how to handle data challenges such as security, performance, scalability, and agility.

How Kelly Services Wins Customers Through Business Intelligence

Mark Kryska, Business Intelligence Manager, Kelly Services

Using Xplenty to Integrate and Process Structured and Semi-structured Data

Yaniv Mor, Founder & CEO, Xplenty

A105: Capitalizing on IoT and Big Data Analytics

4:15 p.m. - 5:00 p.m.

We live in an increasingly connected world in which huge amounts of sensor data are creating new opportunities for applying advanced analytics at different touchpoints to drive business value. With the rise of the Internet of Things (IoT), the behavior of the “things” in the form of connected industrial machine data can be understood using physics-based models or math-based models. The outliers from such behavior help to understand the health of such machines and move to proactive asset management. Find out how this is leading to monetization of the data from the IoT and the Industrial Internet, as well as the emerging opportunities.

Big Data Use Cases for the Internet of Things

Gokula Mishra, VP, Advanced Analytics & Big Data, Oracle

NETWORKING RECEPTION in the Data Solutions Showcase

5:00 p.m. - 6:00 p.m.

Track B — Moving to a Modern Data Architecture

Moderator: John O'Brien, Principal Advisor & Industry Analyst, Radiant Advisors

B101: Understanding the Modern Data Architecture

10:45 a.m. - 11:45 a.m.

Next-generation enterprise environments must support diverse structured and semi-structured data types, greater volumes of internal and external data, more users, and usage types—all while scaling indefinitely and affordably. Along with changing requirements, the role of data architects and modelers has also evolved. Join Clarity Solution Group’s Mike Lamble to learn about the "Top 10" cornerstones of a modernized enterprise data environment and how it can help organizations accelerate time-to-answer, reduce cost-per-answer, and improve the quality of answers. And hear from Embarcadero’s Ron Huizenga about why an enterprise modeling strategy is now more important than ever and what modeling techniques can help address the top data architecture challenges.

Top 10 Cornerstones of a Modern Data Architecture

Mike Lamble, CEO, Clarity Solution Group

Top 5 Data Architecture Challenges

Ron Huizenga, Senior Product Manager, Idera

B102: Creating a Data Architecture for Business Intelligence and Analytics

12:00 p.m. - 12:45 p.m.

The Analytical Data Architecture (ADA) represents the evolution of pragmatic practices for BI, data integration, and data warehousing. This framework leverages current and emerging technologies in its implementation supporting structured, semi-structured, and unstructured data. It includes data architecture, integration, and governance as they apply to business intelligence and analytics. ADA avoids pitting Big Data against "old school" architectures and presents a portfolio of data, information and technology architectural components to help enterprises implement BI and analytics.

Richard Sherman, Managing Partner, Athena IT Solutions

Data Virtualization: Plug 10 Key Gaps in Your BI/Analytics Capability Today (in <6 Months)

Suresh Chandrasekaran, Denodo

ATTENDEE LUNCH in the Data Solutions Showcase

12:45 p.m. - 2:00 p.m.

B103: Unlocking the Power of Machine Data

2:00 p.m. - 2:45 p.m.

Machine data is one of the fastest growing, most complex, and most valuable segments of data. The technology infrastructure running your enterprise—the web servers, applications, network devices, mobile devices, sensors—generates massive streams of data, in an array of unpredictable formats that are difficult to process and analyze by traditional methods or in a timely manner. But why is it so valuable? Machine data contains a trace—a categorical record—of user behavior, cybersecurity risks, application behavior, service levels, fraudulent activity, and customer experience. If leveraged correctly, a company can gain a significant competitive advantage with machine data.

Why Unlocking Machine Data Is the Secret Ingredient to Your Business Success

Dejan Deklich, VP, Platform and Cloud Engineering, Splunk

CrowdFlower Helps Unlock Data

Lukas Biewald, CEO & Founder, CrowdFlower

COFFEE BREAK in the Data Solutions Showcase

2:45 p.m. - 3:15 p.m.

B104: Taking Advantage of New Data Management Technologies

3:15 p.m. - 4:00 p.m.

More than ever before, businesses today run on data. Apache Ignite is the in-memory data fabric that combines in-memory cluster and computing, in-memory data grid, and in-memory streaming under one umbrella. Join this session to find out more about in-memory computing (IMC) and what is driving its exponential adoption, as well as the benefits of moving from an RDBMS view of the data world to a NoSQL view in order to leverage modern data models in your applications.

Apache Ignite: In-Memory Data Fabric

Dmitriy Setrakyan, EVP of Engineering, GridGain Systems, Inc.

Moving From a Traditional Database to a NoSQL Database

David Gorbet, VP of Engineering, MarkLogic

B105: Driving Performance in the Data-Driven Enterprise

4:15 p.m. - 5:00 p.m.

Today, it’s all about the database. Optimizing the performance of your database can translate to significant business advantage, while poorly planned changes and misconfigurations can result in availability and performance snafus, or worse, unplanned outages. This session shows you how to build a database monitoring tool for free with some SQL scripts and shell scripts. Learn how to monitor sessions, disk performance, SQL performance, DB tablespace information, concurrent programs, and object statistics—all using one single script.

Building a Database Monitoring Tool Using Splunk for Free

Chuck Ezell, Practice Lead of Integration & Development Services, Development, Tuning & Automation, Datavail

Creating a Data-Driven Organization: The Importance of Visualization

Robert Moss, Optimity Advisors

Vipul Parekh, Manager, Optimity Advisors

NETWORKING RECEPTION in the Data Solutions Showcase

5:00 p.m. - 6:00 p.m.

Track C — Big Data in the Cloud

Moderator: Alexis Kolak, Education Manager, IOUG

C101: Blueprint for Architecting Big Data Analytics Solutions

10:45 a.m. - 11:45 a.m.

Big Data Analytics applications are widespread and, given the massive amounts of information now available, are expected to grow exponentially. Big Data application architecture is challenging, but rewards can be game-changing. This session helps you successfully architect Big Data Analytics applications to support your Big Data strategy.

Sunil Ranka, Technical Director - Big Data / Business Intelligence, Teksystem Global Systems

C102: An Introduction to Oracle Enterprise Metadata Manager

12:00 p.m. - 12:45 p.m.

As technology stacks expand and transform over time, one constant shines bright—the need for metadata. Metadata manager tools have come and gone over the years, but they always felt incomplete. The newly released Oracle Enterprise Metadata Manager is the world's first complete metadata tool. This session shows how the tool deals with the standard features of data lineage, impact analysis, and version management applied across a myriad of Oracle and non-Oracle technologies and combines them into a consistent metadata whole, including Oracle Database, Oracle Data Integrator, Oracle Business Intelligence, and Hadoop.

Stewart Bryson, Red Pill Analytics

ATTENDEE LUNCH in the Data Solutions Showcase

12:45 p.m. - 2:00 p.m.

C103: Integration of a SQL Database Oracle and a NoSQL Database Cassandra for a High Data Volume Big Data Application

2:00 p.m. - 2:45 p.m.

Oracle is a strong relational database management system that serves well the use cases for which it was originally designed, such as OLTP applications. But for applications with big/fast data processing requirements, it makes both business and technical sense to have an integrated database system. Cassandra is a non-ACID, NoSQL key-value pair database that uses a sharding approach for horizontal scalability and is built on guiding principles that are opposite to those guiding the Oracle database. This session discusses how Apex Supply Chain Technologies built an integrated POC database system for a transformative supply chain cloud-based

Rao S Kasinadhuni, Database Technologist, Vice President, JPMorgan

COFFEE BREAK in the Data Solutions Showcase

2:45 p.m. - 3:15 p.m.

C104: Data Warehousing with Hadoop and Integration with OBIEE 11g Leveraging Oracle Big Data Discovery (BDD) and Big Data SQL

3:15 p.m. - 4:00 p.m.

Increasing interest in using Hadoop for data management, transformation, and analysis has led to significant development efforts by companies to enhance and extend the Hadoop framework and offer a range of different Hadoop solutions. Many of these solutions can be used to enhance and extend the current BI and data warehousing environment. However, architecting such a system requires an understanding of not just Hadoop but also a broad range of related projects in the Hadoop ecosystem, such as Flume, Hive, Impala, Oozie, and Sqoop. It also requires an understanding of the existing Enterprise Data Warehousing systems and, more importantly, of how they integrate with various tools in the Hadoop ecosystem. The newest release of Oracle Business Intelligence 11.1.1.7 shows Oracle's continued efforts to integrate its Oracle Business Intelligence Platform with big data technologies such as Hadoop and Hive.

Pravin Patil, Practice Director, BI/EPM, Apps Associates LLC

C105: Analytics: Traditional Versus Big Data

4:15 p.m. - 5:00 p.m.

Big Data insight is getting tremendous attention across all organizations, but data mining and analytics are not new methodologies. This presentation emphasizes how Big Data technologies are helping data discovery and turning information and knowledge into wisdom. It not only helps you understand different types of analytic techniques using examples, but also explains how Hadoop is contributing to the success of analytics.

Anand Rai, DBA, Verizon Wireless

NETWORKING RECEPTION in the Data Solutions Showcase

5:00 p.m. - 6:00 p.m.

Hadoop Day

Moderator: Lindy Ryan, Professor & Research Faculty, Montclair State University; Rutgers University

H101: The Current State of Hadoop

10:45 a.m. - 11:45 a.m.

Apache Hadoop has become the predominant Big Data platform for storing and analyzing data. Companies use Hadoop to get value and gain competitive differentiation from their ever-increasing wealth of data. Knowing where and how to start exploring Hadoop's rich set of tools is a “Big Data” challenge of its own. Learn the key differences between the most popular Hadoop distributions so you can start using Hadoop today.

The Hadoop Ecosystem

James Casaletto, Principal Solutions Architect, Professional Services, MapR

Hadoop: Whose to Choose

David Teplow, Founder & CEO, Integra Technology Consulting

H102: Hadoop and Your Data Warehouse

12:00 p.m. - 12:45 p.m.

Elliott Cordo shares real-world insights across a range of topics, including the evolving best practices for building a data warehouse on Hadoop that also coexists with multiple processing frameworks and additional non-Hadoop storage platforms, the place for massively parallel-processing and relational databases in analytic architectures, and the ways in which the cloud offers the ability to quickly and cost-effectively establish a scalable platform for your Big Data warehouse.

Building a Real-World Data Warehouse

Elliott Cordo, Founder/Builder, Data Futures, LLC

Snowflake and Data Warehouses

Greg Rahn, Director of Product Management, Snowflake Computing

ATTENDEE LUNCH in the Data Solutions Showcase

12:45 p.m. - 2:00 p.m.

H103: Hadoop in the Cloud

2:00 p.m. - 2:45 p.m.

To get your Big Data job done right, you need to use the right Big Data tools. How can you make sure you are leveraging the right tools? Learn from Ben Sgro about how Simulmedia, a pioneer in audience-based advertising on TV, is using a custom Python framework to programmatically create EMR clusters, move data to and from Amazon Simple Storage Service, and load data into its Redshift data warehouse. Xplenty’s Yaniv Mor talks about how using Hadoop in a coding-free, cloud-based environment ensures that businesses can benefit from Big Data without having to invest in hardware, software, or related personnel.

Python and EMR for MapReduce ETLs in the Cloud

Ben Sgro, Director of Data Engineering, Simulmedia

Offloading Data Integration/ETL to the Cloud (Using Hadoop)

Yaniv Mor, Founder & CEO, Xplenty

COFFEE BREAK in the Data Solutions Showcase

2:45 p.m. - 3:15 p.m.

H104: Harnessing the Hadoop Ecosystem

3:15 p.m. - 4:00 p.m.

Big Data is transforming how companies analyze information and enabling them to connect with customers in ways never possible before. Radius, which provides companies with a real-time marketing intelligence platform, is moving its core infrastructure from Hadoop to Spark. Hear Spotright’s Nathan Halko talk about his experiences moving from Hadoop to Spark. Qubole’s Jason Huang provides an overview of Apache Hive, the key differences between Hive and traditional data warehouses built on top of RDBMSs, and key techniques to increase performance and simplify Hive.

Moving From Hadoop to Spark: The Business Case

Nathan P Halko, Data Scientist, Spotright

Deep Dive Into Apache Hive

Jason Huang, Senior Solutions Architect, Qubole

H105: Panel Discussion: The Data Lake: From Hype to Reality

4:15 p.m. - 5:00 p.m.

There has been a lot of hype around data lakes and their relevance to Big Data challenges. The data lake approach is being championed by some as a way to realize the promise of Big Data, allowing organizations to move data in its raw form into a central storage reservoir until it is needed. There has also been much scrutiny in the marketplace over the potential pitfalls of data lakes. To find out what you need to know before you dive into the data lake, join Venkat Eswara of GE, Joe Caserta of Caserta Concepts, and George Corugedo of RedPoint Global for a lively panel discussion about using Hadoop to create a centralized processing pool where data is captured, cleansed, linked, and structured in a consistent way.

Joe Caserta, Founding President, Caserta

George Corugedo, CTO, RedPoint Global Inc.