DS19 Final Program
FEATURING THESE
SPECIAL EVENTS
KEYNOTE SPEAKERS
Michael Stonebraker
Adjunct Professor,
MIT, & Co-Founder/
CTO, Tamr
Michelle L. Gregory
SVP, Data Science,
Elsevier
John O’Brien
Principal Advisor &
Chief Researcher,
Radiant Advisors
HYATT REGENCY BOSTON
BOSTON, MA
dbta.com/datasummit
DIAMOND
SPONSORS
PLATINUM
SPONSORS
SPONSORS
CONNECT: #DataSummit
UNLEASH THE POWER OF YOUR DATA AT DATA SUMMIT 2019
Welcome to the Data Summit and Cognitive Computing & AI Summit conferences. We’re delighted to
be back in Boston and to greet all of you at the conferences.
At Data Summit 2019, you’ll hear speakers give their practical experiences within these six tracks:
Moving to a Modern Data Architecture, Competing on Analytics, Data Lake Boot Camp, Building the
Data-Driven Future, Digital Transformation, and DataOps Boot Camp. At the Cognitive Computing & AI
Summit, our speakers address the important issues facing these new technologies. I particularly call
your attention to the panel discussion on Wednesday about ethics.
This year we’ve added a session called Data and Donuts (yes, there will be donuts!) to get your
Wednesday off to a sweet start. I encourage you to network with other attendees and chat further with
speakers. Visit the exhibit area to see new and exciting products. And don’t forget the reception in the
Data Solutions Showcase.
Marydee Ojala, Conference Program Director
4 #DataSummit dbta.com/datasummit
CONFERENCE
AT-A-GLANCE
MONDAY MAY 20 PRECONFERENCE WORKSHOPS (All workshop rooms are located on the Main Lobby Level.)
TUESDAY MAY 21 (All session rooms are located on the 4th floor.)
9:30 a.m. – 9:45 a.m. SPONSORED KEYNOTE ❚ Grand Ballroom B ❚ Information as Competitor Advantage ❚ Lee Levitt, Oracle
9:45 a.m. – 10:00 a.m. SPONSORED KEYNOTE ❚ Grand Ballroom B ❚ A View From the Front Lines of Data Analytics ❚ Lynda Partner, Pythian
10:00 a.m. – 10:45 a.m. COFFEE BREAK ❚ In the Data Solutions Showcase
TRACK A ❚ Grand Ballroom B ❚ Modern Data Architecture
TRACK B ❚ Duxbury ❚ Competing on Analytics
DATA LAKE boot camp ❚ Dedham
CS sessions ❚ Plymouth
10:45 a.m. – 11:45 a.m.
A101 ❚ Building a Modern Data Architecture
B101 ❚ Taking Your Analytics to the Next Level
C101 ❚ Building a Data Lake for the Enterprise
CS101 ❚ The Rise of Artificial Intelligence
12:00 p.m. – 12:45 p.m.
A102 ❚ The New World of Database Technologies
B102 ❚ Data Science Best Practices
C102 ❚ Data Discovery in Data Lakes
CS102 ❚ Machine Learning in the Real World
12:45 p.m. – 2:00 p.m. ATTENDEE LUNCH ❚ In the Data Solutions Showcase
2:00 p.m. – 2:45 p.m.
A103 ❚ Understanding Cloud Licensing
B103 ❚ Analytics in Action
C103 ❚ PANEL: Data Lakes: Challenges and Opportunities
CS103 ❚ AI in Action
2:45 p.m. – 3:15 p.m. COFFEE BREAK ❚ In the Data Solutions Showcase
3:15 p.m. – 4:00 p.m.
A104 ❚ Overcoming Big Data Integration Challenges
B104 ❚ Delivering Trusted Data
C104 ❚ Data Lakes in Action
CS104 ❚ AI Success Factors
4:15 p.m. – 5:00 p.m.
A105 ❚ Securing the Internet of Things
B105 ❚ Everyday Chaos
C105 ❚ Frameworks for the Future
CS105 ❚ Exploring Machine Learning
5:00 p.m. – 6:00 p.m. NETWORKING RECEPTION ❚ In the Data Solutions Showcase
WEDNESDAY MAY 22 (All session rooms are located on the 4th floor.)
8:00 a.m. – 8:45 a.m. DATA & DONUTS PRESENTATION ❚ Grand Ballroom B ❚ Paul Wolmering, Actian Corp.
9:00 a.m. – 9:45 a.m. OPENING KEYNOTE ❚ Grand Ballroom B ❚ Digital Transformation Is Business Transformation ❚ Michelle L. Gregory, SVP, Data Science, Elsevier
9:45 a.m. – 10:00 a.m. SPONSORED KEYNOTE ❚ Grand Ballroom B ❚ The Evolution of Big Data Analytics ❚ Matthew Deyette, Gemini Data, Inc.
10:00 a.m. – 10:45 a.m. COFFEE BREAK ❚ In the Data Solutions Showcase
TRACK A ❚ Grand Ballroom B ❚ Building the Data-Driven Future
TRACK B ❚ Duxbury ❚ Digital Transformation
DATAOPS boot camp ❚ Dedham
CS sessions ❚ Plymouth
10:45 a.m. – 11:30 a.m.
A201 ❚ Winning With a Modern Data Strategy
B201 ❚ Achieving a 360-Degree Customer View
C201 ❚ Succeeding With DataOps Today
CS201 ❚ Machine Learning Best Practices
11:45 a.m. – 12:30 p.m.
A202 ❚ Supporting Modern Applications
B202 ❚ Digital Transformation in the Real World
C202 ❚ The Rise of Containers
CS202 ❚ Diving Into Deep Learning
12:30 p.m. – 2:00 p.m. ATTENDEE LUNCH ❚ In the Data Solutions Showcase
2:00 p.m. – 2:45 p.m.
A203 ❚ Designing for Speed & Scalability
B203 ❚ Tapping Into New Data Sources for Business Value
C203 ❚ Operationalizing Big Data Workloads
CS203 ❚ AI Use Cases Today
3:00 p.m. – 3:45 p.m.
A204 ❚ The Rise of Knowledge Graphs
B204 ❚ Emerging Applications for Blockchain
C204 ❚ Unlocking the Power of Data Wrangling
CS204 ❚ PANEL: Cognitive Computing
GENERAL INFORMATION
LOCATION KEY
All rooms are located on the 4th Floor.
Keynotes ■ Grand Ballroom B
Track A ■ Grand Ballroom B
Track B ■ Duxbury
DATA LAKE boot camp (Tuesday, May 21) ■ Dedham
DATAOPS boot camp (Wednesday, May 22) ■ Dedham
■ Plymouth
WI-FI
Complimentary Wi-Fi will be available in conference areas during conference hours.
Network: datasummit
Username/Password: ds2019
NETWORKING AT DATA SUMMIT
Data Summit 2019 offers wonderful opportunities to get acquainted with experts and your peers in the field. Learn from their experience and share your successes and challenges.
Networking Reception in the Showcase
Tuesday, May 21 ■ 5:00 p.m. – 6:00 p.m.
Data & Donuts presented by
Wednesday, May 22 ■ 8:00 a.m. – 8:45 a.m.
Continental Breakfasts, Breaks, & Lunches
Check the schedule for times and locations.
Twitter ■ #DataSummit #DataLakeBC #DataOpsBC #CCAISummit
LinkedIn ■ http://bit.ly/DBTA_LI
DATA SOLUTIONS SHOWCASE
A vibrant marketplace for data and information management companies, the Data Solutions Showcase features the top companies in the industry and offers attendees the opportunity to explore new developments in product and service solutions. The Showcase is an excellent place to look for a particular product, evaluate competing systems, and keep up with the latest trends. Located in Adrienne Salon/Grand Ballroom A.
SHOWCASE HOURS
Tuesday, May 21 ■ 10:00 a.m. – 6:00 p.m.
Networking Reception ■ 5:00 p.m. – 6:00 p.m.
Wednesday, May 22 ■ 10:00 a.m. – 2:00 p.m.
CONFERENCE PRESENTATIONS
Many speakers have made their presentations available for download at dbta.com/datasummit/2019/presentations.aspx
MONDAY
MAY 20
PRECONFERENCE WORKSHOPS (All workshop rooms are located on the Main Lobby Level.)
Preconference workshops are practical and hands-on. Please bring a laptop with you so you can participate in the exercises.
Laptops will not be provided on site.
TUESDAY
MAY 21
TRACK A ❚ Grand Ballroom B
Moving to a Modern Data Architecture
MODERATOR: John O’Brien, CEO & Principal Advisor, Radiant Advisors
8:00 a.m. – 8:45 a.m.
CONTINENTAL BREAKFAST
8:45 a.m. – 9:30 a.m.
lytics, NoSQL, IoT, in-memory, and DevOps and examines what is happening with DBAs and their roles within modern organizations. Mullins backs up the trends with references and links where appropriate.
12:45 p.m. – 2:00 p.m.
ATTENDEE LUNCH in the Data Solutions Showcase
2:00 p.m. – 2:45 p.m.
A103 ❚ Understanding Cloud Licensing
It was hard enough to manage IT infrastructures when everything was on-premises only. But today, with combined on-premises deployments, SaaS, and hybrid cloud scenarios, there is uncertainty about the proper way to license software in these very complex environments.
Straight Talk on the Cloud License Landscape
Michael Corey, Co-Founder, LicenseFortress
Don Sullivan, VMware Product Line Manager, Business Critical Applications
Keeping software in compliance is a more significant challenge today than ever before. Sorting through all the FUD (fear, uncertainty, doubt) and getting straight answers from the vendors on the proper way to license software in this complicated world is nearly impossible. Making matters worse is the fact that many software vendors have turned to software license audits as an easy way to generate additional revenues. This session covers current software licensing trends, important lessons learned from the real world, and the steps every organization should take now to avoid becoming a victim of a software license audit whose real purpose is to generate revenue.
2:45 p.m. – 3:15 p.m.
COFFEE BREAK in the Data Solutions Showcase
3:15 p.m. – 4:00 p.m.
A104 ❚ Overcoming Big Data Integration Challenges
Data is flowing into organizations from a previously unimaginable array of sources and at unprecedented speed and volume. This means that the challenges of cleaning, deduplicating, and integrating data are increasing.
Dismantling Data Silos Through Cloud Integration
Danil Zburivsky, Director Data Engineering, The Pythian Group
A cloud-native data platform may be the best way for organizations to cost-effectively deliver on the promise of better insights and more intelligent systems through data. Danil Zburivsky covers how a cloud integration approach can lead to better data governance and more accurate analysis and ensure consistency of data across systems, as well as the best practices for cloud data integration and how a cloud data platform breaks down data silos within the organization. The presentation also looks at how one client successfully took its global sales data to the cloud to uncover new opportunities.
Diving Under the Hood of Actian Avalanche, a Gen III Cloud Data Warehouse
Paul Wolmering, VP, Worldwide Sales Engineering, Actian Corp.
From the perspective of an experienced engineering thought leader, Paul Wolmering delivers a deep dive into Actian’s newly launched Gen III cloud data warehouse. Learn about key considerations for building a fully managed, multi-cloud data warehouse with federated query capabilities that’s built for hybrid data. Understand key success factors for migrating to columnar analytics to gain actionable insights from an operational data warehouse. Learn what it takes to deliver insights from real-time data economically and at scale with hybrid data regardless of location, whether in the cloud, on-premises, or both.
4:15 p.m. – 5:00 p.m.
A105 ❚ Securing the Internet of Things
There are significant benefits offered by IoT, but also new threats and dangers. Do we really understand the challenges posed by all these connected “things”?
The Dark Side of the Internet of Things
Jeff Crume, Distinguished Engineer, IT Security Architect, IBM Master Inventor
With the Internet of Things (IoT), essentially everything becomes a computer. This means that everything can be hacked, including cars, home appliances, medical devices, and more. This presentation provides examples of IoT hacks and the consequences of not getting security right as we move forward in the world of smart and connected machines.
5:00 p.m. – 6:00 p.m.
NETWORKING RECEPTION in the Data Solutions Showcase
TRACK B ❚ Duxbury
Competing on Analytics
MODERATOR: Lindy Ryan, Professor & Research Faculty, Montclair State University and Rutgers University
10:45 a.m. – 11:45 a.m.
B101 ❚ Taking Your Analytics to the Next Level
AI and Big Data offer seemingly unlimited potential for organizations to better understand their customers, make more informed decisions, and address challenges with greater agility. It’s important to understand the choices available to achieve the best outcomes.
Applied Analytics: From BI to AI
Kimberly Nevala, Strategic Advisor, SAS
The intersection of AI and Big Data provides the ability to deliver more targeted, timely, relevant insight in a pervasive and intuitive manner. However, delivering that simplicity requires an analytics and data ecosystem that is markedly more complicated than 10 years ago. To that end, effectively deploying analytics from BI to AI is now an exercise in portfolio management, complete with discrete customer segments, diverse data environments, development methods, and a wide spectrum of deployment options. This session puts the diverse and growing landscape of analytics capabilities from BI to AI into context.
How to Build Data Science Teams that Deliver Business Value
Ganes Kesari, Co-Founder & Head, Analytics, Gramener Inc.
In spite of the buzz around AI, organizations are struggling to build data science teams that deliver value on the ground. This talk presents the three distinctive phases of growth for data science teams, highlighting potential challenges and suggesting a standard framework of guidelines to successfully navigate this evolution. Vastly different approaches are needed in each stage of maturity to tackle aspects such as strategic direction, project framework, the mix of skills, hiring strategies, and fostering of a data culture.
12:00 p.m. – 12:45 p.m.
B102 ❚ Data Science Best Practices
Emerging technologies such as AI, IoT, and machine learning are changing what is knowable about customers. At the same time, the frequency of data misuse is leading government entities and individuals to demand higher standards of accountability.
Ethics, Data Ownership, & Privacy in Data Science
Anne Buff, Strategic Advisor, SAS
This presentation explores the issues around modernizing security and governance, as well as what it means to deliver transparency and what users actually expect. It also covers the need to manage accountability within systems of multiple decision-makers; why it is necessary to build fairness into the system to overcome bias and discrimination and to enable diversity; and the need to address expectations of privacy and appropriate use of data.
Accelerating Analytics in a New Era of Data
Dan Leichner, CMO, SQream
Due to exponentially growing data stores, organizations today are facing slowdowns and bottlenecks at peak processing times, with queries taking hours or days. Some complex queries simply cannot be executed. Data often requires tedious and time-consuming preparation before queries can be run. This session demonstrates how the power of GPUs can help conquer these challenges, enabling data professionals to rapidly analyze more data on more dimensions for previously unobtainable business insights.
12:45 p.m. – 2:00 p.m.
ATTENDEE LUNCH in the Data Solutions Showcase
2:00 p.m. – 2:45 p.m.
B103 ❚ Analytics in Action
Organizations in all industries are under pressure to take advantage of Big Data and newer data sources for real-time decision making in mission-critical environments. New technologies provide opportunities to gain insight into the future.
Fannie Mae’s Journey to a Data-Driven Organization
Badal Shah, Director, Development, Fannie Mae
How does an organization evolve from an application-centric to a data-driven enterprise? This presentation covers how Fannie Mae embarked on a major transformation journey to modernize its data infrastructure, transitioning from legacy data platforms to more integrated and scalable architecture to capitalize on the growing opportunities of the analytics economy and generate substantial business value, internally and externally.
Riding the Waves of Big Data Disruption: Machine Learning, Cloud Analytics, IoT, & More
Paige Roberts, Open Source Relations Manager, Microfocus | Vertica
As Big Data grows and evolves, your enterprise faces both challenges and market-disrupting opportunities to analyze and manage larger data volumes for business value. But with seemingly endless commercial, open source, and "as-a-service" offerings hitting the market each week, how do you choose the right mix of technologies and avoid creating an accidental architecture that will limit you from future innovation? How are organizations actually achieving true bottom-line benefits from their Big Data initiatives? Learn how to adopt an effective and agile approach to Big Data analytics.
2:45 p.m. – 3:15 p.m.
COFFEE BREAK in the Data Solutions Showcase
3:15 p.m. – 4:00 p.m.
B104 ❚ Delivering Trusted Data
With the vast quantities of data flowing into organizations, the job of cleansing and validating data is only becoming more difficult. In order to gain the kind of insights and outcomes that organizations seek, new processes and technologies must be deployed.
Flipping the 80/20 Rule of Data Prep and Analysis
Robin Rappaport, Senior Operations Research Analyst, IRS–RAAS (Research, Applied Analytics, and Statistics)
The IRS Compliance Data Warehouse (CDW) is an analytical data warehouse used for research purposes. It empowers researchers to spend more time on analytics and less on data wrangling. To ensure all data is loaded properly, consistent, well-thought-out validation steps must be included in the ETL process. This presentation offers a case study of accomplishments and lessons learned (since FY 2016), including the data quality issues identified by CDW users (data stewards), and takeaways for attendees on how to improve decision making.
4:15 p.m. – 5:00 p.m.
B105 ❚ Everyday Chaos
Best-selling author David Weinberger previews his new book on everyday chaos.
How Machine Learning Is Changing the Future as a Fact and as an Idea
David Weinberger, Senior Researcher, Harvard’s Berkman Center for Internet & Society
Interviewed by Hadley Reynolds, Co-Founder, Cognitive Computing Consortium
Ultimately, machine learning’s most important effect may not be in the benefits its use brings, but how it is implicitly transforming our understanding of how the world works and our most basic strategies for dealing with the future. From Newton on through the Computer Age, we have assumed that the universe is ruled by a relative handful of laws that are the same everywhere and that are simple enough for us to understand. But machine learning shows us a world of motes of data in networks so dense with connections and so delicately balanced, we sometimes can’t understand them. This sort of model of the world is changing not only our strategies, but our moral sense, our ideas about meaning, and even what makes humans special.
5:00 p.m. – 6:00 p.m.
NETWORKING RECEPTION in the Data Solutions Showcase
GET MOBILE!
Enter URL: m.dbta.com
Lighten your load with the Data Summit mobile program. Get easy access to everything you need during the event, anytime you need it. No download required!
From centralized data acquisition and offloading to data
discovery and data science projects, data lakes are on the rise
at enterprises today. Data Lake Boot Camp offers attendees a
deep dive into the latest supporting technologies, best practices,
real-world success factors and expert insights.
All sessions are located in the Dedham room, 4th floor unless otherwise noted.
8:00 a.m. – 8:45 a.m.
CONTINENTAL BREAKFAST
8:45 a.m. – 9:30 a.m.
WELCOME & KEYNOTE ❚ Grand Ballroom B
Big Data, Technological Disruption, and the 800-Pound Gorilla in the Corner
Michael Stonebraker, Adjunct Professor, MIT, & Co-Founder/CTO, Tamr
Stonebraker focuses on the current market for Big Data products, specifically those that deal with one or more of “the 3 V’s.” On the one hand, the Volume problem for business intelligence applications is pretty well solved by data warehouse vendors. However, upcoming data science tasks are poorly supported at present. On the other hand, there is rapid technological progress, so we need to stay tuned. In the Velocity arena, recent “new SQL” and stream processing products are doing a good job, albeit with some storm clouds on the horizon. The Variety space has a collection of mature products, along with considerable innovation from startups. He identifies opportunities, particularly those enabled by possible disruption from new technology. And then there’s that 800-pound gorilla in the corner.
9:30 a.m. – 9:45 a.m.
SPONSORED KEYNOTE ❚ Grand Ballroom B
Information as Competitor Advantage
Lee Levitt, Business Strategist, Oracle
While organizations have dramatically more data readily available, few are leveraging this data for true competitive advantage. McKinsey found that data-centric companies are driving a 9-fold increase in customer loyalty and an almost 20-fold increase in customer profitability. As data and analytics leaders, you have the opportunity to drive organizational focus on identifying, curating, and leveraging valuable data to support better strategic decision making. Levitt shares information management frameworks and anecdotes that highlight the value of thinking outside the box, discusses critical success factors, and recommends specific actions to improve your organization's competitive advantage.
9:45 a.m. – 10:00 a.m.
SPONSORED KEYNOTE ❚ Grand Ballroom B
A Big Data Reality Check: A View From the Front Lines of Data Analytics
Lynda Partner, VP, Analytics, Pythian
Lynda Partner and her team of Big Data and analytics professionals work to solve the toughest data challenges for their clients. As veterans in the continually evolving Big Data space, her team has been helping clients break down their data silos by bringing together data from disparate sources and enabling use cases from BI to ML. When it comes to Big Data, they’ve seen it all. In this session, you hear how data professionals like you are dealing with the most challenging issues with data and keeping up with innovations. Learn through real-world examples how you too can be ready to address tomorrow’s opportunities with ever-advancing cloud analytics technologies and emerging practices. In other words, learn what’s coming and how you can maneuver today to take advantage.
10:00 a.m. – 10:45 a.m.
COFFEE BREAK in the Data Solutions Showcase
10:45 a.m. – 11:45 a.m.
C101 ❚ Building a Data Lake for the Enterprise
A new data platform approach is needed to extend the data warehouse and address the vast quantity and variety of data flowing into organizations, much of it unstructured.
The Data Warehouse Is Dead
Lynda Partner, VP, Marketing and Analytics as a Service, The Pythian Group
The data warehouse is experiencing pressure from increasing data volumes, more users, and tight budgets: a triple threat to its ongoing existence and value. In addition, new data types are coming into play. This increased pressure means the old-school data warehouse may not be delivering insights at the speed of business. There are a number of alternatives to meet modern analytics infrastructure needs. This presentation outlines in detail why a modern data platform is required to deliver on new analytics demands.
Exploiting Enterprise Data for Transformational Projects
Sean Martin, CTO, Cambridge Semantics
In a data fabric, the data discovery and integration layer maps all enterprise data in its original business context so that users can find and blend data from diverse siloed sources into analytic-ready datasets on an on-demand basis. Join Sean Martin to hear how companies are using data discovery and integration solutions to exploit enterprise data for transformational analytic and machine learning projects.
12:00 p.m. – 12:45 p.m.
C102 ❚ Data Discovery in Data Lakes
With the abundance of data stored in data lakes, finding the relevant information is increasingly challenging, particularly in light of the many formats in which the data appears.
Data Discovery, Selection, & Provisioning
Subhayan Das, Associate Director, Digital Capability Management
With the realization of the power of data lakes, more and more organizational data in various formats and standards are being made available there. Given this plethora of information, it is becoming increasingly daunting for users to search for the data of interest to them with the use of conventional data analytical tools. A combination of data discovery tools, making use of semantic search and concept search, brings in the right blend of capability, enabling "comparison shopping" between seemingly similar datasets and allowing end users to evaluate the best fit while facilitating the discovery and reuse of all available information and data assets, both internal and external.
12:45 p.m. – 2:00 p.m.
ATTENDEE LUNCH in the Data Solutions Showcase
ideally suited for transmittal and display, not as data for analysis. By leveraging the structure inherent in scholarly published XML content, we can create structured data in RDF for analysis. Further, we can link out to external resources as linked data to further enrich the content-as-data assets and provide a rich analytical environment.
4:00 p.m. – 5:00 p.m.
CLOSING KEYNOTE ❚ Grand Ballroom B
Bring It Home: How to Advance Your Analytics Strategies
John O’Brien, Principal Advisor & Chief Researcher, Radiant Advisors
Companies are not lacking in technology options; in most cases, more advanced technologies exist than can be absorbed into the organization all at once. As data and analytics leaders, you need to determine how to make the biggest business impacts with advances in AI and ML for analytics, enable self-service with governance, and support BI and data engineering processes with the data lake. To establish this solid enterprise foundation, it is essential to recommit to data management principles and prioritize platform technologies and ecosystems accordingly. In his closing keynote, O’Brien shares a clear path forward based on the evolving concepts in data and analytics within the context of an enterprise-scale data strategy.
TRACK B ❚ Duxbury
Digital Transformation
MODERATOR: Lindy Ryan, Professor & Research Faculty, Montclair State University and Rutgers University
10:45 a.m. – 11:30 a.m.
B201 ❚ Achieving a 360-Degree Customer View
The holy grail of marketers is to attain a 360-degree view of customers in order to increase brand loyalty and understand preferences and purchases for a more personalized approach. Technologies are available to help make that goal a reality.
User Behavior Data Analysis in R & Python
Babak Khosravifar, Data Analyst/Scientist, Square Enix
Brand and marketing teams often have questions about how to segment and classify users based on various attributes. This is used to identify cohorts and consequently run campaigns with strategic plans. This presentation shows how a data analyst can collect relevant data and make sense of the analysis output.
Enabling Robust Integration Strategies
Jay Benedetti, Global Solutions Director, CloverDX
If you take a step back, a 360-degree view begins with a successful integration strategy, and a really good one! In this talk, Benedetti shares two different stories, one in B2B and another in B2C, on how CloverDX helped enable customers to achieve a 360-degree view without too much of an investment and left them empowered to bring in more data as they grow in the future.
tions and across the industry, companies are increasingly moving toward leveraging data for business decisions and analytics.
Chasing Real-Time Decision-Making With NoSQL at Scale
Ken Bakunas, NoSQL Data Architect, Wayfair
Wayfair, one of the world's leading home furnishing platforms, has undergone immense and rapid growth, analyzing data all along the way. The successful creation of a retail holiday pushed its systems to the limit. "Way Day," as it's affectionately known, was full of highs and lows and showed the team it needed to transform not only to the cloud but also to a high-performance, low-latency database. Wayfair lives and breathes on data-driven decision making that impacts the entire customer experience. Data science has been the core driver of Wayfair's success. The Aerospike NoSQL database enables a highly scalable, fault-tolerant, and performance-driven environment for customer intelligence, product recommendations, and real-time marketing.
12:30 p.m. – 2:00 p.m.
ATTENDEE LUNCH in the Data Solutions Showcase
2:00 p.m. – 2:45 p.m.
B203 ❚ Tapping Into New Data Sources for Business Value
With the rise of Big Data, IoT, and AI, useful sources of data are emerging and new opportunities are being created.
Mind the Gap: How Location Data Connects Consumers’ Online & Offline Journeys
Mark Coffey, SVP, Strategic Partnerships, GasBuddy
Location data is critical to closing the gap between the online and offline world. By leveraging location data, marketers can create impactful moments to influence the behavior of the consumer. This session reveals how location data is the cookie for the real world and how it empowers marketers to run unique campaigns to drive real results in a highly measured way.
3:00 p.m. – 3:45 p.m.
B204 ❚ Emerging Applications for Blockchain
Blockchain, the distributed ledger technology, is expected to impact a diverse range of industries, from agriculture to accounting to healthcare. From guaranteeing the authenticity of products to safeguarding transactions, blockchain holds wide-ranging potential.
WEDNESDAY
MAY 22
Blockchain-Based Anti-Counterfeit & Identity Solutions
Arnab Banerjee, Principal Consultant, Infosys
Blockchain technology has emerged as an innovative and easy-to-adopt approach to improve anti-counterfeit measures in different industries and delivers a significant positive social impact. It offers a transparent environment where it is impossible to duplicate products and there is no need to rely on trust alone. This presentation shows how to use a blockchain with smart contracts to track products at every step of the production and sales process and make this information available to anyone.
4:00 p.m. – 5:00 p.m.
CLOSING KEYNOTE ❚ Grand Ballroom B
Bring It Home: How to Advance Your Analytics Strategies
John O’Brien, Principal Advisor & Chief Researcher, Radiant Advisors
The world of data management has changed drastically—from even just a few
years ago. Data lake adoption is on the rise, Spark is moving toward mainstream,
and machine learning is starting to catch on at organizations seeking digital
transformation across industries. All the while, the use of cloud services
continues to grow across use cases and deployment models. Download
the sixth edition of the Big Data Sourcebook today to stay on top of the
latest technologies and strategies in data management and analytics.
DATAOPS boot camp
The world of data management, with its rigid schemas, silos, and manual processes, has historically been at odds with the fast, automated, highly iterative world of DevOps. At DataOps Boot Camp, you hear about the key supporting technologies, strategies, real-world success stories, and how to get started on your DataOps journey.
WEDNESDAY, May 22
MODERATOR:
Julie Langenkamp, Director, Editorial & Content Strategy, Radiant Advisors
All sessions are located in the Dedham room, 4th floor unless otherwise noted.
9:00 a.m. – 9:45 a.m.
OPENING KEYNOTE ❚ Grand Ballroom B
Digital Transformation Is Business Transformation: How to
Incorporate AI Technology Into a 130-Year-Old Company
Michelle L. Gregory, SVP, Data Science, Elsevier
Any company that’s been in existence for 130 years has collected a vast
amount of data. When that company is a publishing behemoth like Elsevier,
making that data into actionable information is not simply an exercise in
digital transformation; it has the capacity to transform the entire business.

9:45 a.m. – 10:00 a.m.
SPONSORED KEYNOTE ❚ Grand Ballroom B
The Evolution of Big Data Analytics
Matthew Deyette, Chief Customer Officer, Gemini Data, Inc.
We find ourselves continuously copying, transforming, and aggregating data
into various large-scale, complex, proprietary systems for the purpose of
gaining some sort of competitive edge through enhanced analytic capabilities.
Unfortunately, this process ends up exacerbating the problem by making Big
Data "bigger" and making the process of extracting knowledge more difficult
overall. Deyette reviews where we are, how we got here, and what must come
next with regard to leveraging Big Data Analytics.

Five Key Requirements for DataOps Success
Dan Potter, VP, Product Marketing, Attunity
Join this presentation to explore the five key steps necessary to be
successful with DataOps, including the process and cultural shift required.
The discussion also covers the benefits of enabling DataOps’ success, such as
improved productivity, streamlined and automated processes, increased output,
and higher collaboration across teams. Learn how to better manage data flow
across the data lifecycle, from ingestion to provisioning to analytics; derive
tips from use cases involving data lakes, cloud, and data warehousing for
better business insights; and increase collaboration, productivity, and
business value.

How to Succeed With DataOps Today
Christopher P. Bergh, CEO & Founder, DataKitchen
The list of failed Big Data projects is long. They leave end users, data
analysts, and data scientists frustrated with long lead times for changes.
Bergh illustrates how to make changes to Big Data, models, and visualizations
quickly, with high quality, using the tools teams love. Synthesizing
techniques from DevOps, Deming, and direct experience shows you how to succeed
with DataOps today.

11:45 a.m. – 12:30 p.m.
C202 ❚ The Rise of Containers
The use of containers, which enable applications, data, dependencies, and
runtimes to be housed within a portable environment to support greater
flexibility, is becoming more widespread.
Understanding Database Containerization
Jeff Fried, Director of Product Management, &
Joe Carroll, Product Specialist, InterSystems
Container usage is now being adopted by organizations of all
sizes, from small startups to companies with huge, established
microservices platforms. This presentation is aimed at helping
practitioners navigate the minefield of database containerization
and avoid some of the major pitfalls that can occur. It covers
considerations such as container configuration and homogeneous
versus heterogeneous node types; data resilience, resources,
and storage; cluster upscale, downscale, and upgrade; and data
locality and networking.
GET SOCIAL!
Connect with Database Trends and Applications
online and get industry news, trends, and analysis, plus
information on learning opportunities in the field.
TUESDAY, MAY 21 All sessions are located in the Plymouth room unless otherwise noted.
CS102 ❚ Machine Learning in the Real World
The power of machine learning is particularly evident when used
to predict events in the real world.

Session to be announced
For updated information on this presentation, please visit
http://dbta.com/cognitivecomputingsummit.

Exploring Machine Learning on the Google Cloud Platform
Sara Robinson, Developer Advocate, Google
Only 10 years ago, you needed access to extensive academic and computing
resources to make use of machine learning (ML). Fast-forward to today, and
we’ve seen revolutionary changes in the hardware and software that are making
ML accessible for any developer or data scientist. No matter where you are in
ML, Google Cloud Platform has a variety of tools to help you. Sara Robinson
starts with the basics: how to use a pre-trained ML model with one REST API
call. Then she explains how to use your own dataset to customize a pre-trained
model with transfer learning, and how to train and serve it in the cloud with
GCP.

12:45 p.m. – 2:00 p.m.
ATTENDEE LUNCH in the Data Solutions Showcase
24 #CCAISummit dbta.com/cognitivecomputingsummit
A new era of cognitive computing has already begun, and its impact is being felt across industries, from healthcare
and financial services to manufacturing and education. However, building cognitive systems and applications that
can perform specific, humanlike tasks in an intelligent way is far from easy. This one-of-a-kind event is an intense,
2-day immersion into the leading cognitive computing and AI use cases, strategies, and technologies that every
organization should know about. If you are on the front lines of AI and cognitive computing, this summit is for you.
MODERATOR: Seth Earley, CEO, Earley Information Science
WEDNESDAY, MAY 22 All sessions are located in the Plymouth room unless otherwise noted.
SPEAKER DIRECTORY

Ken Bakunas
Wayfair
mbushell@aerospike.com

Arnab Banerjee
Infosys
Arnab_Banerjee08@infosys.com

David Bayer
Cognitive Computing Consortium
dbayer@cognitivecomputingconsortium.com

Jay Benedetti
CloverDX
jay.benedetti@cloverdx.com

Christopher P. Bergh
DataKitchen
joanne@datakitchen.io

Shelly Brown
Booz Allen Hamilton
brown_S-Asst@bah.com

Anne Buff
SAS Institute
anne.marie.buff@gmail.com
@anne_buff

Lakshman Bulusu
Matlen Silver, Qteria
balakshman@gmail.com

Joe Carroll
InterSystems
joe.carroll@intersystems.com

Joe Caserta
Caserta
joe@casertaconcepts.com
@joe_caserta

Danny Chen
Uber
danny.chen@gmail.com

Mark Coffey
GasBuddy
savannah@credpr.com

Steven Cohen
BASIS Technology
scohen@basistech.com

Michael Corey
LicenseFortress
michael@michaelcorey.com
@Michael_Corey

Jeff Crume
IBM Master Inventor
jnquadri@us.ibm.com

Subhayan Das
Bristol-Myers Squibb
subhayan.das@bms.com

Matthew Deyette
Gemini Data, Inc.
matt.deyette@geminidata.com

Seth Earley
Earley Information Science
seth@earley.com
@EarleyInfoSci

Susan E. Feldman
Synthexis
sue@synthexis.com
@susanfeldman

Jeff Fried
Intersystems
jfried@intersystems.com
@jefffried

Michelle L. Gregory
Elsevier
mgregory@elsevier.com

Amy Guarino
Kyndi
amy.guarino@kyndi.com
@amyg44

Jason Hall
Quest Software
jason.hall@quest.com
@jasonfhall

Chelsey H. Hill
Feliciano School of Business, Montclair State University
chh35@drexel.edu

Omkar Joshi
Uber
omkar@uber.com

Padmesh Kankipati
Florida Blue
padmesh.kankipati@bcbsfl.com

Bob Kasenchak
Access Innovations, Inc., USA
bob_kasenchak@accessinn.com
@taxobob

Ganes Kesari
Gramener Inc.
ganes.kesari@gramener.com
@kesaritweets

Babak Khosravifar
Square Enix
bkhosravifar@square-emix-montreal.com

Oleg Kondrashov
EnCata Soft
social.encatasoft@gmail.com

Julie Langenkamp
Radiant Advisors
julie.langenkamp@radiantadvisors.com

David Leichner
SQream
sarah@sqreamtech.com

Lee Levitt
Oracle
lee.levitt@oracle.com

Mark Marinelli
Tamr
mark.marinelli@tamr.com

Sean Martin
Cambridge Semantics
sean@cambridgesemantics.com

Craig S. Mullins
Mullins Consulting, Inc.
craig@craigsmullins.com
@craigmullins

Kimberly Nevala
SAS
kimberly.nevala@sas.com

John O'Brien
Radiant Advisors
john.obrien@radiantadvisors.com
@obrienjw

Lynda Partner
The Pythian Group
slack@pythian.com

Dan Potter
Attunity
dan.potter@attunity.com

Ori Rafael
UpSolver
ori@upsolver.com

Krishnan Raman
LinkedIn
krraman@linkedin.com

Robin Rappaport
IRS–RAAS (Research, Applied Analytics, and Statistics)
robin.rappaport@irs.gov

Polina Reshetova
EastBanc Technologies
preshetova@eastbanctech.com

Hadley Reynolds
Cognitive Computing Consortium
hreynolds@cognitivecomputingconsortium.com

Paige Roberts
Microfocus | Vertica
cornish@microfocus.com

Sara Robinson
Google
thesararobinson@gmail.com
@srobtweets

Wolf Ruzicka
EastBanc Technologies
wruzicka@eastbanctech.com

Lindy Ryan
Montclair State University; Rutgers University
lindymryan@gmail.com

Prakriteswar Santikary
ERT
prakriteswar.santi@ert.com

Badal Shah
Fannie Mae
badal.shah@yahoo.com

Richard Sherman
Athena IT Solutions
rick.sherman@athena-solutions.com
@rpsherman

Don Spaulding
Verizon
donald.spaulding@verizon.com

Michael Stonebraker
MIT
dbtcom@gmail.com

Don Sullivan
VMWare
sullivand@vmware.com

David Weinberger
Harvard's Berkman Center for Internet & Society
david@weinberger.org
@dweinberger

Tom Wilde
Indico
tom@indico.com

Paul Wolmering
Actian Corporation
paul.wolmering@actian.com

Danil Zburivsky
The Pythian Group
slack@pythian.com
SPONSOR DIRECTORY

Actian Corporation
2300 Geng Road, Suite 150
Palo Alto, CA 94303
www.actian.com
Platinum Sponsor
Actian delivers a competitive advantage to thousands of enterprise customers
worldwide through innovative hybrid data management, integration and analytic
solutions, on premises, in the cloud or both. For more, visit our website.

Aerospike, Inc.
2525 E Charleston Road, Suite 201
Mountain View, CA 94043
www.aerospike.com
Platinum Sponsor
The Aerospike enterprise-grade non-relational database helps companies power
mission-critical, strategic operational applications that make digital
transformation possible. Powered by a patented Hybrid Memory Architecture and
autonomic cluster management, Aerospike is well-suited for fraud prevention,
digital payments, recommendation engines, real-time bidding and other
applications that require extreme uptime, performance and scale.

Attunity Inc.
70 Blanchard Road, Suite 2
Burlington, MA 01803
www.attunity.com
Platinum Sponsor
Attunity is a leading provider of data integration and data management
software solutions that enable availability, delivery, and management of data
across enterprise platforms, including databases, data warehouses, data lakes
and the cloud. Learn more at our website.

Cambridge Semantics
1 Beacon Street, 15th Floor
Boston, MA 02108
www.cambridgesemantics.com
Platinum Sponsor
Cambridge Semantics Inc., The Smart Data Company, is a big data management
and enterprise analytics software company that provides a universal semantic
layer to connect and bring meaning to all enterprise data. The company offers
Anzo for enterprise knowledge graphs and integrated analytics and AnzoGraph,
a graph analytics database.

CloverDX
2111 Wilson Blvd., Suite 320
Arlington, VA 22201
www.cloverdx.com
Platinum Sponsor
CloverDX helps companies tackle tough data challenges. For individuals or
small groups that strive to take on a new approach and challenge the status
quo within an organization, the CloverDX Data Integration Platform and our
consultancy services will help deliver tangible, real-world results. See more
at our website.

DataKitchen
One Broadway, 14th Floor
Cambridge, MA 02142
www.datakitchen.io
Platinum Sponsor
For data and analytic team leaders who desire to innovate and struggle to
keep up with customer requests and let embarrassing data errors slip into
production, our software is a DataOps platform that delivers new business
insights by enabling the development and deployment of innovative,
high-quality data analytic pipelines, rapidly.

Experian
53 State Street, Suite 20
Boston, MA 02109
www.experian.com
Gold Sponsor
Experian enables organizations to unlock the power of data. Whether
optimizing data for better customer experiences or preparing data for
improved business intelligence, we empower our clients to manage their
information assets with confidence. We have the data, expertise, and proven
technology to help our customers quickly turn information into insight. To
learn more, visit edq.com.

Gemini Data
300 Drakes Landing Road, #210
Greenbrae, CA 94904
www.geminidata.com
Diamond Keynote Sponsor
Gemini's Autonomous Data Cloud connects human and machine intelligence for
faster, AI-driven analysis. Gemini provides data availability by unifying all
silos without data movement. Led by experts from AppDynamics, Cisco, and
Splunk, Gemini is dedicated to helping global customers leverage AI for a
competitive advantage. Find more information at our website.

Import.io
Building One
12980 Saratoga Avenue, Suite B
Saratoga, CA 95070
www.import.io
Gold Sponsor
Import.io offers a web data integration platform where data identification,
extraction (scraping/crawling), preparation, integration and analysis can
occur in a single environment, providing more data quality and control.
Offered as either SaaS or DaaS/managed service, Import.io delivers web data
directly to enterprises to fuel insights and competitive advantage.

Merrimack College
315 Turnpike Street
North Andover, MA 01845
www.merrimack.edu
Gold Sponsor
Merrimack College’s industry-leading online curriculum is designed to
accelerate skill and credential acquisition to propel you into a high-impact
position in one of the fastest-growing career fields. Learn practical
applications from leading practitioners who teach a curriculum created with
top employers and earn credits toward your Master of Science in Business
Analytics or Data Science.

Oracle
500 Oracle Parkway
Redwood Shores, CA 94065
www.oracle.com
Diamond Keynote Sponsor
The Oracle Cloud offers a complete suite of integrated applications for
sales, service, marketing, human resources, finance, supply chain and
manufacturing, plus highly automated and secure generation 2 Infrastructure
featuring the Oracle Autonomous Database. For more information about Oracle
(NYSE: ORCL), please visit our website.

Pythian Group Inc.
319 McRae Avenue, Suite 700
Ottawa, Ontario, K1Z 0B9, Canada
www.pythian.com
Diamond Research & Keynote Sponsor
Pythian excels at helping businesses around the world use data and the cloud
to transform how they compete and win in the data economy. From cloud
automation to machine learning, Pythian leads the industry with proven
innovative technologies and deep data expertise. For more than 20 years
Pythian has built its reputation by delivering solutions to the toughest data
challenges faster and better than anyone else. Learn more about Pythian and
its global experts at www.pythian.com, follow @Pythian, and find Pythian on
LinkedIn at http://linkd.in/pythian.

Quest Software Inc.
4 Polaris Way
Aliso Viejo, CA 92656
www.quest.com
Platinum Sponsor
Quest provides software solutions for the rapidly changing world of
enterprise IT. We help simplify the challenges caused by data explosion,
cloud expansion, hybrid datacenters, security threats and regulatory
requirements. Our portfolio includes solutions for database management, data
protection, unified endpoint management, identity and access management and
Microsoft platform management.

SQream Technologies
7 World Trade Center, 10th Floor
250 Greenwich Street
New York, NY 10007
sqream.com
Platinum Sponsor
SQream has redefined big data analytics with SQream DB, a complementary SQL
data warehouse harnessing the power of GPU to enable fast, flexible, and
cost-efficient analysis of massive datasets of terabytes to petabytes. SQream
DB significantly reduces query times, minimizes data preparation, and enables
previously unobtainable business intelligence.

Upsolver
640 W. California Avenue, Suite 210
Sunnyvale, CA 94086
www.upsolver.com
Platinum Sponsor
Upsolver is the shortest path from streaming data to data lakes, analytics
and machine learning. Upsolver lets you easily and visually turn event
streams into usable data: ingest data, extract metadata and join with
historical big data in real time. Upsolver's technology adds indexing,
automation and high performance to any cloud storage.

Vertica
5400 Legacy Drive
Plano, TX 75024
www.vertica.com
Platinum Sponsor
Vertica is the fastest, most advanced SQL analytics database, available
on-premise, on Hadoop, and multiple clouds, all delivered via one unified
platform. With tight integration with Hadoop, Kafka, and Spark and built-in
advanced analytics and machine learning, Vertica delivers the highest
performance at extreme scale. Vertica. Built for fast. Built for freedom.

SHOWCASE HOURS
TUESDAY, MAY 21
10:00 a.m. – 6:00 p.m.
Networking Reception
5:00 p.m. – 6:00 p.m.
WEDNESDAY, MAY 22
10:00 a.m. – 2:00 p.m.