Building a Hadoop-Based Enterprise Data Hub for the Credit and Identity Risk Market
ID Analytics will cover highlights from their multi-year journey to solve big data challenges in the credit and identity risk market. Join Bob Warner, Director of Engineering, and Scott Saufferer...
Going Beyond the Data Lake: How to Leverage the Entire Analytics Ecosystem
The concept of establishing a data lake for big data has been touted as the “next big thing” that organizations need to embrace...
Genome Analysis Pipelines, Big Data Style
Powerful new tools exist for processing large volumes of data quickly across a cluster of networked computers. Typical bioinformatics workflow requirements are well-matched to these tools' capabilities...
Agenda: April 12, 2016
Hyatt Regency La Jolla, San Diego, CA 92122
|8:00am-9:00am||Registration & Breakfast||
|9:00am-9:15am||Introductions & Welcome||
|9:15am-10:00am||Streaming in the Extreme|Jim Scott, Director, Enterprise Strategy & Architecture, MapR|
|10:00am-10:45am||Build and Operationalize Enterprise Data Lake in Big Enterprises|Alex Garbarini, Information Technology Engineer, Cisco|
|11:00am-11:30am||Going Beyond the Data Lake: How to Leverage the Entire Analytics Ecosystem|Vik Kapoor, Director of Analytics Technology Architecture & Platforms, Pfizer|
|11:30am-12:00pm||Building a Hadoop-Based Enterprise Data Hub for the Credit and Identity Risk Market|Scott Saufferer & Robert Warner, ID Analytics|
|12:00pm-12:30pm||Operationalizing the IoT with Mtell and Apache Spark|Alex Bates, CTO, Mtell|
|12:30pm-1:00pm||Genome Analysis Pipelines, Big Data Style|Allen Day, Chief Scientist, MapR|
|1:00pm-1:30pm||How to Use Design Thinking to Jumpstart Your Big Data Projects|Stefan Groschupf, CEO, Datameer|
Director of Data Operations, ID Analytics
Scott Saufferer is the Director of Data Operations at ID Analytics, where he is responsible for the management and operation of database environments company-wide; the ingress, processing, and egress of all sensitive data assets; and the buildout and operation of multiple Hadoop environments. Before taking on Data Operations, Scott was the Director of Systems Architecture, responsible for the design and implementation of systems infrastructure across the entire company, including production and corporate systems. Earlier in his career, Scott was a Principal Architect for wireless provider Cricket Communications. Before Cricket, he served in various technical and management roles at IT consulting and software companies.
Director of Engineering, ID Analytics
Robert Warner is the Director of Engineering at ID Analytics, where he provides engineering leadership to multiple Scrum teams. He also leads the prototyping and evaluation of new data technologies to be integrated across the company's product line, and leads the product teams on projects that provide core features and enhancements for the product line. Robert joined ID Analytics as a senior software engineer, providing architectural guidance, supporting the product ASP environment, and participating in all aspects of the software development life cycle. Earlier in his career, Robert was a Director at Fair Isaac Corporation (FICO), an analytics software company. He began his career as a senior software engineer at NCR, where he was responsible for Teradata software development.
Principal Data Scientist, MapR
Allen Day is the Principal Data Scientist at MapR Technologies, where he leads interdisciplinary teams to deliver results in fast-paced, high-pressure environments across several industry verticals. Previously, Allen founded TinyTube Networks, which provided the first mobile video discovery and transcoding proxy service, and Ion Flux, which provided a medical-grade, cloud-based human genome sequencing service.
Allen holds a PhD in Human Genetics from the School of Medicine at UCLA. His dissertation project was to create the largest public data warehouse for gene expression data. Key components of the project included operationalizing code beyond the research stage; building and operating a high-performance computing cluster for scale-out; and designing and implementing a schema that supported fast matrix operations for ontology- and graph-based machine learning algorithms. Allen has contributed to a wide variety of open source projects: R (CRAN, Bioconductor), Perl (CPAN, BioPerl), FFmpeg, Cascading, Apache HBase, Apache Storm, and Apache Mahout. He is co-authoring the upcoming O'Reilly title Just Enough Math with Paco Nathan. Overall, his background combines deep technical expertise in data science with a pragmatic understanding of real-world problems. He also pursues interests in linguistics and economics, and, if it hadn’t been obvious, he performs magic.