Loading…
Tuesday, May 13 • 2:00pm - 2:40pm
Technical Deep Dive: Big Data Computations Using Elastic Data Processing in OpenStack Cloud

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

The Sahara project (ex. Savanna), integrated project in Juno under the OpenStack Data Processing program, provides users an ability to provision and manage Hadoop clusters on OpenStack, and has seen a great deal of progress, development, and changes during the Icehouse development cycle. The focus of the project is on two primary use cases: on-demand cluster provisioning and on-demand Hadoop tasks execution (Elastic Data Processing).

This presentation takes an in-depth look at Savanna’s EDP facilities. Since Savanna’s initial release, this key feature has been hardened and expanded to support streaming MapReduce and Java workflows, operation over private Neutron networks and execution on transient clusters. We’ll start with a description of EDP’s general concepts and a definition of terms, then its current status in Savanna, supported Data Sources, Job Types, data locality and the roadmap for the Juno release cycle.

Lastly, we’ll show a live demo of EDP to bring all of these concepts together. The demo will cover job and data source definition, job execution and collection of job results.



Speakers
avatar for Alexander Ignatov

Alexander Ignatov

Senior Software Engineer, Mirantis, Mirantis Inc.
Alexander Ignatov (aignatov on irc) is a Senior Software Engineer at Mirantis. He has expertise in networks, Java, high scalability and distributed systems (Hadoop, HBase, Cassandra). Alexander has been involved in the OpenStack Savanna project since its beginning. He is the main... Read More →
avatar for Sergey Lukjanov

Sergey Lukjanov

Senior Development Manager, Mirantis, Mirantis
TM

Trevor McKay

Principal Software Engineer, Red Hat, Inc.
Trevor McKay is a Principal Software Engineer at Red Hat with a background in distributed computing and big data processing, having worked extensively with Apache Spark on OpenStack and now on Kubernetes. He is passionate about simplifying user experience in general and making analytics... Read More →


Tuesday May 13, 2014 2:00pm - 2:40pm EDT
Room B102

Attendees (0)