IC2E 2014

General Information

Cloud Summit

Call For Contributions

Program

Travel Information

IC2E 2014 Tutorials

Using GENI for Cloud Computing Research (Full Day)
Niky Riga (BBN), Sarah Edwards (BBN)
Grid’5000 for Cloud Infrastructure Experimentation (Full Day)
Simon Delamare (CNRS), Laurent Pouilloux (INRIA)
Open Source Cloud Platforms (Morning)
Salman Baset (IBM Research), Upendra Sharma (IBM Research)
Cloud-Hosted Databases: Technologies, Challenges and Opportunities (Afternoon)
Sherif Sakr (National ICT Australia)

Using GENI for Cloud Computing Research (Full Day)

Niky Riga (BBN), Sarah Edwards (BBN)

This hands-on tutorial will introduce attendees to GENI, a nationwide virtual laboratory for networking and distributed systems research (http://www.geni.net). GENI is funded by the National Science Foundation and is free to use for research and education.

The tutorial will be split into two sessions and it is strongly recommended that attendees attend both sessions. Attendees can attend only the second session as long as they meet the pre-requisites described below.

Session 1 (Morning):

An Introduction to GENI as a Cloud Computing testbed.
This talk will include an overview of GENI and GENI concepts including: a demonstration of an experiment on GENI; a description of the various cloud computing, software defined networking, and wireless resources available to experimenters using GENI; and the current deployment status of GENI. Duration: ~60 minutes
A Hands-on Introduction to Experimentation with GENI.
Attendees will gain an understanding of the GENI experimenter workflow by running a simple experiment on GENI. Duration: ~100 minutes
Pre-requisites for Session 1
- Comfort with Unix CLI, SSH with public private keys
- Bring a laptop for the hands-on portion

Session 2 (Afternoon):

GENI and Cloud Computing.
A presentation about how GENI can be used in cloud computing research through example use cases. This presentation will also touch on collaboration efforts with other testbeds. Duration: ~20 minutes
A Hands-on SDN Experiment with GENI.
Attendees will work through an OpenFlow experiment using GENI resources. We assume no prior OpenFlow knowledge and the tutorial will be adjustable for various levels of SDN understanding. Attendees will also get a chance to write their own OpenFlow controller. Duration: ~120 minutes
Running your own Experiment in GENI.
Advice to experimenters for using GENI for their own research: identifying GENI compute and networking resources best suited for their experiment and guidance on debugging their experiments. A description of available resources and a presentation of further GENI capabilities and tools. Duration: ~20 minutes.
Pre-requisites for Session 2 without Session 1
- Comfort with Unix CLI, SSH with public private keys (required)
- Setup of GENI account, instructions are going to be sent closer to the conference (required)
- Review the Introductory slides and complete the hands-on tutorial of the first session (optional)
- GENIEducation/Resources/IntroToGENI.pptx
- GENIEducation/SampleAssignments/LabZero

Presenters Bios:

Dr. Niky Riga has received her Diploma from NTUA, Greece and her MA and PhD from Boston University. She is a Network Scientist at Raytheon BBN Technologies. Since 2010 she is a member of the GENI Project Office and is responsible for supporting GENI experimenters in integrating and deploying their experiments within the GENI infrastructure and ensuring that the deployment makes the best use of GENI resources. Her research focuses on designing and prototyping pioneering transport services for Mobile Ad-hoc Networks. Dr. Riga has led several tutorials in major networking and system conferences.

Sarah Edwards received her BS in Electrical Engineering from Rice University and her MS from the University of California, San Diego. She is a Scientist at Raytheon BBN Technologies. Since 2010 she has been a member of the GENI Project Office where she helps experimenters and instructors use GENI and where she writes software to help experimenters use GENI more easily. Previously, Ms Edwards spent several years designing experiments to validate the design of large systems.

Grid’5000 for Cloud Infrastructure Experimentation

Simon Delamare (Simon.Delamare@ens-lyon.fr)
Laurent Pouilloux (Laurent.Pouilloux@inria.fr)

This tutorial will present how the Grid’5000 platform can be used to deploy and experiment your own Cloud infrastructure at a large scale. Grid’5000 is a scientific instrument designed to support experiment- driven research in all areas of computer science related to parallel, large-scale or distributed computing and networking. Grid’5000 is supported by French major scientific institutions (INRIA, CNRS, ...). Key features of Grid’5000 platform are ability to conduct large-scale and reproducible experiments, while providing to users a high capacity of customization at every level of computer stack (from hardware to high level applications). Further information on https://www.grid5000.fr

During the morning, the tutorial will show how to get started using Grid’5000, and will describe the virtualization and Cloud-related tools it provides. In the afternoon, the tutorial will present how to perform large-scale deployment of virtual machines and Cloud infrastructures, thanks to reconfigurability capability of Grid’5000.

Session 1 (Morning):

The first part of the morning session will be dedicated to the general Grid’5000 usage. After a quick presentation of the testbed, users will learn how to obtain a Grid’5000 account and connect to the platform by configuring their SSH environment. They will learn how to connect to different Grid’5000 sites and to discover, visualize and reserving Grid’5000 resources. Then the tutorial will present how to get root access on nodes by deploying and customizing their own experimental environment.

The second part of the morning session will present how to use KVM on Grid’5000 nodes production environment. First, attendees will learn how to reserve subnets dedicated to virtual machines. Then, it will be demonstrated how to deploy, run and connect to the virtual machines. Contextualization, and cross-site experiments will also be discussed.

Prerequisites: To follow the morning session, participants should have basic knowledge of Linux command line, SSH and virtualization. A laptop with SSH client is also required

Session 2 (Afternoon):

During the afternoon session, the tutorial will show how to deploy OpenStack in an isolated network. First, people will learn how to reserve an isolated VLAN and to deploy the nodes in this VLAN. Then, automatic tools for OpenStack deployment will be presented and people will learn how to access to its Dashboard.

The last tutorial topic will be dedicated to the automatic deployment of custom physical hosts and virtual machines to let attendees perform experiments with their own environments. An automatic experimental engine that performs reservation, deployment and user-defined workflow will finally be presented.

Prerequisites: Participants that do not follow the morning session require a Grid’5000 account and basic knowledge of platform usage. An on-line tutorial is available at https://www.grid5000.fr/mediawiki/index.php/Getting_Started. A laptop with SSH client is also required.

Presenters Bios:

Dr. Simon Delamare is a research engineer from CNRS, working inside the LIP laboratory at ENS Lyon, France. He received his Master at Université Paris 6 and his Ph.D. at Telecom ParisTech in 2010. He is the current operational manager of the technical team of Grid’5000, the large computer science experimentation tested originated from France. His favorite research topics include quality of service and reliability of distributed computing infrastructures.

Dr. Laurent Pouilloux is a research engineer from INRIA, also working inside the LIP laboratory at ENS Lyon, France. He received his Ph.D. at Institut de Physique du Globe de Paris in 2007 and has a strong experience in teaching computer science. He is mainly involved in helping Grid’5000 users to setup large-scale experiments, especially for virtual machines deployment and cloud computing experiments automation.

Open Source Cloud Platforms

Salman Baset (sbaset@us.ibm.com)
Upendra Sharma (upendra.sharma@us.ibm.com)

This tutorial will present two major open-source platforms to deliver the Infrastructure as a Service (IaaS) and the Platform as a Service (PaaS) paradigms. Concretely, the first part will introduce and discuss details of the OpenStack solution, a system for operating an Infrastructure as a Service platform. The second part of this tutorial will focus on the Cloud Foundry proposal, an open source PaaS platform.

Session 1 - Overview of OpenStack (Morning):

OpenStack is an open source software that can be used to convert compute and storage hardware into an IaaS cloud. It provides compute, storage, and network services similar to Amazon EC2, S3 and EBS.

Introduction to OpenStack and its components.
We will give an overview of OpenStack concepts, reference deployment architectures, interaction among components, and provisioning process, through a live deployment of OpenStack.
OpenStack networking
We will give a detailed overview of various OpenStack networking options, Open vSwitch concepts.
OpenStack networking
We will provide an overview of the challenges and problems in operating OpenStack, commonly encountered problems, and their resolutions.

Duration: ~90 min

References

OpenStack Cloud Software. http://www.openstack.org

Session 2 – Cloud Foundry and BOSH (Morning)

Cloud Foundry is an open source cloud-computing Platform as a service Platform developed by VMware released under the terms of the Apache License 2.0. It was released on April 4th 2011. Cloud Foundry is part of the Pivotal Initiative, an independent entity funded by VMware and EMC. It is primarily written in Ruby. Cloud Foundry, essentially, consists of six prime components, namely router, cloud- controller, health manager, droplet execution engines, stager and NATS message bus. We will describe the operation of each of these components in the complete process of application deployment and execution.

Introduction to Cloud Floundry.
We will present a detailed architecture of Cloud Foundry and BOSH and how to use BOSH to deploy Cloud Foundry over OpenStack.
Using CloudFoundry as a developer
We will then describe how to use Cloud Foundry as a developer and its various features/services offered to the developers.

Duration: ~60 min

References

Cloud Foundry by Pivotal. http://docs.cloudfoundry.com/
BOSH: Open source tool for release engineering, deployment and lifecycle management of a large-scale distributed systems. http://docs.cloudfoundry.com/docs/running/bosh/.

Presenters Bios:

Salman A. Baset IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, New York 10598 (sabaset@us.ibm.com). Dr. Salman is working as a Research Staff Member at IBM T. J. Watson Research Center. He received a B.S. degree in Computer System Engineer from GIK Institute of Engineering Sciences and Technology, Pakistan in 2001, his M.S. and Ph.D. in Computer Science from Columbia University. His current research at IBM is focused on tracing and fault injection in distributed systems, particularly OpenStack, DevOps, over-subscription of physical machine resources, automation of software and operating system updates in the cloud, and cloud SLAs. He has been elected as the Release manager of Standard Performance Evaluation Corporation (SPEC) OSGcloud benchmarking sub-committee, which is working on standardizing a cloud benchmark. He is also a co-author of cloud benchmark framework report published by SPEC. Previously, he was involved in the Internet Engineering Task Force (IETF) and is a coauthor of RELOAD protocol for building peer-to-peer communication systems. He is a recipient of Young Scholars Award by Marconi Society in 2008 and a best paper award at IPTCOMM (Principles, Systems, and Applications of IP Telecommunications) in 2010. Currently, he is the chair of Distributed and Fault Tolerant Computing (DFTC) Professional Interest Community (PIC) at IBM.

Upendra Sharma IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, New York 10598 (upendra.sharma@us.ibm.com). Dr. Sharma is a Research Staff Member at IBM T.J. Watson Research Center. He received his Ph.D. in computer science in 2013 from University of Massachusetts Amherst. He received his MS degree in Electrical Engineering from Indian Institute of Technology Bombay, and his BS from Mumbai University. Prior to joining the graduate program at UMass Amherst, he worked as a researcher in IBM India Research Lab. His research interest includes virtualization, distributed systems and cloud computing; currently he is working on IBM's PaaS technology.

Cloud-Hosted Databases: Technologies, Challenges and Opportunities

Sherif Sakr (ssakr@cse.unsw.edu.au)

Cloud computing technology has revolutionized the way computational resources and services are commercialized and delivered to customers. Recently, the cloud has become an increasingly popular platform for hosting software applications in a variety of domains such as e-retail, finance, news and social networking. Thus, we are witnessing a proliferation in the number of applications with a tremendous increase in the scale of the data generated as well as being consumed by such applications. Cloud-hosted database systems powering these applications form a critical component in the software stack of these applications.

The quest for conquering the challenges posed by hosting databases on cloud computing environments has led to a plethora of systems and approaches. In practice, there are three main technologies, which are commonly used for deploying the database tier of software applications in cloud platforms, namely, the services of NoSQL storage systems, Database-as-a-service (DaaS) platforms and virtualized database servers. This tutorial aims to discuss the basic characteristics and the recent advancements of each of these technologies, illustrate the strengths and weaknesses of each technology and presents some opportunities for future work which are required to tackle existing research challenges and bring forward the vision of deploying data-intensive applications on cloud platforms.

Session 1 (Afternoon):

The first session will focus on providing a comprehensive background on the fundamentals of hosting the database tier of software applications in cloud environments. In particular, the session will cover the following list of items:

The requirements and challenges of hosting the database tiers of software applications in cloud environments.
An overview of the emerging technology of NoSQL storage systems.
An overview of Database-as-a-service (DaaS) cloud platforms.

Duration: ~90 min

Session 2 (Afternoon):

The second session will cover the following list of items:

An overview of the virtualized database servers technology and the main representative systems for hosting and managing database tiers of software applications in cloud environment using this technology.
We will crystallize the design choices, compare the strengths and weaknesses of the different technologies and provide a set of guidelines for the adequate scenarios for using each of these technologies.
We will highlight some opportunities for open research challenges and directions for improving the current state-of-the-art.

Duration: ~90 min

Prerequisites
This tutorial does not require any knowledge on hosting database applications in cloud environment.

Presenters Bios:

Dr. Sherif Sakr is a Senior Researcher in the Software Systems Research Group at National ICT Australia (NICTA), ATP lab, Sydney, Australia. He is also a Senior Lecturer in The School of Computer Science and Engineering (CSE) at University of New South Wales (UNSW). He received his PhD degree in Computer Science from Konstanz University, Germany in 2007. He received his BSc and MSc degree in Computer Science from the Information Systems department at the Faculty of Computers and Information in Cairo University, Egypt, in 2000 and 2003 respectively. In 2012, Sherif held a Research MTS position in Alcatel-Lucent Bell Labs. In 2011, Dr. Sakr held a Visiting Researcher with the eXtreme Computing Group (XCG), Microsoft Research Laboratories, Redmond, WA, USA.