SCHEDULE: NOV 10-16, 2012
Containment Domains - A Scalable, Efficient, and Flexible Resiliency Scheme for Exascale Systems
SESSION: Resilience
EVENT TYPE: Papers, Best Student Paper Finalists
TIME: 2:00PM - 2:30PM
SESSION CHAIR: Bronis R. de Supinski
AUTHOR(S): Jinsuk Chung, Ikhwan Lee, Michael Sullivan, Jee Ho Ryoo, Dongwan Kim, Doe Hyun Yoon, Larry Kaplan, Mattan Erez
ROOM: 255-EF
ABSTRACT:
This paper describes and evaluates a scalable and efficient resiliency scheme based on the concept of containment domains. Containment domains are a programming construct that enables applications to express resiliency needs and interact with the system to tune and specialize error detection, state preservation and restoration, and recovery schemes. Containment domains have weak transactional semantics and are nested to take advantage of the machine hierarchy and to enable distributed and hierarchical state preservation, restoration, and recovery as an alternative to non-scalable and inefficient checkpoint-restart. We evaluate the scalability and efficiency of containment domains using generalized trace-driven simulation and analytical modeling, and show that containment domains are superior to both checkpoint-restart and redundant-execution approaches.
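The preserve, detect, restore, and escalate cycle described in the abstract can be illustrated with a small toy model. This is only a sketch and not the authors' API: the names run_domain, DetectedError, and the retries parameter are hypothetical, state is assumed to be a plain dictionary, and an error is assumed to be detectable by a user-supplied check function.

```python
import copy


class DetectedError(Exception):
    """Hypothetical signal that a domain's error check failed."""


def run_domain(state, body, check, retries=1):
    """Run body(state) as one toy containment domain (illustrative only).

    Preserve: deep-copy the state on entry.
    Detect:   call check(state) after the body finishes.
    Recover:  restore the preserved copy and re-execute the body.
    Escalate: after `retries` failed re-executions, raise so that an
              enclosing domain can restore its own state and retry.
    """
    preserved = copy.deepcopy(state)
    for _ in range(retries + 1):
        try:
            body(state)
            if not check(state):
                raise DetectedError("check failed")
            return
        except DetectedError:
            # Roll back to the state preserved on entry to this domain.
            state.clear()
            state.update(copy.deepcopy(preserved))
    raise DetectedError("escalating to parent domain")


# Simulated fault schedule: the first child execution is corrupted.
faults = iter([True, False])


def child_body(state):
    state["x"] += 1
    if next(faults, False):
        state["x"] = -999  # simulated corruption


def parent_body(state):
    # The child domain is nested inside the parent domain.
    run_domain(state, child_body, check=lambda s: s["x"] >= 0)


state = {"x": 0}
run_domain(state, parent_body, check=lambda s: True)
```

In this run the child domain detects the corrupted value, restores its own preserved copy, and re-executes locally, so the parent never has to roll back. If the child exhausted its retries, the raised error would be caught by the parent, which would restore its own copy and re-execute.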
Chair/Author Details:
Bronis R. de Supinski (Chair) - Lawrence Livermore National Laboratory
Jinsuk Chung - University of Texas at Austin
Ikhwan Lee - University of Texas at Austin
Michael Sullivan - University of Texas at Austin
Jee Ho Ryoo - University of Texas at Austin
Dongwan Kim - University of Texas at Austin
Doe Hyun Yoon - Hewlett-Packard
Larry Kaplan - Cray Inc.
Mattan Erez - University of Texas at Austin