BEGIN:VCALENDAR PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN VERSION:1.0 BEGIN:VEVENT DTSTART:20121114T223000Z DTEND:20121114T230000Z LOCATION:255-EF DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: Task parallelism raises the level of abstraction in shared memory parallel programming to simplify the development of complex applications. However, task parallel applications can exhibit poor performance due to thread idleness, scheduling overheads, and work time inflation -- additional time spent by threads in a multithreaded computation beyond the time required to perform the same work in a sequential computation. We identify the contributions of each factor to lost efficiency in various task parallel OpenMP applications and diagnose the causes of work time inflation in those applications.=0A=0AA major cause of work time inflation in NUMA systems is increased latency to access data for computations. To mitigate this source of work time inflation in some applications, we propose a locality framework for task parallel OpenMP programs. As implemented in our extensions to the Qthreads library, locality-aware scheduling demonstrates up to 3X improvement compared to the Intel OpenMP task scheduler. SUMMARY:Characterizing and Mitigating Work Time Inflation in Task Parallel Programs PRIORITY:3 END:VEVENT END:VCALENDAR BEGIN:VCALENDAR PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN VERSION:1.0 BEGIN:VEVENT DTSTART:20121114T223000Z DTEND:20121114T230000Z LOCATION:255-EF DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: Task parallelism raises the level of abstraction in shared memory parallel programming to simplify the development of complex applications. However, task parallel applications can exhibit poor performance due to thread idleness, scheduling overheads, and work time inflation -- additional time spent by threads in a multithreaded computation beyond the time required to perform the same work in a sequential computation. We identify the contributions of each factor to lost efficiency in various task parallel OpenMP applications and diagnose the causes of work time inflation in those applications.=0A=0AA major cause of work time inflation in NUMA systems is increased latency to access data for computations. To mitigate this source of work time inflation in some applications, we propose a locality framework for task parallel OpenMP programs. As implemented in our extensions to the Qthreads library, locality-aware scheduling demonstrates up to 3X improvement compared to the Intel OpenMP task scheduler. SUMMARY:Characterizing and Mitigating Work Time Inflation in Task Parallel Programs PRIORITY:3 END:VEVENT END:VCALENDAR