![]() |
![]() |
|||||||||
|
Title
Carbon: Architectural Support for Fine-Grained Parallelism on Chip Multiprocessors. Authors Sanjeev Kumar, Christopher J. Hughes, Anthony Nguyen. Publication In the Proceedings of IEEE/ACM International Symposium on Computer Architecture (ISCA), San Diego, California, June 2007.
Downloads Abstract
Chip multiprocessors (CMPs) are now commonplace, and the number
of cores on a CMP is likely to grow steadily. However, in order
to harness the additional compute resources of a CMP, applications
must expose their thread-level parallelism to the hardware. One
common approach to doing this is to decompose a program into
parallel "tasks" and allow an underlying software layer to schedule
these tasks to different threads. Software task scheduling can provide
good parallel performance as long as tasks are large compared
to the software overheads.
We examine a set of applications from an important emerging domain: Recognition, Mining, and Synthesis (RMS). Many RMS applications are compute-intensive and have abundant thread-level parallelism, and are therefore good targets for running on a CMP. However, a significant number have small tasks for which software task schedulers achieve only limited parallel speedups. We propose Carbon, a hardware technique to accelerate dynamic task scheduling on scalable CMPs. Carbon has relatively simple hardware, most of which can be placed far from the cores. We compare Carbon to some highly tuned software task schedulers for a set of RMS benchmarks with small tasks. Carbon delivers significant performance improvements over the best software scheduler: on average for 64 cores, 68% faster on a set of loop-parallel benchmarks, and 109% faster on a set of task-parallel benchmarks.
|
||||||||||
![]() |
![]() |
|||||||||