BEGIN:VCALENDAR
PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN
VERSION:1.0
BEGIN:VEVENT
DTSTART:20121113T203000Z
DTEND:20121113T210000Z
LOCATION:155-E
DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: Correlation analysis is a widely used tool in a range of scientific fields, ranging from geology to genetics and astronomy. In astronomy, Two-point Correlation Function (TPCF) is commonly used to characterize the distribution of matter/energy in the Universe. Due to the large amount of computation with massive data, TPCF is a compelling benchmark for future exa-scale architectures.=0A =0AWe propose a novel algorithm that significantly reduces the computation and communication requirement of TPCF. We exploit the locality of histogram values and thus achieve near-linear scaling with respect to number of cores and SIMD-width.=0A =0AOn a 1600-node Zin supercomputer at Lawrence Livermore National Laboratory (1.06 Petaflops), we achieve 90% parallel efficiency and 96% SIMD efficiency, and perform computation on a 1.7 billion particle dataset in 5.3 hours (35–37X faster than previous approaches). Consequently, we now have line-of-sight to achieving the processing power for correlation computation to process billion+ particles telescopic data.
SUMMARY:Billion-Particle SIMD-Friendly Two-Point Correlation on Large-Scale HPC Cluster Systems
PRIORITY:3
END:VEVENT
END:VCALENDAR
