BEGIN:VCALENDAR PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN VERSION:2.0 BEGIN:VEVENT DTSTART:20151119T170000Z DTEND:20151119T173000Z LOCATION:18AB DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: The ability to record and replay program execution helps significantly in debugging non-deterministic parallel applications by reproducing message receive orders. However, the large amount of data that traditional record-and-reply techniques record precludes its practical applicability to massively parallel MPI applications. In this paper, we propose a new compression algorithm, Clock Delta Compression (CDC), for scalable record and replay of non-deterministic MPI applications. CDC defines a reference order of message receives based on a totally ordered relation using Lamport clocks, and only records the differences between this reference logical-clock order and an observed order. Our evaluation shows that CDC significantly reduces the record data size. For example, when we apply CDC to a Monte Carlo particle transport benchmark (MCB), which represents non-deterministic communication patterns, CDC reduces the record size by approximately two orders of magnitude compared to traditional techniques and incurs between 13.1% and 25.5% of runtime overhead. SUMMARY:Clock Delta Compression for Scalable Order-Replay of Non-Deterministic Parallel Applications PRIORITY:3 END:VEVENT END:VCALENDAR