Publication:
Splash-4: A Modern Benchmark Suite with Lock-Free Constructs

Loading...
Thumbnail Image
Date
2022
relationships.isAuthorOfPublication
relationships.isSecondaryAuthorOf
relationships.isDirectorOf
Authors
Gómez-Hernández, Eduardo José ; Cebrian, Juan Manuel ; Kaxiras, Stefanos ; Ros, Alberto
item.page.secondaryauthor
item.page.director
Publisher
IEEE Computer Society
publication.page.editor
DOI
https://doi.org/10.1109/IISWC55918.2022.00015
item.page.type
info:eu-repo/semantics/article
Description
© 2022. The authors. This document is made available under the CC-BY 4.0 license http://creativecommons.org/licenses/by /4.0/ This document is the acepted version of a published work that appeared in final form in 2022 IEEE International Symposium on Workload Characterization (IISWC) To access the final work, see DOI: https://doi.org/10.1109/IISWC55918.2022.00015
Abstract
The cornerstone for the performance evaluation of computer systems is the benchmark suite. Among the many benchmark suites used in high-performance computing and multicore research, Splash-2 has been instrumental in advancing knowledge for both academia and industry. Published in 1995 and with over 5276 citations and counting, this benchmark suite is still in use to evaluate novel architectural proposals. Recently, the Splash-3 suite eliminates important performance bugs, data races, and improper synchronization that plagued Splash-2 benchmarks after the formal definition of the C memory model. However, keeping up with architectural changes while maintaining the same workloads and algorithms (for comparative purposes) is a real challenge. Benchmark suites can misrepresent the performance characteristics of a computer system if they do not reflect the available features of the hardware and architects may end up overestimating the impact of proposed techniques or underestimating others. In this work we introduce a revised version of Splash-3, designated Splash-4, that introduces modern programming techniques to improve scalability on contemporary hardware. We then characterize Splash-3 and Splash-4 in a state-of-the-art simulated architecture, Intel’s Ice Lake with gem5-20 simulator, as well as a real contemporary hardware processor (AMD’s EPYC 7002 series). Our evaluation shows that for a 64-thread execution Splash-4 reduces the normalized execution time by an average of 52% and 34% for AMD’s EPYC and Intel’s Ice Lake, respectively.
Citation
2022 IEEE International Symposium on Workload Characterization (IISWC), Austin, TX, USA, 2022, pp. 51-64
item.page.embargo
Collections