D 2024

Towards Dynamic Autotuning of SpMV in CUSP Library

DEMEK, Miroslav and Jiří FILIPOVIČ

Basic information

Original name

Towards Dynamic Autotuning of SpMV in CUSP Library

Authors

DEMEK, Miroslav (203 Czech Republic, belonging to the institution) and Jiří FILIPOVIČ (203 Czech Republic, guarantor, belonging to the institution)

Edition

San Francisco, USA, IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), p. 14-22, 9 pp. 2024

Publisher

IEEE

Other information

Language

English

Type of outcome

Proceedings paper

Field of Study

10201 Computer sciences, information science, bioinformatics

Country of publisher

United States of America

Confidentiality degree

is not subject to a state or trade secret

Publication form

electronic version available online

RIV identification code

RIV/00216224:14610/24:00137324

Organization unit

Institute of Computer Science

ISBN

979-8-3503-6461-3

UT WoS

001284697300084

EID Scopus

2-s2.0-85200751879

Keywords in English

dynamic autotuning; SpMV; CUPS; Kernel Tuning Toolkit

Tags

Tags

International impact, Reviewed
Changed: 4/4/2025 13:12, Mgr. Eva Špillingová

Abstract

In the original language

Sparse matrix-vector product (SpMV) is a central operation in many iterative methods for solving linear systems and, as such, is an attractive candidate for acceleration on the GPU. However, the performance of the SpMV kernel can vary depending both on the target architecture as well as on the sparsity pattern of the matrix. Thus, to achieve optimal performance, the implementation might need to be adjusted for each particular matrix and architecture. This can be achieved through dynamic autotuning, a method that can optimize a source code during program runtime. In this paper, we present a dynamic autotuning of SpMV kernel included in a production-quality CUSP library. We identify and implement tuning parameters and use the Kernel Tuning Toolkit framework for autotuning of SpMV working with the DIA and ELL sparse matrix formats. The dynamic autotuning integration is fully transparent to the library users - it can be activated just by re-compiling software using our tunable version of the CUSP. The proposed autotuned library is evaluated by comparing it with the original CUSP kernels on a set of representative matrices and by examining the contribution of autotuning. The results show that the autotuned kernels can reach up to about 16.9 × speedup compared to a fixed implementation.

Links

LM2023054, research and development project
Name: e-Infrastruktura CZ
Investor: Ministry of Education, Youth and Sports of the CR