Enabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation

dc.date.accessioned	2024-02-18T17:43:43Z
dc.date.available	2024-02-18T17:43:43Z
dc.date.created	2023-06-21T10:34:26Z
dc.date.issued	2023
dc.identifier.citation	Burchard, Luk Bjarne; Hustad, Kristian Gregorius; Langguth, Johannes; Cai, Xing. Enabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation. Frontiers in Physics. 2023, 11
dc.identifier.uri	http://hdl.handle.net/10852/108238
dc.description.abstract	A new trend in processor architecture design is the packaging of thousands of small processor cores into a single device, where there is no device-level shared memory but each core has its own local memory. Thus, both the work and data of an application code need to be carefully distributed among the small cores, also termed as tiles. In this paper, we investigate how numerical computations that involve unstructured meshes can be efficiently parallelized and executed on a massively tiled architecture. Graphcore IPUs are chosen as the target hardware platform, to which we port an existing monodomain solver that simulates cardiac electrophysiology over realistic 3D irregular heart geometries. There are two computational kernels in this simulator, where a 3D diffusion equation is discretized over an unstructured mesh and numerically approximated by repeatedly executing sparse matrix-vector multiplications (SpMVs), whereas an individual system of ordinary differential equations (ODEs) is explicitly integrated per mesh cell. We demonstrate how a new style of programming that uses Poplar/C++ can be used to port these commonly encountered computational tasks to Graphcore IPUs. In particular, we describe a per-tile data structure that is adapted to facilitate the inter-tile data exchange needed for parallelizing the SpMVs. We also study the achievable performance of the ODE solver that heavily depends on special mathematical functions, as well as their accuracy on Graphcore IPUs. Moreover, topics related to using multiple IPUs and performance analysis are addressed. In addition to demonstrating an impressive level of performance that can be achieved by IPUs for monodomain simulation, we also provide a discussion on the generic theme of parallelizing and executing unstructured-mesh multiphysics computations on massively tiled hardware.
dc.language	EN
dc.rights	Attribution 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.title	Enabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation
dc.title.alternative	Enabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation
dc.type	Journal article
dc.creator.author	Burchard, Luk Bjarne
dc.creator.author	Hustad, Kristian Gregorius
dc.creator.author	Langguth, Johannes
dc.creator.author	Cai, Xing
cristin.unitcode	185,15,5,0
cristin.unitname	Institutt for informatikk
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	1
dc.identifier.cristin	2156482
dc.identifier.bibliographiccitation	info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Frontiers in Physics&rft.volume=11&rft.spage=&rft.date=2023
dc.identifier.jtitle	Frontiers in Physics
dc.identifier.volume	11
dc.identifier.doi	https://doi.org/10.3389/fphy.2023.979699
dc.type.document	Journal article
dc.type.peerreviewed	Peer reviewed
dc.source.issn	2296-424X
dc.type.version	PublishedVersion
cristin.articleid	979699
dc.relation.project	NFR/329017
dc.relation.project	EU/956213
dc.relation.project	NFR/270053