Hide metadata

dc.date.accessioned2024-02-18T17:43:43Z
dc.date.available2024-02-18T17:43:43Z
dc.date.created2023-06-21T10:34:26Z
dc.date.issued2023
dc.identifier.citationBurchard, Luk Bjarne Hustad, Kristian Gregorius Langguth, Johannes Cai, Xing . Enabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation. Frontiers in Physics. 2023, 11
dc.identifier.urihttp://hdl.handle.net/10852/108238
dc.description.abstractA new trend in processor architecture design is the packaging of thousands of small processor cores into a single device, where there is no device-level shared memory but each core has its own local memory. Thus, both the work and data of an application code need to be carefully distributed among the small cores, also termed as tiles . In this paper, we investigate how numerical computations that involve unstructured meshes can be efficiently parallelized and executed on a massively tiled architecture. Graphcore IPUs are chosen as the target hardware platform, to which we port an existing monodomain solver that simulates cardiac electrophysiology over realistic 3D irregular heart geometries. There are two computational kernels in this simulator, where a 3D diffusion equation is discretized over an unstructured mesh and numerically approximated by repeatedly executing sparse matrix-vector multiplications (SpMVs), whereas an individual system of ordinary differential equations (ODEs) is explicitly integrated per mesh cell. We demonstrate how a new style of programming that uses Poplar/C++ can be used to port these commonly encountered computational tasks to Graphcore IPUs. In particular, we describe a per-tile data structure that is adapted to facilitate the inter-tile data exchange needed for parallelizing the SpMVs. We also study the achievable performance of the ODE solver that heavily depends on special mathematical functions, as well as their accuracy on Graphcore IPUs. Moreover, topics related to using multiple IPUs and performance analysis are addressed. In addition to demonstrating an impressive level of performance that can be achieved by IPUs for monodomain simulation, we also provide a discussion on the generic theme of parallelizing and executing unstructured-mesh multiphysics computations on massively tiled hardware.
dc.languageEN
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleEnabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation
dc.title.alternativeENEngelskEnglishEnabling unstructured-mesh computation on massively tiled AI processors: An example of accelerating in silico cardiac simulation
dc.typeJournal article
dc.creator.authorBurchard, Luk Bjarne
dc.creator.authorHustad, Kristian Gregorius
dc.creator.authorLangguth, Johannes
dc.creator.authorCai, Xing
cristin.unitcode185,15,5,0
cristin.unitnameInstitutt for informatikk
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1
dc.identifier.cristin2156482
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Frontiers in Physics&rft.volume=11&rft.spage=&rft.date=2023
dc.identifier.jtitleFrontiers in Physics
dc.identifier.volume11
dc.identifier.doihttps://doi.org/10.3389/fphy.2023.979699
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.source.issn2296-424X
dc.type.versionPublishedVersion
cristin.articleid979699
dc.relation.projectNFR/329017
dc.relation.projectEU/956213
dc.relation.projectNFR/270053


Files in this item

Appears in the following Collection

Hide metadata

Attribution 4.0 International
This item's license is: Attribution 4.0 International