Hide metadata

dc.date.accessioned2024-02-21T18:25:33Z
dc.date.available2024-02-21T18:25:33Z
dc.date.created2023-11-16T09:44:50Z
dc.date.issued2023
dc.identifier.citationBrasser, R. Grimm, S.L. Hatalova, Petra Stadel, J.G. . Speeding up the GENGA N -body integrator on consumer-grade graphics cards. Astronomy and Astrophysics (A & A). 2023, 678
dc.identifier.urihttp://hdl.handle.net/10852/108417
dc.description.abstractContext. Graphics processing unit (GPU) computing has become popular due to the enormous calculation potential that can be harvested from a single card. The N -body integrator Gravitational ENcounters with GPU Acceleration (GENGA) is built to harvest the computing power from such cards, but it suffers a severe performance penalty on consumer-grade Nvidia GPUs due to their artificially truncated double precision performance. Aims. We aim to speed up GENGA on consumer-grade cards by harvesting their high single-precision performance. Methods. We modified GENGA to have the option to compute the mutual long-distance forces between bodies in single precision and tested this with five experiments. First, we ran a high number of simulations with similar initial conditions of on average 6600 fully self-gravitating planetesimals in both single and double precision to establish whether the outcomes were statistically different. These simulations were run on Tesla K20 cards. We supplemented this test with simulations that (i) began with a mixture of planetesimals and planetary embryos, (ii) planetesimal-driven giant planet migration, and (iii) terrestrial planet formation with a dissipating gas disc. All of these simulations served to determine the accuracy of energy and angular momentum conservation under various scenarios with single and double precision forces. Second, we ran the same simulation beginning with 40 000 self-gravitating planetesimals using both single and double precision forces on a variety of consumer-grade and Tesla GPUs to measure the performance boost of computing the long-range forces in single precision. Results. We find that there are no statistical differences when simulations are run with the gravitational forces in single or double precision that can be attributed to the force prescription rather than stochastic effects. The accumulations in uncertainty in energy are almost identical when running with single or double precision long-range forces. However, the uncertainty in the angular momentum using single rather than double precision long-range forces is about two orders of magnitude greater, but still very low. Running the simulations in single precision on consumer-grade cards decreases running time by a factor of three and becomes within a factor of three of a Tesla A100 GPU. Additional tuning speeds up the simulation by a factor of two across all types of cards. Conclusions. The option to compute the long-range forces in single precision in GENGA when using consumer-grade GPUs dramatically improves performance at a little penalty to accuracy. There is an additional environmental benefit because it reduces energy usage.
dc.languageEN
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleSpeeding up the GENGA N -body integrator on consumer-grade graphics cards
dc.title.alternativeENEngelskEnglishSpeeding up the GENGA N -body integrator on consumer-grade graphics cards
dc.typeJournal article
dc.creator.authorBrasser, R.
dc.creator.authorGrimm, S.L.
dc.creator.authorHatalova, Petra
dc.creator.authorStadel, J.G.
cristin.unitcode185,15,22,91
cristin.unitnameSenter for planetær beboelighet
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode2
dc.identifier.cristin2197432
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Astronomy and Astrophysics (A & A)&rft.volume=678&rft.spage=&rft.date=2023
dc.identifier.jtitleAstronomy and Astrophysics (A & A)
dc.identifier.volume678
dc.identifier.pagecount0
dc.identifier.doihttps://doi.org/10.1051/0004-6361/202347071
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.source.issn0004-6361
dc.type.versionPublishedVersion
cristin.articleidA73
dc.relation.projectNFR/332523
dc.relation.projectSIGMA2/nn9010K


Files in this item

Appears in the following Collection

Hide metadata

Attribution 4.0 International
This item's license is: Attribution 4.0 International