Hide metadata

dc.date.accessioned2014-01-08T16:36:23Z
dc.date.available2014-01-08T16:36:23Z
dc.date.created2013-09-26T13:34:43Z
dc.date.issued2013
dc.identifier.citationSu, Huayou Wu, Nan Wen, Mei Zhang, Chunyuan Cai, Xing . Performance of sediment transport simulations on NVIDIA’s Kepler architecture. Procedia Computer Science. 2013, 18, 1275-1281
dc.identifier.urihttp://hdl.handle.net/10852/37955
dc.description.abstractAiming to understand how high-performance CUDA programming can be done for NVIDIA's new Kepler architecture, we have investigated a specific case of simulating sediment transport. The arisen stencil computations have distinct features connected to the two nonlinear partial differential equations that constitute the mathematical model. Consequently, the required CUDA programming effort differs for the two corresponding CUDA kernel functions. While Kepler's new read-only data cache brings enough benefits for one kernel function, performance of the other kernel function is further enhanceable through using the shared memory and so-called halo threads. The highest achieved performance of the stencil computation amounts to 190.45 GFLOPs on a Tesla K20 GPU.
dc.languageEN
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Unported
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/
dc.titlePerformance of sediment transport simulations on NVIDIA’s Kepler architecture
dc.typeJournal article
dc.creator.authorSu, Huayou
dc.creator.authorWu, Nan
dc.creator.authorWen, Mei
dc.creator.authorZhang, Chunyuan
dc.creator.authorCai, Xing
cristin.unitcode185,15,5,52
cristin.unitnameBeregningsorientert matematikk
cristin.ispublishedtrue
cristin.fulltextoriginal
cristin.qualitycode1
dc.identifier.cristin1052642
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Procedia Computer Science&rft.volume=18&rft.spage=1275&rft.date=2013
dc.identifier.jtitleProcedia Computer Science
dc.identifier.volume18
dc.identifier.startpage1275
dc.identifier.endpage1281
dc.identifier.doihttp://dx.doi.org/10.1016/j.procs.2013.05.294
dc.identifier.urnURN:NBN:no-40241
dc.type.documentTidsskriftartikkel
dc.type.peerreviewedPeer reviewed
dc.source.issn1877-0509
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/37955/1/iccs2013.pdf
dc.type.versionPublishedVersion


Files in this item

Appears in the following Collection

Hide metadata

Attribution-NonCommercial-NoDerivs 3.0 Unported
This item's license is: Attribution-NonCommercial-NoDerivs 3.0 Unported