Hide metadata

dc.date.accessioned2018-10-17T10:31:24Z
dc.date.available2018-10-17T10:31:24Z
dc.date.created2016-09-09T16:36:02Z
dc.date.issued2016
dc.identifier.citationZahid, Feroz Gran, Ernst Gunnar Bogdanski, Bartosz Johnsen, Bjørn Dag Skeie, Tor . Efficient network isolation and load balancing in multi-tenant HPC clusters. Future generations computer systems. 2017, 72, 145-162
dc.identifier.urihttp://hdl.handle.net/10852/65181
dc.description.abstractMulti-tenancy promises high utilization of available system resources and helps maintaining costeffective operations for service providers. However, multi-tenant high-performance computing (HPC) infrastructures, like dynamic HPC clouds, bring unique challenges, both associated with providing performance isolation to the tenants, and achieving efficient load-balancing across the network fabric. Each tenant should experience predictable network performance, unaffected by the workload of other tenants. At the same time, it is equally important that the network links are balanced, avoiding network saturation. The network saturation can lead to unpredictable application performance, and a potential loss of profit for the cloud service providers. In this paper, we present two significant extensions to our previously proposed partition-aware fattree routing algorithm, pFTree, for InfiniBand-based HPC systems. First, we extend pFTree to incorporate provider defined partition-wise policies that govern how the nodes in different partitions are allowed to share network resources with each other. Second, we present a weighted version of the pFTree routing algorithm, that besides partitions, also takes node traffic characteristics into account to balance load across the network links more evenly. A comprehensive evaluation comprising both real-world experiments and simulations confirms the correctness and feasibility of the proposed extensions.en_US
dc.languageEN
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleEfficient network isolation and load balancing in multi-tenant HPC clustersen_US
dc.typeJournal articleen_US
dc.creator.authorZahid, Feroz
dc.creator.authorGran, Ernst Gunnar
dc.creator.authorBogdanski, Bartosz
dc.creator.authorJohnsen, Bjørn Dag
dc.creator.authorSkeie, Tor
cristin.unitcode185,15,5,71
cristin.unitnameForskningsgruppen for nettverk og distribuerte systemer
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode1
dc.identifier.cristin1379833
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Future generations computer systems&rft.volume=72&rft.spage=145&rft.date=2017
dc.identifier.jtitleFuture generations computer systems
dc.identifier.volume72
dc.identifier.startpage145
dc.identifier.endpage162
dc.identifier.doihttp://dx.doi.org/10.1016/j.future.2016.04.003
dc.identifier.urnURN:NBN:no-67720
dc.type.documentTidsskriftartikkelen_US
dc.type.peerreviewedPeer reviewed
dc.source.issn0167-739X
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/65181/4/CRIStinEntryNr1379833_+EfficientNetworkIsolationAndLoadBalancingInMulti-TenantHPCClusters.pdf
dc.type.versionAcceptedVersion
dc.relation.projectNFR/213283


Files in this item

Appears in the following Collection

Hide metadata

Attribution-NonCommercial-NoDerivatives 4.0 International
This item's license is: Attribution-NonCommercial-NoDerivatives 4.0 International