dc.date.accessioned | 2018-10-17T10:31:24Z | |
dc.date.available | 2018-10-17T10:31:24Z | |
dc.date.created | 2016-09-09T16:36:02Z | |
dc.date.issued | 2016 | |
dc.identifier.citation | Zahid, Feroz Gran, Ernst Gunnar Bogdanski, Bartosz Johnsen, Bjørn Dag Skeie, Tor . Efficient network isolation and load balancing in multi-tenant HPC clusters. Future generations computer systems. 2017, 72, 145-162 | |
dc.identifier.uri | http://hdl.handle.net/10852/65181 | |
dc.description.abstract | Multi-tenancy promises high utilization of available system resources and helps maintaining costeffective operations for service providers. However, multi-tenant high-performance computing (HPC) infrastructures, like dynamic HPC clouds, bring unique challenges, both associated with providing performance isolation to the tenants, and achieving efficient load-balancing across the network fabric. Each tenant should experience predictable network performance, unaffected by the workload of other tenants. At the same time, it is equally important that the network links are balanced, avoiding network saturation. The network saturation can lead to unpredictable application performance, and a potential loss of profit for the cloud service providers.
In this paper, we present two significant extensions to our previously proposed partition-aware fattree routing algorithm, pFTree, for InfiniBand-based HPC systems. First, we extend pFTree to incorporate provider defined partition-wise policies that govern how the nodes in different partitions are allowed to share network resources with each other. Second, we present a weighted version of the pFTree routing algorithm, that besides partitions, also takes node traffic characteristics into account to balance load across the network links more evenly. A comprehensive evaluation comprising both real-world experiments and simulations confirms the correctness and feasibility of the proposed extensions. | en_US |
dc.language | EN | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.title | Efficient network isolation and load balancing in multi-tenant HPC clusters | en_US |
dc.type | Journal article | en_US |
dc.creator.author | Zahid, Feroz | |
dc.creator.author | Gran, Ernst Gunnar | |
dc.creator.author | Bogdanski, Bartosz | |
dc.creator.author | Johnsen, Bjørn Dag | |
dc.creator.author | Skeie, Tor | |
cristin.unitcode | 185,15,5,71 | |
cristin.unitname | Forskningsgruppen for nettverk og distribuerte systemer | |
cristin.ispublished | true | |
cristin.fulltext | postprint | |
cristin.qualitycode | 1 | |
dc.identifier.cristin | 1379833 | |
dc.identifier.bibliographiccitation | info:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=Future generations computer systems&rft.volume=72&rft.spage=145&rft.date=2017 | |
dc.identifier.jtitle | Future generations computer systems | |
dc.identifier.volume | 72 | |
dc.identifier.startpage | 145 | |
dc.identifier.endpage | 162 | |
dc.identifier.doi | http://dx.doi.org/10.1016/j.future.2016.04.003 | |
dc.identifier.urn | URN:NBN:no-67720 | |
dc.type.document | Tidsskriftartikkel | en_US |
dc.type.peerreviewed | Peer reviewed | |
dc.source.issn | 0167-739X | |
dc.identifier.fulltext | Fulltext https://www.duo.uio.no/bitstream/handle/10852/65181/4/CRIStinEntryNr1379833_+EfficientNetworkIsolationAndLoadBalancingInMulti-TenantHPCClusters.pdf | |
dc.type.version | AcceptedVersion | |
dc.relation.project | NFR/213283 | |