Hide metadata

dc.date.accessioned2019-06-04T05:49:13Z
dc.date.available2020-06-14T22:46:20Z
dc.date.created2018-12-11T13:28:04Z
dc.date.issued2018
dc.identifier.citationZahid, Feroz Taherkordi, Amirhosein Gran, Ernst Gunnar Skeie, Tor Johnsen, Bjørn Dag . A Self-Adaptive Network for HPC Clouds: Architecture, Framework, and Implementation. IEEE Transactions on Parallel and Distributed Systems. 2018, 29(12), 2658-2671
dc.identifier.urihttp://hdl.handle.net/10852/68207
dc.description.abstractClouds offer flexible and economically attractive compute and storage solutions for enterprises. However, the effectiveness of cloud computing for high-performance computing (HPC) systems still remains questionable. When clouds are deployed on lossless interconnection networks, like InfiniBand (IB), challenges related to load-balancing, low-overhead virtualization, and performance isolation hinder full potential utilization of the underlying interconnect. Moreover, cloud data centers incorporate a highly dynamic environment rendering static network reconfigurations, typically used in IB systems, infeasible. In this paper, we present a framework for a self-adaptive network architecture for HPC clouds based on lossless interconnection networks, demonstrated by means of our implemented IB prototype. Our solution, based on a feedback control and optimization loop, enables the lossless HPC network to dynamically adapt to the varying traffic patterns, current resource availability, workload distributions, and also in accordance with the service provider-defined policies. Furthermore, we present IBAdapt, a simplified ruled-based language for the service providers to specify adaptation strategies used by the framework. Our developed self-adaptive IB network prototype is demonstrated using state-of-the-art industry software. The results obtained on a test cluster demonstrate the feasibility and effectiveness of the framework when it comes to improving Quality-of-Service compliance in HPC clouds.en_US
dc.languageEN
dc.titleA Self-Adaptive Network for HPC Clouds: Architecture, Framework, and Implementationen_US
dc.typeJournal articleen_US
dc.creator.authorZahid, Feroz
dc.creator.authorTaherkordi, Amirhosein
dc.creator.authorGran, Ernst Gunnar
dc.creator.authorSkeie, Tor
dc.creator.authorJohnsen, Bjørn Dag
cristin.unitcode185,15,5,71
cristin.unitnameDigitale infrastrukturer og sikkerhet
cristin.ispublishedtrue
cristin.fulltextpostprint
cristin.qualitycode2
dc.identifier.cristin1641678
dc.identifier.bibliographiccitationinfo:ofi/fmt:kev:mtx:ctx&ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.jtitle=IEEE Transactions on Parallel and Distributed Systems&rft.volume=29&rft.spage=2658&rft.date=2018
dc.identifier.jtitleIEEE Transactions on Parallel and Distributed Systems
dc.identifier.volume29
dc.identifier.issue12
dc.identifier.startpage2658
dc.identifier.endpage2671
dc.identifier.doihttp://dx.doi.org/10.1109/TPDS.2018.2842224
dc.identifier.urnURN:NBN:no-71381
dc.type.documentTidsskriftartikkelen_US
dc.type.peerreviewedPeer reviewed
dc.source.issn1045-9219
dc.identifier.fulltextFulltext https://www.duo.uio.no/bitstream/handle/10852/68207/2/tpds_self_archived.pdf
dc.type.versionAcceptedVersion


Files in this item

Appears in the following Collection

Hide metadata