HCloud: Resource-Efficient Provisioning in Shared Cloud Systems

Kozyrakis, Christos

doi:10.1145/2872362.2872365

conference paper

HCloud: Resource-Efficient Provisioning in Shared Cloud Systems

Delimitrou, Christina

•

Kozyrakis, Christos

2016

Acm Sigplan Notices

21st International Conference on Architectural Support for Programming Languages and Operating Systems

Cloud computing promises flexibility and high performance for users and cost efficiency for operators. To achieve this, cloud providers offer instances of different sizes, both as long-term reservations and short-term, on-demand allocations. Unfortunately, determining the best provisioning strategy is a complex, multi-dimensional problem that depends on the load fluctuation and duration of incoming jobs, and the performance unpredictability and cost of resources. We first compare the two main provisioning strategies (reserved and on-demand resources) on Google Compute Engine (GCE) using three representative workload scenarios with batch and latency-critical applications. We show that either approach is suboptimal for performance or cost. We then present HCloud, a hybrid provisioning system that uses both reserved and on-demand resources. HCloud determines which jobs should be mapped to reserved versus on-demand resources based on overall load, and resource unpredictability. It also determines the optimal instance size an application needs to satisfy its Quality of Service (QoS) constraints. We demonstrate that hybrid configurations improve performance by 2.1x compared to fully on-demand provisioning, and reduce cost by 46% compared to fully reserved systems. We also show that hybrid strategies are robust to variation in system and job parameters, such as cost and system load.

Type

conference paper

DOI

10.1145/2872362.2872365

Web of Science ID

WOS:000379415100035

WOS:000385493900035

Author(s)

Delimitrou, Christina

Kozyrakis, Christos

Date Issued

2016

Publisher

Assoc Computing Machinery

Publisher place

New York

Published in

Acm Sigplan Notices

Total of pages

16

Volume

51

Issue

4

Start page

473

End page

488

Subjects

datacenter

•

provisioning

•

QoS

•

latency

•

resource efficiency

•

hybrid

•

cloud computing

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

SAIL

Event name	Event place	Event date
21st International Conference on Architectural Support for Programming Languages and Operating Systems	Atlanta, GA	APR 02-06, 2016

Available on Infoscience

October 18, 2016

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/129992