Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
ElastMan: Autonomic Elasticity Manager for Cloud-Based Key-Value Stores
KTH, School of Information and Communication Technology (ICT), Software and Computer systems, SCS.
KTH, School of Information and Communication Technology (ICT), Software and Computer systems, SCS.
2012 (English)Report (Other academic)
Abstract [en]

The increasing spread of elastic Cloud services, together with the pay-asyou-go pricing model of Cloud computing, has led to the need of an elasticity controller. The controller automatically resizes an elastic service, in response to changes in workload, in order to meet Service Level Objectives (SLOs) at a reduced cost. However, variable performance of Cloud virtual machines and nonlinearities in Cloud services, such as the diminishing reward of adding a service instance with increasing the scale, complicates the controller design. We present the design and evaluation of ElastMan, an elasticity controller for Cloud-based elastic key-value stores. ElastMan combines feedforward and feedback control. Feedforward control is used to respond to spikes in the workload by quickly resizing the service to meet SLOs at a minimal cost. Feedback control is used to correct modeling errors and to handle diurnal workload. To address nonlinearities, our design of ElastMan leverages the near-linear scalability of elastic Cloud services in order to build a scale-independent model of the service. Our design based on combining feedforward and feedback control allows to efficiently handle both diurnal and rapid changes in workload in order to meet SLOs at a minimal cost. Our evaluation shows the feasibility of our approach to automation of Cloud service elasticity.

Place, publisher, year, edition, pages
2012. , 14 p.
Series
TRITA-ICT-ECS R, ISSN 1653-7238 ; 12:01
Keyword [en]
Elasticity Controller, Cloud Storage, feedback, feedforward, SLO
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:kth:diva-101660OAI: oai:DiVA.org:kth-101660DiVA: diva2:548449
Funder
ICT - The Next Generation
Note

QC 20120831

Available from: 2012-08-30 Created: 2012-08-30 Last updated: 2014-01-23Bibliographically approved
In thesis
1. Self-Management for Large-Scale Distributed Systems
Open this publication in new window or tab >>Self-Management for Large-Scale Distributed Systems
2012 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Autonomic computing aims at making computing systems self-managing by using autonomic managers in order to reduce obstacles caused by management complexity. This thesis presents results of research on self-management for large-scale distributed systems. This research was motivated by the increasing complexity of computing systems and their management.

In the first part, we present our platform, called Niche, for programming self-managing component-based distributed applications. In our work on Niche, we have faced and addressed the following four challenges in achieving self-management in a dynamic environment characterized by volatile resources and high churn: resource discovery, robust and efficient sensing and actuation, management bottleneck, and scale. We present results of our research on addressing the above challenges. Niche implements the autonomic computing architecture, proposed by IBM, in a fully decentralized way. Niche supports a network-transparent view of the system architecture simplifying the design of distributed self-management. Niche provides a concise and expressive API for self-management. The implementation of the platform relies on the scalability and robustness of structured overlay networks. We proceed by presenting a methodology for designing the management part of a distributed self-managing application. We define design steps that include partitioning of management functions and orchestration of multiple autonomic managers.

In the second part, we discuss robustness of management and data consistency, which are necessary in a distributed system. Dealing with the effect of churn on management increases the complexity of the management logic and thus makes its development time consuming and error prone. We propose the abstraction of Robust Management Elements, which are able to heal themselves under continuous churn. Our approach is based on replicating a management element using finite state machine replication with a reconfigurable replica set. Our algorithm automates the reconfiguration (migration) of the replica set in order to tolerate continuous churn. For data consistency, we propose a majority-based distributed key-value store supporting multiple consistency levels that is based on a peer-to-peer network. The store enables the tradeoff between high availability and data consistency. Using majority allows avoiding potential drawbacks of a master-based consistency control, namely, a single-point of failure and a potential performance bottleneck.

In the third part, we investigate self-management for Cloud-based storage systems with the focus on elasticity control using elements of control theory and machine learning. We have conducted research on a number of different designs of an elasticity controller, including a State-Space feedback controller and a controller that combines feedback and feedforward control. We describe our experience in designing an elasticity controller for a Cloud-based key-value store using state-space model that enables to trade-off performance for cost. We describe the steps in designing an elasticity controller. We continue by presenting the design and evaluation of ElastMan, an elasticity controller for Cloud-based elastic key-value stores that combines feedforward and feedback control.

Place, publisher, year, edition, pages
Stockholm: KTH Royal Institute of Technology, 2012. xix, 266 p.
Series
TRITA-ICT-ECS AVH, ISSN 1653-6363 ; 12:04
Keyword
Self-Management, Autonomic Computing, Control Theory, Distributed Systems, Grid Computing, Cloud Computing, Elastic Services, Key-Value Stores
National Category
Computer Systems
Research subject
SRA - ICT
Identifiers
urn:nbn:se:kth:diva-101661 (URN)978-91-7501-437-1 (ISBN)
Public defence
2012-09-26, Sal E, Forum IT-Universitetet, KTH, Isajordsgatan 39, Kista, 14:00 (English)
Opponent
Supervisors
Funder
ICT - The Next Generation
Note

QC 20120831

Available from: 2012-08-31 Created: 2012-08-30 Last updated: 2014-01-23Bibliographically approved

Open Access in DiVA

fulltext(744 kB)508 downloads
File information
File name FULLTEXT01.pdfFile size 744 kBChecksum SHA-512
7948a73a94a7e46b99d4328942f72c14c290b6930492fc7b05a2119567f7c0382e0e24ec78d3aaf377d1328f3bf47c2f2bdbffb8d30fa1050b45719ee1f27b54
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Al-Shishtawy, AhmadVlassov, Vladimir
By organisation
Software and Computer systems, SCS
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 508 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 371 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf