Profit Maximization in Cloud Computing

Saistha N; Senthil Nathan K

Profit Maximization in Cloud Computing

Saistha N¹, Senthil Nathan K²

PG Scholar, Department of CSE, P.S.V College of Engineering and Technology, Krishnagiri, Tamilnadu, India
Assistant Professor, Department of CSE, P.S.V College of Engineering and Technology, Krishnagiri , Tamilnadu, India

Related article at Pubmed, Scholar Google

Visit for more related articles at International Journal of Innovative Research in Computer and Communication Engineering

Abstract

Cloud computing is a novel paradigm for the provision of computing infrastructure, which aims to shift the location of the computing infrastructure to the network in order to reduce the costs of management and maintenance of hardware and software resources. To maximize the profit, a service provider should understand both service charges and business costs. Our pricing model takes such factors into considerations as the amount of a service, the workload of an application environment, the configuration of a multi-server system, the service-level agreement, the satisfaction of a consumer, the quality of a service, the penalty of a low-quality service, the cost of renting, the cost of energy consumption, and a service provider’s margin and profit. Our approach is to treat a multi-server system as an M/M/m queuing model, such that our optimization problem can be formulated and solved analytically. We evaluate its performance using an analytical model and solve it to obtain important performance factors like mean number of tasks in the system.

Keywords

Cloud computing, multi-server system, pricing model, profit, queuing model, server configuration service charge, service-level agreement, waiting time.

I. INTRODUCTION

Cloud computing is quickly becoming an effective and efficient way of computing resources and computing services consolidation. By centralized management of resources and services, cloud computing delivers hosted services over the Internet, such that accesses to shared hardware, software, databases, information, and all resources are provided to consumer’s on-demand. Cloud computing is able to provide the most cost-effective and energy-efficient way of computing resources management and computing services provision. Cloud computing turns information technology into ordinary commodities and utilities by using the pay-per-use pricing model. However, cloud computing will never be free, and understanding the economics of cloud computing becomes critically important

Cloud computing environment is a three-tier structure, which consists of infrastructure vendors, service providers, and consumers. An infrastructure vendor maintains basic hardware and software facilities. A service provider rents resources from the infrastructure vendors, builds appropriate multi-server systems, and provides various services to users. A consumer submits a service request to a service provider, receives the desired result from the service provider with certain service-level agreement, and pays for the service based on the amount of the service and the quality of the service. A service provider can build different multi-server systems for different application domains, such that service requests of different nature are sent to different multi-server systems. Each multi-server system contains multiple servers, and such a multi-server system can be devoted to serve one type of service requests and applications.

An application domain is characterized by two basic features, i.e., the workload of an application environment and the expected amount of a service. The configuration of a multi-server system is characterized by two basic features, i.e., the size of the multi-server system (the number of servers) and the speed of the multi-server system (execution speed of the servers). Like all business, the pricing model of a service provider in cloud computing is based on two components, namely, the income and the cost. For a service provider, the income (i.e., the revenue) is the service charge to users, and the cost is the renting cost plus the utility cost paid to infrastructure vendors. A pricing model in cloud computing includes many considerations, such as the amount of a service (the requirement of a service), the workload of an application environment, the configuration (the size and the speed) of a multi-server system, the service-level agreement, the satisfaction of a consumer (the expected service time), the quality of a service (the task waiting time and the task response time),the penalty of a low-quality service, the cost of renting, the cost of energy consumption, and a service provider’s margin and profit. The profit (i.e., the net business gain) is the income minus the cost.

To maximize the profit, a service provider should understand both service charges and business costs. The service charge to a service request is determined by two factors, the expected length of the service and the actual length of the service. The expected length of a service (i.e., the expected service time) is the execution time of an application on a standard server with a baseline or reference speed. Once the baseline speed is set, the expected length of a service is determined by a service request itself.

The service requirement (amount of service) measured by the number of instructions to be executed. The longer (shorter, respectively) the expected length of a service is, the more (less, respectively) the service charge is. The actual length of a service (i.e., the actual service time) is the actual execution time of an application. The actual length of a service depends on the size of a multi-server system, the speed of the servers (which may be faster or slower than the baseline speed), and the workload of the multi-server system. Notice that the actual service time is a random variable, which is determined by the task waiting time once a multi-server system is established. There are many different service performance metrics in service-level agreements. Our performance metric in this paper is the task response time (or the turnaround time), i.e., the time taken to complete a task, which includes task waiting time and task execution time. The service-level agreement is the promised time to complete a service, which is a constant times the expected length of a service.

If the actual length of a service is (or, a service request is completed) within the service-level agreement, the service will be fully charged. However, if the actual length of a service exceeds the service-level agreement, the service charge will be reduced. The longer (shorter, respectively) the actual length of a service is, the more (less, respectively) the reduction of the service charge is. In other words, there is penalty for a service provider to break a service-level agreement. If the actual service time exceeds certain limit (which is service request dependent), a service will be entirely free with no charge.

The cost of a service provider includes two components the renting cost and the utility cost. The renting cost is proportional to the size of a multi-server system, i.e., the number of servers. The utility cost is essentially the cost of energy consumption and is determined by both the size and the speed of a multi-server system. The faster (slower, respectively) the speed is, the more (less, respectively) the utility cost is. To calculate the cost of energy consumption, we need to establish certain server speed and power consumption models. To increase the revenue of business, a service provider can construct and configure a multi-server system with many servers of high speed. Since the actual service time (i.e., the task response time) contains task waiting time and task execution time, more servers reduce the waiting time and faster servers reduce both waiting time and execution time. Hence, a powerful multi-server system reduces the penalty of breaking a service-level agreement and increases the revenue. However, more servers (i.e., a larger multi-server system) increase the cost of facility renting from the infrastructure vendors and the cost of base power consumption. Furthermore, faster servers increase the cost of energy consumption. Such increased cost may counterweight the gain from penalty reduction. Therefore, for an application environment with specific workload which includes the task arrival rate and the average task execution requirement, a service provider needs to decide an optimal multi-server configuration (i.e., the size and the speed of a multi-server system), such that the expected profit is maximized.

In this paper, the problem of optimal multi-server configuration for profit maximization in a cloud computing environment is presented. Our approach is to treat a multi-server system as an M/M/m queuing model, such that our optimization problem can be formulated and solved analytically. We consider two server speed and power consumption models, namely, the idle-speed model and the constant-speed model.

The probability density function (pdf) of the waiting time of a newly arrived service request is derived. This result is significant in its own right and is the base of our discussion. The expected service charge to a service request is calculated. Based on these results, we get the expected net business gain in one unit of time, and obtain the optimal server size and the optimal server speed numerically.

Here, P[e] is used to denote the probability of an event e. For a random variable x, we use fx(t) to represent the probability density function of x, and Fx(t) to represent the cumulative distribution function (cdf) of x and x¯ to represent the expectation of x. A cloud computing service provider serves users’ service requests by using a multiserver system, which is constructed and maintained by an infrastructure vendor and rented by the service provider. The architecture detail of the multi-server system can be quite flexible. Examples are blade servers and blade centers where each server is a server blade, clusters of traditional servers where each server is an ordinary processor, and multi-core server processors where each server is a single core. We will simply call these blades/processors/cores as servers. Users (i.e., customers of a service provider) submit service requests (i.e., applications and tasks) to a service provider, and the service server system. By taking an economic approach to providing service-oriented and utility computing, a service provider allocates resources and schedules tasks in such a way that the total profit earned is maximized.

II. RELATED WORKS

Cloud service differs from traditional hosting in three principal aspects. First, it is provided on demand; second, it is elastic since users that use the service have as much or as little as they want at any given time (typically by the minute or the hour); and third, the service is fully managed by the provider. Due to dynamic nature of cloud environments, diversity of user requests, and time dependency of load, providing agreed quality of service (QoS) while avoiding overprovisioning is a difficult task. Since many of the large cloud centers employ virtualization to provide the required resources such as PMs, we consider PMs with a high degree of virtualization. Real cloud providers offer complex requests for their users. For instance, in Amazon EC2, the user is allowed to run up to On-Demand or Reserved Instances, and up to 100 Spot Instances per region. We examined the effects of various parameters including ST arrival rate, task service time, the virtualization degree, and ST size on task rejection probability and total response delay. The stable, transient and unstable regimes of operation for given configurations have been identified so that capacity planning is going to be a less challenging task for cloud providers.

The cluster RMS supports four main functionalities: resource management; job queuing; job scheduling; and job execution. It manages and maintains status information of the resources such as processors and disk storage in the cluster system. Jobs submitted into the cluster system are initially placed into queues until there are available resources to execute the jobs. The cluster RMS then invokes a scheduler to determine how resources are assigned to jobs. After that, the cluster RMS dispatches the jobs to the assigned nodes and manages the job execution processes before returning the results to the users upon job completion.

Performance evaluation focuses on a producer by using a user-centric cost evaluation factor to determine the average yield or earning each cluster manager achieves. Simulation results show that considering and balancing the potential gain of accepting a task instantly with the risk of future loss provides better returns for competing cluster managers. To allow service providers to decide whether to accept work, a resource provider will offer predictions of how many resources it will have available when, using a set of tuples. The new algorithms are compared with previously proposed approaches, and evaluated across a range of operating conditions: load, resource price and quantity, utility function shape (client impatience), and resource uncertainty level. Our experiments show that the new algorithms can extract higher pro rates than the previous ones. In the future, we would like to experiment with nontechnical aspects of the economics-based approach, such as determining how to help customers express utility functions easily. We would also like to explore alternative ways to describe and manage risk. Resources to run the jobs are obtained from a separate resource provider, who offers predictions of the number of resources likely to be available at different times in the future, and their price.

Cloud systems automatically control and optimize resource use by leveraging a metering capability at some level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts). Resource usage can be monitored, controlled, and reported, providing transparency for both the provider and consumer of the utilized service.

III. MULTI SERVER MODEL

A cloud computing service provider serves users’ service requests by using a multi-server system, which is constructed and maintained by an infrastructure vendor and rented by the service provider. The architecture detail of the multiserver system can be quite flexible. Examples are blade servers and blade centers where each server is a server blade, clusters of traditional servers where each server is an ordinary processor, and multi-core server processors where each server is a single core We will simply call these blades/processors/cores as servers. Users (i.e., customers of a service provider) submit service requests (i.e., applications and tasks) to a service provider, and the service provider serves the requests (i.e., run the applications and perform the tasks) on a multi-server system.

Here multi-server system is treated as an M/M/m queuing system which is elaborated as follows. There is a Poisson stream of service requests with arrival rate λ, i.e., the inter-arrival times are independent and identically distributed exponential random variables with mean 1/ λ.

Notice that although an M/G/m queuing system has been considered, the M/M/m queuing model is the only model that accommodates an analytical and closed form expression of the probability density function of the waiting time of a newly arrived service request. Let μ = 1 / x¯ = s / r¯ be the average service rate, i.e., the average number of service requests that can be finished by a server of S in one unit of time. The server utilization is ρ = λ/ mμ , which is the average percentage of time that a server of S is busy. Let Pk denote the probability that there are k service requests (waiting or being processed) in the M/M/m queuing system for S.

IV. POWER CONSUMPTION MODEL

Power dissipation and circuit delay in digital CMOS circuits can be accurately modelled by simple equations, even for complex microprocessor circuits. CMOS circuits have dynamic, static, and short-circuit power dissipation; however, the dominant component in a well-designed circuit is dynamic power consumption P (i.e., the switching component of power), which is approximately P =aCV 2f, where a is an activity factor, C is the loading capacitance, V is the supply voltage, and f is the clock frequency We will consider two types of server speed and power consumption models. In the idle-speed model, a server runs at zero speed when there is no task to perform. Since the power for speed s is the average amount of energy consumed by a server in one unit of time , the speed of a server is zero when it is idle. In the constant-speed model, all servers run at the speed s even if there is no task to perform. Again, we use P to represent the power allocated to multi-server system S.

If all the servers have a fixed speed s, the execution time of a service request with execution requirement r is known as x = r/s. The response time to the service request is T = W + x = W + r/s. The response time T is related to the service charge to a customer of a service provider in cloud computing. To study the expected service charge to a customer, we need a complete specification of a service charge based on the amount of a service, the service-level agreement, the satisfaction of a consumer, the quality of a service, the penalty of a low-quality service, and a service provider’s margin and profit.

V. NET BUSINESS GAIN

Since the number of service requests processed in one unit of time is λ in a stable M/M/m queuing system, the expected service charge in one unit of time is λC, which is actually the expected revenue of a service provider. Assume that the rental cost of one server for unit of time is β. Also, assume that the cost of energy is γ per Watt. The cost of a service provider is the sum of the cost of infrastructure renting and the cost of energy consumption, i.e., βm+γP. Then, the expected net business gain (i.e., the net profit) of a service provider in one unit of time is G= λC – (βm + γP), which is defined as the revenue minus the cost. The more service requests bring more revenue and net business gain; however, after the number of service requests per unit of time reaches certain point, the excessive waiting time causes increased lateness penalty, so that there is no revenue and negative business gain. There are two situations that cause negative business gain. In the first case, there is no enough business (i.e., service requests). In this case, a service provider should consider reducing the number of servers m and/or server speed s, so that the cost of infrastructure renting and the cost of energy consumption can be reduced. In the second case, there is too much business (i.e., service requests). In this case, a service provider should consider increasing the number of servers and/or server speed, so that the waiting time can be reduced and the revenue can be increased. However, increasing the number of servers and/or server speed also increases the cost of infrastructure renting and the cost of energy consumption. Therefore, we have the problem of selecting the optimal server size and/or server speed so that the profit is maximized.

VI. PROFIT MAXIMIZATION

expected services charge which has an upper bound ar¯ ; on the other hand, the cost of a service provider (i.e., the rental cost and base power consumption) increases, so that the net business gain is actually reduced.

VII. CONCLUDING REMARKS

We have proposed a pricing model for cloud computing which takes many factors into considerations, such as the requirement r of a service, the workload of an application environment, the configuration (m and s) of a multi-server system, the service level agreement c, the satisfaction (r and s) of a consumer, the quality (W and T) of a service, the penalty d of a low-quality service, the cost of renting, the cost of energy consumption, and a service provider’s margin and profit a. By using an M/M/ m queuing model, we formulated and solved the problem of optimal multi-server configuration for profit maximization in a cloud computing environment. Our discussion can be easily extended to other service charge functions. Our methodology can be applied to other pricing models. At three-tier cloud structure, which consists of infrastructure vendors, service providers and consumers, the latter two parties are particular interest to us. Clearly, scheduling strategies in this scenario should satisfy the objectives of both parties. Our contributions include the development of a pricing model using processor-sharing for clouds, the application of this pricing model to composite services with dependency consideration, and the development of two sets of profit-driven scheduling algorithms.

VIII. ACKNOWLEDGEMENT

The success and final outcome of this project required a lot of guidance and assistance from many people and I am extremely fortunate to have got this all along the completion of my project work. Whatever I have done is only due to such guidance and assistance and I would not forget to thank them. I respect and thank Mr. B. SAKTHIVEL M.E., Head of the Department of Computer Science and Engineering for providing all support and guidance which made me to complete the project on time. I take this opportunity to express my profound gratitude and deep regards to my guide Mr. K. SENTHIL NATHAN M.E., Assistant Professor or his exemplary guidance, monitoring and constant encouragement throughout the course of this project. The blessing, help and guidance given by him time to time shall carry me a long way in the journey of life on which I am about to embark .Lastly, I thank almighty, my parents, husband and children for their constant encouragement without which this assignment would not be possible.

References

Armbrust. M et al., “Above the Clouds: A Berkeley View of Cloud Computing,” Technical Report No. UCB/EECS-2009-28, Feb. 2009.
Buyya .R, Yeo C.S, Venugopal .S , Broberg . J, and Brandic .J, “Cloud Computing and Emerging IT Platforms: Vision, Hype, and Reality for Delivering Computing as the Fifth Utility,” Future Generation Computer Systems, vol. 25, no. 6, pp. 599-616, 2009.
Chandrakasan A.P, Sheng S, and Brodersen R.W, “Low-Power CMOS Digital Design,” IEEE J. Solid-State Circuits, vol. 27, no. 4, pp. 473- 484, Apr. 1992.
Chun B.N and Culler D.E, “User-Centric Performance Analysis of Market-Based Cluster Batch Schedulers,” Proc. Second IEEE/ ACM Int’l Symp. Cluster Computing and the Grid, 2002.
Durkee . D, “Why Cloud Computing Will Never be Free,” Comm. ACM, vol. 53, no. 5, pp. 62-69, 2010.
Irwin D.E, Grit J.E, and Chase J.S, “Balancing Risk and Reward in a Market-Based Task Service,” Proc. 13th IEEE Int’l Symp. High Performance Distributed Computing, pp. 160-169, 2004.
Junwei Cao, Kai Hwang, Kequin Li, Albert Y. Zomaya, “Optimal Multiserver configuration for profit maximization in cloud computing Vol 24, June 2013
Khazaei. H, Misic .J, and Misic V.B, “Performance Analysis of Cloud Computing Centers Using M/G/m/m+r Queuing Systems,” IEEE Trans. Parallel and Distributed Systems, vol. 23, no. 5, p. 936-943, May 2012.
Mell . P and Grance .T, “The NIST Definition of Cloud Computing,” Nat’l Inst. of Standards and Technology, http://csrc.nist. gov/groups/SNS/cloud-computing/, 2009.
Sherwani. J, Ali. N, Lotia . N, Hayat .Z, and Buyya .R, “Libra: A Computational Economy-Based Job Scheduling System for Clusters,” Software - Practice and Experience, vol. 34, pp. 573-590, 2004.
Yeo C.S and Buyya . R, “A Taxonomy of Market-Based Resource Management Systems for Utility-Driven Cluster Computing,” Software - Practice and Experience, vol. 36, pp. 1381-1419, 2006.