The increasing use of computers in various fields has led to the popularization of cloud computing due to its advantages, such as large scale, high performance, low power consumption, high reliability, and low cost. However, the rapid growth of business volume and kinds of computer hardware resources, along with complex computer architecture, poses challenges in allocating hardware resources. Traditional methods struggle to meet the varying requirements of different businesses, hinder automatic tuning, and limit the improvement of overall system performance. In this regard, we propose an automatic QoS-aware resource partitioning framework that aims to maximize the overall performance of the system. Our contributions include an automatic performance tuning framework for cloud environments, a Deep Q-learning Network (DQN) based performance optimization method, and achieving a speed-up of at most 1.73 times compared to uniform partitions for CPU overload scenarios involving throughput-aware and latency-critical workloads. The proposed framework addresses the challenges faced by cloud computing by adopting different allocation methods for hardware resources that maximize performance in different application scenarios. The framework efficiently utilizes emerging hardware resource control capabilities to improve hardware and system performance. The use of DQN in performance optimization allows the framework to learn from past experiences and adapt to different situations, resulting in better resource allocation decisions. The proposed framework can significantly reduce the waste of resources and unnecessary expenditures for governments and enterprises. Our framework can serve as a guide for future research in cloud computing and related fields.
|