QoS-Based Dynamic Feedback Load Balancing Optimization Algorithm for Distributed File Systems


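This paper presents an adaptive dynamic-feedback load balancing algorithm based on quality of service (QoS), aimed at improving load balancing in distributed file systems. The algorithm combines the strengths of several load balancing strategies, with particular attention to the efficiency and reliability of task processing. Its core idea is to measure the demand of a requested task with a QoS metric composed of the number of files and the total file size. The master node sets a threshold from this metric and uses it to select the storage nodes able to satisfy the task, so that each task runs on the most suitable resources and the system's performance-to-load ratio improves.

The design takes several performance indicators into account in real time: CPU utilization, memory usage, disk I/O utilization, network bandwidth usage, and disk space usage. These indicators drive real-time balancing decisions because they directly affect the fairness of task allocation and the responsiveness of the system. By adjusting dynamically to each node's actual working state, the algorithm avoids both over-concentration and idling of resources, which raises the overall QoS level.

The paper also considers network heterogeneity: when scheduling tasks, the master node accounts for performance differences among nodes under different network conditions. This helps keep load balancing efficient and stable even in complex, changing network environments.

In short, the algorithm considers the size and demands of the task itself as well as server resource utilization and the network environment, with the goal of an adaptable and reliable load balancing scheme for distributed file systems. The results are relevant to relieving performance bottlenecks in large-scale distributed systems and may find application in cloud computing and big data environments.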

32 Journal of Communications and Information Networks

from the real-time load value and the normalized performance.

• The concept of QoS is introduced into the distributed file system. We use a tuple composed of the number of files and the total file size of the requested task as a measure of QoS. The master node sets a threshold based on the QoS of the requested task to filter the set of storage nodes that meet the requirements of the task.
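The filtering step in the bullet above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the excerpt does not say how the threshold is derived from the QoS tuple, so the `headroom` factor, the `StorageNode` fields, and the function name `filter_nodes` are all assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QoS:
    """QoS tuple of a requested task: (number of files, total file size)."""
    num_files: int
    total_bytes: int


@dataclass(frozen=True)
class StorageNode:
    name: str
    free_file_slots: int  # hypothetical: how many more files the node can accept
    free_bytes: int       # hypothetical: remaining capacity in bytes


def filter_nodes(nodes, qos, headroom=1.2):
    """Return the nodes whose spare capacity clears the task's threshold.

    Assumption: the threshold is the QoS demand scaled by a headroom factor.
    """
    need_files = qos.num_files * headroom
    need_bytes = qos.total_bytes * headroom
    return [
        n for n in nodes
        if n.free_file_slots >= need_files and n.free_bytes >= need_bytes
    ]


GIB = 2**30
nodes = [
    StorageNode("s1", free_file_slots=1000, free_bytes=50 * GIB),
    StorageNode("s2", free_file_slots=10, free_bytes=200 * GIB),
    StorageNode("s3", free_file_slots=500, free_bytes=1 * GIB),
]
task = QoS(num_files=100, total_bytes=4 * GIB)
candidates = filter_nodes(nodes, task)
print([n.name for n in candidates])
```

Here only `s1` passes: `s2` has too few file slots and `s3` too little free space. A real master node would then choose among the surviving candidates using load information.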

2 Related work

Load balancing is a higher-level load allocation strategy than load sharing. It must distribute the system load to each node, eliminate or avoid any load imbalance problems, and optimize the overall performance of the distributed file system. Load balancing algorithms can be divided into two categories [11]: static load balancing [12] and dynamic load balancing [9,13].

Static load balancing [14,15] is also known as state-independent balancing. It determines a load allocation strategy before a task is triggered, meaning the master node does not consider the real-time load status of each storage node while processing a request, but instead operates based on known system static information to make decisions and assign tasks. The advantages of static load balancing are that the logic is simple, the overhead is small, and a task request can be quickly allocated to each storage node. However, it does not consider the real-time load of the storage nodes or dynamic changes in the system state. The task assignments are made blindly and the accuracy is low, causing task allocation to be uneven and limiting the system load balance [10].

Dynamic load balancing [10] focuses on the state information of the system: by analyzing the real-time load of each storage node, it allocates tasks to the storage nodes dynamically. Its advantages are that it can adjust the allocation of tasks in real time based on the load information of the storage nodes, adapt to changes in the load state of the system, and offer excellent flexibility. However, it also has some disadvantages. The master node must periodically collect the status of the storage nodes, necessitating frequent information exchange between the master node and the storage nodes. This exchange creates network overhead and wastes network bandwidth [10].
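The trade-off described here can be made concrete with a small sketch: the master keeps the latest load report from each storage node and picks the least loaded one, but only reports younger than a freshness window count. The class name `DynamicBalancer`, the single scalar `load` in [0, 1], and the `max_age` window are assumptions for illustration and do not come from the paper.

```python
import time


class DynamicBalancer:
    """Pick the least-loaded node among those with a fresh load report."""

    def __init__(self, max_age=5.0):
        self.max_age = max_age  # seconds a report stays valid (assumed value)
        self.reports = {}       # node name -> (load, timestamp)

    def report(self, node, load, now=None):
        """Record a status report sent by a storage node."""
        ts = time.monotonic() if now is None else now
        self.reports[node] = (load, ts)

    def pick(self, now=None):
        """Return the node with the lowest load among fresh reports."""
        now = time.monotonic() if now is None else now
        fresh = {
            node: load
            for node, (load, ts) in self.reports.items()
            if now - ts <= self.max_age
        }
        if not fresh:
            raise RuntimeError("no fresh load reports available")
        return min(fresh, key=fresh.get)


lb = DynamicBalancer(max_age=5.0)
lb.report("s1", 0.80, now=0.0)
lb.report("s2", 0.35, now=0.0)
lb.report("s3", 0.10, now=0.0)
first = lb.pick(now=1.0)

# s3 stops reporting; s1 and s2 refresh their status later.
lb.report("s1", 0.80, now=9.0)
lb.report("s2", 0.35, now=9.0)
second = lb.pick(now=10.0)
print(first, second)
```

A shorter `max_age` gives more accurate decisions but forces nodes to report more often, which is exactly the network overhead the paragraph points out.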

The weighted rotation scheduling algorithm [16] is an upgraded version of the round robin algorithm. The round robin algorithm assigns all tasks in turn to each storage node in the system, so that work is handled by the nodes in a circular pattern. In other words, each node is assigned a fixed time slice and performs a task at the designated time on its turn [17,18]. A scheduling and load balancing algorithm that considers the capabilities of each VM, the task length of each requested job, and the interdependency of multiple tasks was proposed [17]. Under plain round robin, some nodes may encounter heavy loads while others may have no task requests. This issue can be mitigated by a weighted round robin algorithm, in which each node receives a specific number of requests according to its assigned weight [19,20]. The weighted rotation scheduling algorithm sets different weighting factors for different storage nodes based on their processing ability. Tasks are assigned according to these weights, and higher priorities are given to storage nodes with higher weight factors. The algorithm is more efficient when dealing with requests with shorter time spans, because the load becomes unbalanced when handling tasks with long time spans.
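As a rough illustration of the two schedulers discussed in this paragraph, the sketch below implements plain round robin and a simple weighted variant that repeats each node in the rotation according to its weight. The node names and integer weights are made up for the example, and production schedulers often use a smoother interleaving than this naive expansion.

```python
from collections import Counter
from itertools import cycle, islice


def round_robin(nodes):
    """Yield nodes in a fixed circular order, ignoring their capacity."""
    return cycle(nodes)


def weighted_round_robin(weights):
    """Yield nodes so that each appears `weight` times per rotation.

    `weights` maps node name -> positive integer weight (assumed to reflect
    the node's processing ability).
    """
    rotation = [node for node, w in weights.items() for _ in range(w)]
    return cycle(rotation)


nodes = ["s1", "s2", "s3"]
rr_order = list(islice(round_robin(nodes), 6))

weights = {"s1": 3, "s2": 2, "s3": 1}
wrr_counts = Counter(islice(weighted_round_robin(weights), 12))

print(rr_order)
print(dict(wrr_counts))
```

Over 12 requests the weighted scheduler sends 6 to `s1`, 4 to `s2`, and 2 to `s3`. Neither scheduler looks at how long each task runs, which is why long tasks can still leave the load unbalanced.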

In the minimum connection scheduling algorithm [21], the master node of the distributed file system detects and records the current number of active connections of its storage nodes in real time. When a new request arrives, the master node assigns it to the storage node with the smallest number of active connections, and increments the number of active connections for that node by one. When the task is completed, the number of active connections is decremented by one. If the processing capacity of all the storage nodes in the distributed file system is the same, the minimum connection scheduling algorithm will distribute requests with large loads to each storage node in a balanced manner, which is more efficient. However, the real environment of a
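The connection bookkeeping described in this paragraph can be sketched in a few lines. The class and method names (`LeastConnections`, `acquire`, `release`) are hypothetical, and ties are broken by node insertion order, which the text does not specify.

```python
class LeastConnections:
    """Assign each request to the node with the fewest active connections."""

    def __init__(self, nodes):
        self.active = {node: 0 for node in nodes}

    def acquire(self):
        """Choose a node for a new request and count the new connection."""
        node = min(self.active, key=self.active.get)
        self.active[node] += 1
        return node

    def release(self, node):
        """Record that a task on `node` has completed."""
        if self.active[node] == 0:
            raise ValueError(f"{node} has no active connections")
        self.active[node] -= 1


lc = LeastConnections(["s1", "s2", "s3"])
assigned = [lc.acquire() for _ in range(3)]  # one request per node
lc.release("s2")                             # the task on s2 finishes first
next_node = lc.acquire()                     # s2 now has the fewest connections
print(assigned, next_node)
```

The counter treats every connection as equal cost, so the scheme is only balanced when all nodes have the same processing capacity, as the paragraph notes.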
