面向网格结构云服务的偏斜感知矩阵分解方法

PDF格式 | 2.82MB | 更新于2024-08-27 | 31 浏览量 | 举报

"本文提出了一种针对网格结构云服务的偏斜度感知矩阵分解方法，旨在优化服务间的通信并提升网络可见性。" 在云服务领域，为了满足客户端的可扩展性和快速响应需求，现代云服务越来越多地以分布式服务网格的形式部署。在这种结构中，服务与服务之间的通信频繁发生。然而，网格中的任意两个节点之间都可能出现问题事件，这强调了最大化网络可视性的必要性。现有的先进方法是通过低秩矩阵分解来基于潜在因子模型建模对间往返时间（RTT）。传统的潜在因子模型将RTT表示为一个低秩矩阵分解，其中每个潜在因子对应于分解模型中的一个秩-1组件，并被所有节点对共享。然而，不同的节点对通常会经历到一组偏斜的隐藏因子，这是现有模型未充分考虑的。在本文中，作者提出了名为SMF（Skewness-Aware Matrix Factorization）的新方法，该方法将矩阵分解拆分为基本的秩-1潜在因子单元。 SMF方法的核心创新在于它认识到并考虑了矩阵中的数据分布偏斜性。这种偏斜度反映了节点对之间潜在通信延迟的差异性。通过更精确地捕捉这种偏斜性，SMF能够提供更准确的RTT预测，从而有助于更好地理解和优化服务间的通信性能。此外，这种方法还能帮助识别和缓解网络中的热点问题，提高整个服务网格的稳定性。在实际应用中，SMF可能用于预测和预防云服务中的延迟问题，帮助系统管理员提前调整资源分配，以应对可能的服务质量下降。通过这种方式，SMF可以促进云服务的高效运行，减少故障发生的可能性，提升客户满意度。为了验证SMF的有效性，文章可能会详细介绍实验设计、实施步骤以及与现有方法的比较。这些实验结果可能包括性能指标的提升，如预测准确性、响应时间和资源利用率的改善。通过这些实证分析，作者将展示SMF相对于传统矩阵分解方法的优势，进一步证明其在复杂网格结构云服务环境中的适用性和实用性。 "A Skewness-Aware Matrix Factorization Approach for Mesh-Structured Cloud Services"这篇研究论文提出了一个新颖的解决方案，用于解决分布式服务网格中通信性能优化的问题，特别是在考虑数据偏斜性的情况下，这对于理解并改进大规模云服务的性能至关重要。

1600 IEEE/ACM TRANSACTIONS ON NETWORKING, VOL. 27, NO. 4, AUGUST 2019

set of latent factors, which may not hold when node pairs

experience diverse hidden factors. Our work addresses this

challenge via a skewness-aware matrix factorization model.

C. Matrix Completion

Our study is related with the matrix-completion theory [5],

which recovers an incomplete matrix via a subset of observed

entries. For a rank-rm×n matrix (r  (m, n)) that meets an

incoherent condition

, a unique rank-r matrix can be recovered

with a high probability. Minimizing the matrix rank exactly is

NP-hard [5]. OR1MP [43] iteratively ﬁnds a rank-one matrix

out of the approximation residual with the SVD. However,

it is generally impossible to exactly recover the SVD result

for a partially observed matrix. Further, different node pairs

are likely to be correlated with heterogeneous latent factors.

D. Trafﬁc Matrix Interpolation

Real-world trafﬁc matrices are usually incomplete. Con-

sequently, interpolating missing entries becomes important.

Trafﬁc matrix interpolation is a related, but different problem,

with different properties. Xie et al. [46]–[48] exploit hidden

spatial and temporal structures with three-dimensional low-

rank tensors, which effectively reduces the estimation error.

Zhang et al. [53] interpolate incomplete trafﬁc matrices with

structure regularized low-rank matrix factorization and local

interpolation procedures. LENS [7] models the trafﬁc matrices

as the sum of multiple matrices that are positively correlated

with the trafﬁc matrix. Our work proposes a unifying model

that keeps the low-rank interpretation and adapts well to

skewed latent factors.

III. P

ROBLEM STATEMENT

We ﬁrst present the measurement environment for

the mesh-structured cloud services, then introduce the

matrix-factorization results, and discuss the open questions.

A. Measurement Architecture

A service mesh typically consists of a set of nodes located in

mega data center networks or edge data-center networks. Each

node hosts a set of networked micro-services, as discussed

in the introduction. Service-to-service communication is fre-

quent, while the latency between sending service requests and

obtaining responses should meet network SLAs. We assume

that, the service mesh should have synchronized their clocks,

as otherwise we could not correlate the network problems in

different locations. The synchronization protocols such as Net-

work Time Protocol (NTP) [2] or the IEEE 1588 Precise Time

Protocol (PTP) [1] can provide millisecond-level precision for

geo-distributed nodes.

The measurement system is comprised of two main com-

ponents inspired by the software deﬁned networks [23], [33],

[50]: a data plane that consists of service-mesh nodes and a

control plane on a logically centralized server.

(i) At the control plane, the logically centralized controller

schedules the Round-Trip Time (RTT) measurement process

Incoherence [5] states that the singular vectors spread out to help the matrix

be loosely aligned with the coordinate axes.

in the data plane. The controller randomly samples a small list

of nodes as probing targets for each service-mesh node. The

choices of probing targets are randomized for different nodes

for load balancing. The number of probing targets depends on

the node’s measurement capability. For a scale of hundreds

of nodes, selecting tens of probing targets sufﬁces to obtain a

good level of accuracy.

Further, the controller handles the churns of nodes, since

an ofﬂine node is useless and should be detected and ﬁl-

tered. Accordingly, the controller keeps the online status of

the data plane as volatile states in the main memory. Each

online node periodically sends a heart-beating message to the

centralized controller to notify its online status; as a response,

the controller piggybacks a list of sampled online nodes. The

frequency of the heart-beat messages is platform-dependent,

where stable platforms could choose a long period, while edge

platforms should choose a relatively short period (e.g., one

minute) to reﬂect system churns.

(ii) At the data plane, each service-mesh node performs a

number of measurements towards other nodes in the same

platform. It downloads the list of probing targets from the

controller, and measures the RTT status towards these probing

targets in a periodical approach. After collecting the RTT

samples in an interval, each node uploads the RTT results

to the persistent storage that is accessed by the controller.

The data plane could use any kinds of measurement meth-

ods. For example, at the network or transport level, the data

plane may choose ICMP or TCP protocol based measurement

methods; at the application level, the data plane could use

RPC or HTTP protocol based methods. Generally, the RTT

value amounts to the absolute difference between the time

of sending a request message to the probing target and that

of receiving the response message from this probing target.

The unit of a measurement interval determines the granularity

of the monitoring process. Increasing the sampling interval

towards a probing target yields a coarser granularity.

B. Challenges for RTT Matrix Completion

For a set of N nodes, the pairwise RTTs between N nodes

in an interval can be represented as a N-by-N matrix D.

The state-of-the-art approaches predict pairwise RTT values

based on the matrix factorization approach, which factorizes

amatrixD ∈ R

N×N

as a product of two low-dimensional

factor matrices F ∈ R

N×r

and G ∈ R

N×r

,i.e.,D ≈ FG

where r  N,and

denotes the transpose of a matrix.

A matrix factorization model is equivalent to a sum of a set

of rank-one matrices:

D = FG



(1)

where F

denote the k-th (k ≤ r) column vector of the

matrix F and G, respectively. An objective function seeks

to minimize the approximation residual between the observed

entries and the sum of the rank-one matrices:

min

F,G



D −



k=1

∗k



(2)

剩余13页未读，继续阅读

weixin_38517113

粉丝: 3

面向网格结构云服务的偏斜感知矩阵分解方法

Skewness-aware clustering tree for unevenly distributed spatial sensor nodes in smart city

Introduction to ANSYS ICEMCFD

exponential-skewness:指数分布偏度

rayleigh-skewness:瑞利分布偏度

poisson-skewness:泊松分布偏度

chisquare-skewness:卡方分布偏度

t-skewness:学生的t分布偏度

skewreduction.m- skewness reduction（转换方法）：数据转换技术，以减少非正态数据的偏度。 （非正态数据到正态数据）-matlab开发

workbench skewness

最新资源

skewreduction.m- skewness reduction（转换方法）：数据转换技术，以减少非正态数据的偏度。（非正态数据到正态数据）-matlab开发