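A Survey of In-Network Data Aggregation Techniques for Wireless Sensor Networks

"This paper comprehensively reviews the existing literature on in-network data aggregation techniques for wireless sensor networks. The authors, Elena Fasolo, Michele Rossi, Jörg Widmer and Michele Zorzi, discuss solutions at different protocol layers and emphasize the importance of cross-layer design for optimizing performance. The paper identifies open problems and proposes directions for future research."

In wireless sensor networks, in-network data aggregation is a key technique: it allows collected data to be processed and aggregated inside the network, which reduces communication overhead and improves energy efficiency and network lifetime. A wireless sensor network consists of a large number of small devices with sensing, computation and communication capabilities, and is widely used in areas such as environmental monitoring, military reconnaissance and health care.

The paper first defines suitable criteria for classifying existing solutions; these criteria may be based on the level at which data is processed, the type of data, or the aggregation algorithm. The authors then discuss in depth the aggregation techniques at each protocol layer, from the physical layer to the application layer. The physical layer is concerned with signal transmission and reception, the data link layer handles error detection and correction, the network layer handles route selection, the transport layer is responsible for end-to-end data delivery, and the application layer relates to the specific task, such as data interpretation and decision making. Cross-layer design is an important concept in data aggregation: protocols at different layers need to cooperate to optimize overall performance. For example, the coding strategy at the physical layer may affect the error rate at the data link layer, which in turn affects route selection at the network layer and packet reassembly at the transport layer.

The paper also discusses several challenges in wireless sensor networks, such as energy efficiency, network coverage, security, and the timeliness and accuracy of data. As the number of nodes grows, managing and coordinating them effectively while keeping power consumption low is a key problem that researchers must solve. In addition, because the network is distributed, ensuring secure data transmission and preventing malicious attacks is a major challenge.

Finally, the authors propose some potential directions for future research, including new aggregation algorithms that improve data quality, better energy usage, and new hardware platforms and technologies that support more complex aggregation operations. They also encourage research on integrating machine learning and artificial intelligence into the aggregation process to enable smarter decisions and adaptive network behavior. The paper offers a comprehensive view of data aggregation in wireless sensor networks and is valuable for understanding the current state and future development of the field.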

sensor readings are routed up the aggregation tree.

For the distribution phase, TAG uses a tree based routing

scheme rooted at the sink node. The sink broadcasts a message

asking nodes to organize into a routing tree and then sends its

queries. In each message there is a field specifying the level,

or distance from the root, of the sending node (the level of the

root is equal to zero). Whenever a node receives a message

and it does not yet belong to any level, it sets its own level to

be the level of the message plus one. It also elects the node

from which it receives the message as its parent. The parent is

the node that is used to route messages towards the sink. Each

sensor then rebroadcasts the received message adding its own

identifier (ID) and level. This process continues until all nodes

have been assigned an ID and a parent. The routing messages

are periodically broadcast by the sink in order to keep the tree

structure updated. After the construction of the tree, the queries

are sent along the structure to all nodes in the network. TAG

adopts the selection and aggregation facilities of the database

query languages (SQL). Accordingly, TAG queries have the

following form:

    SELECT {agg(expr), attrs} FROM SENSOR
    WHERE {selPreds}
    GROUP BY {attrs}
    HAVING {havingPreds}
    EPOCH DURATION i

In practice, the sink sends a query, where it specifies the

quantities that it wants to collect (attrs field), how these must

be aggregated (agg(expr)) and the sensors that should be

involved in the data retrieval. This last request is specified

through the WHERE, GROUP BY and HAVING clauses [5].

Finally, an EPOCH DURATION field specifies the time (in seconds)

each device should wait before sending new sensor readings.

This means the readings used to compute an aggregate record

all belong to the same time interval, or epoch.
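
The level and parent assignment used in the distribution phase can be illustrated with a short sketch. The Python fragment below is only an illustration under idealized assumptions (lossless broadcasts, processed in first-in first-out order), not TAG's actual implementation. The function name `build_routing_tree`, the `topology` dictionary and the node names are hypothetical.

```python
from collections import deque


def build_routing_tree(neighbors, sink):
    """Assign a level and a parent to every node reachable from the sink."""
    level = {sink: 0}            # the level of the root is zero
    parent = {sink: None}
    queue = deque([sink])        # nodes that still have to rebroadcast
    while queue:
        sender = queue.popleft()
        for node in neighbors[sender]:
            if node not in level:                 # node has no level yet
                level[node] = level[sender] + 1   # level of the message plus one
                parent[node] = sender             # first sender heard becomes parent
                queue.append(node)
    return level, parent


# Hypothetical five-node topology (adjacency lists).
topology = {
    "sink": ["a", "b"],
    "a": ["sink", "c"],
    "b": ["sink", "c"],
    "c": ["a", "b", "d"],
    "d": ["c"],
}
level, parent = build_routing_tree(topology, "sink")
```

In this sketch node `c` hears the message from `a` first and therefore elects `a` as its parent; in a real deployment the choice depends on which broadcast actually arrives first.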

During the data collection phase, due to the tree structure,

each parent has to wait for data from all of its children before

it can send its aggregate up the tree. Epochs are divided into

shorter intervals called communication slots. The number of

these slots equals the maximum depth of the routing tree. The

slot mechanism has a useful benefit. Since time is slotted,

sensor nodes can be put to sleep until the next scheduled

transmission interval. In practice, a node goes back to sleep

soon after it has finished sending its readings to its parent.

Data aggregation is performed by all intermediate nodes.

However, in order not to limit TAG to the few and very

simple aggregation functions defined by the SQL language

(such as COUNT, MIN, MAX, SUM, and AVERAGE), a

more general classification is accounted for by partitioning

aggregates according to their Duplicate Sensitivity, Exemplary

or Summary, and Monotonic properties [5].
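
As a rough illustration of the collection phase, the sketch below computes an AVERAGE over one epoch by letting the deepest level transmit first, one communication slot per tree level. It assumes that each node forwards a partial record of the form (sum, count) rather than a finished average, so that parents can merge records exactly. The `parent`, `level` and `readings` dictionaries and all function names are hypothetical, and message loss is ignored.

```python
def merge(a, b):
    """Combine two partial records of the form (sum, count)."""
    return (a[0] + b[0], a[1] + b[1])


def average_for_epoch(parent, level, readings):
    """Aggregate one epoch up the tree and return the average at the root."""
    # Nodes without a reading (for example the sink) start with an empty record.
    state = {
        n: (float(readings[n]), 1) if n in readings else (0.0, 0)
        for n in parent
    }
    # One slot per tree level: the deepest nodes send first, so every parent
    # has heard from all of its children before its own slot comes up.
    for depth in range(max(level.values()), 0, -1):
        for node in sorted(n for n in level if level[n] == depth):
            p = parent[node]
            state[p] = merge(state[p], state[node])
    root = next(n for n in parent if parent[n] is None)
    total, count = state[root]
    return total / count


# Hypothetical routing tree and readings for a single epoch.
parent = {"sink": None, "a": "sink", "b": "sink", "c": "a", "d": "c"}
level = {"sink": 0, "a": 1, "b": 1, "c": 2, "d": 3}
readings = {"a": 20, "b": 22, "c": 24, "d": 26}
avg = average_for_epoch(parent, level, readings)
```

Forwarding (sum, count) instead of a local average is what keeps the result exact here: averaging averages would weight subtrees of different sizes incorrectly.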

As for most tree-based schemes, TAG may be inefficient in

case of dynamic topologies or link/device failures: as discussed

above, trees are particularly sensitive to failures at intermediate

nodes as the related subtree may become disconnected. In

addition, as the topology changes, TAG has to re-organize the

tree structure and this means high costs in terms of energy

consumption and overhead.

Directed Diffusion [1] - Directed Diffusion is a reactive

data-centric protocol. The routing scheme is specifically tailored

for those applications where one or a few sinks ask for some

specific information by flooding the network with their queries.

Directed Diffusion is organized in three phases (see Fig. 2,

originally shown in [1]): 1) interest dissemination, 2) gradient

setup and 3) data forwarding along the reinforced paths

(path reinforcement and forwarding). When a certain sink is

interested in collecting data from the nodes in the network,

it propagates an interest message (interest dissemination),

describing the type of data the node is interested in, and

setting a suitable operational mode for its collection. Each

node, on receiving the interest, re-broadcasts it to its neighbors.

In addition, the node sets up interest gradients, i.e., vectors

containing the next hop that has to be used to propagate the

result of the query back to the sink node (gradient setup).

As an illustrative example (see Fig. 2), if the Sink sends an

interest which reaches nodes a and b, and both forward the

interest to node c, then node c sets up two vectors indicating

that the data matching that interest should be sent back to

a and/or b. The strength of such a gradient can be adapted,

which may result in a different amount of information being

redirected to each neighbor. To this end, various metrics such

as the node’s energy level, its communication capability and

its position within the network can be used. Each gradient is

related to the attribute it has been set up for. As the gradient

setup phase for a certain interest is complete, only a single

path for each source is reinforced and used to route packets

towards the sink (path reinforcement and forwarding).
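
The interest dissemination and gradient setup phases can be sketched as follows. This is a simplified illustration, not the protocol as specified in [1]: gradients are stored as plain lists of neighbors in arrival order, gradient strength is ignored, and reinforcement is approximated by always following the neighbor from which the interest arrived first. All names (`disseminate_interest`, `reinforced_path`, the `topology` dictionary) are hypothetical.

```python
from collections import deque


def disseminate_interest(neighbors, sink):
    """Flood an interest and record, per node, the neighbors it was heard from."""
    gradients = {node: [] for node in neighbors if node != sink}
    seen = {sink}
    queue = deque([sink])
    while queue:
        sender = queue.popleft()
        for node in neighbors[sender]:
            if node == sink:
                continue
            if sender not in gradients[node]:
                gradients[node].append(sender)   # gradient pointing to the sender
            if node not in seen:                 # each node rebroadcasts only once
                seen.add(node)
                queue.append(node)
    return gradients


def reinforced_path(gradients, source, sink):
    """Follow the first gradient at each hop (a stand-in for reinforcement)."""
    path = [source]
    while path[-1] != sink:
        path.append(gradients[path[-1]][0])
    return path


# Topology of the illustrative example: the sink reaches a and b, both reach c.
topology = {
    "sink": ["a", "b"],
    "a": ["sink", "c"],
    "b": ["sink", "c"],
    "c": ["a", "b"],
}
gradients = disseminate_interest(topology, "sink")
path = reinforced_path(gradients, "c", "sink")
```

Node `c` ends up with two gradients, towards `a` and `b`, and only the path through `a` is used once reinforcement has taken place in this sketch.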

Data aggregation is performed when data is forwarded to

the sink by means of proper methods, which can be selected

according to application requirements. The data gathering tree

(i.e., reinforced paths) must be periodically refreshed by the

sink and this can be expensive in case of dynamic topologies.

A tradeoff, depending on the network dynamics, is involved

between the frequency of the gradient setup (i.e., energy

expenditure) and the achieved performance. A valuable feature

of Directed Diffusion consists of the local interaction among

nodes in setting up gradients and reinforcing paths. This allows

for increased efficiency as there is no need to spread the

complete network topology to all nodes in the network.

We observe that attention must be paid to MAC layer

design. Consider as an example the IEEE 802.11 wireless

technology. As noted above, queries are propagated by means

of broadcasts (basic access in IEEE 802.11). However, data is

sent back to the sink via unicast transmissions. This means

that when either the node density increases or the duplicate

suppression rule is not used, due to MAC collisions and

subsequent backoffs, the delay may become excessively large.

Hence, the local traffic should be kept at an acceptably low

level in order to avoid collisions. Several approaches [36], [48],

[49] have been proposed to reduce the control traffic generated

by the local interactions among nodes with Directed Diffusion.

In these solutions, the authors use properly defined aggregation

trees with the main purpose of reducing both traffic and delay.

In [48] a modified version of Directed Diffusion, Enhanced

Directed Diffusion (EDD), is proposed. This protocol jointly

exploits Directed Diffusion to collect data and a cluster-based

architecture to increase the efficiency of the local interactions

(decreasing local traffic and related collisions). A similar

approach is investigated in [50].

PEGASIS [3] - The key idea in Power-Efficient GAthering

in Sensor Information Systems (PEGASIS) is to organize the

sensor nodes in a chain. Moreover, nodes take turns to act as

the chain leader, where at every instant the chain leader is the

only node allowed to transmit data directly to the sink. In this

way, it is possible to evenly distribute the energy expenditure

among the nodes in the network. The chain can be built either

in a centralized (by the sink) or distributed manner (by using

a greedy algorithm at each node). In both cases, however, the

construction of the chain requires global knowledge of the

network at all nodes. The chain building process starts with the

node furthest from the sink. Then the closest neighbor to this

node is chosen as the next one in the chain and so on. Nodes

take turns to act as leader according to the following rule:

node i is elected as the leader in round i. If there are N nodes

in the network, rounds cyclically take values in {1, 2, . . . , N}

according to a TDMA schedule. As a consequence, the leader

is not always the same but, during each transmission round, it

is at a different position in the chain. Note that in this scheme

a direct communication channel from each sensor to the sink

is required.
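
The greedy chain construction and the round-robin leader rule can be sketched as below. This is a minimal illustration that assumes every node position is known (the global knowledge mentioned above) and that "node i" refers to the i-th position in the chain, which is an assumption made here. The function names, the coordinates and the node names are hypothetical.

```python
import math


def build_chain(positions, sink):
    """Greedy chain: start from the node furthest from the sink, then keep
    appending the closest node that is not yet part of the chain."""
    remaining = dict(positions)
    current = max(remaining, key=lambda n: math.dist(remaining[n], sink))
    chain = [current]
    del remaining[current]
    while remaining:
        last = positions[chain[-1]]
        current = min(remaining, key=lambda n: math.dist(remaining[n], last))
        chain.append(current)
        del remaining[current]
    return chain


def leader_for_round(chain, round_number):
    """Rounds cycle over 1..N; the node at chain position i leads round i."""
    return chain[(round_number - 1) % len(chain)]


# Hypothetical node coordinates (in meters) and sink position.
positions = {"n1": (0, 0), "n2": (1, 0), "n3": (3, 0), "n4": (6, 0)}
sink = (10, 0)
chain = build_chain(positions, sink)
```

With four nodes, rounds 1 and 5 are both led by the first node of the chain, so the cost of the long-range transmission to the sink rotates among all nodes.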

In PEGASIS, each node receives data from a neighbor and

aggregates it with its own reading by generating a single

packet of the same length. Subsequently, such an aggregate

is transmitted to the next node in the chain until the packet

reaches the current chain leader. At this point the leader

includes its own data into the packet and sends it to the sink.

A possible drawback of the scheme comes from the distance

among neighbors. In fact, when the neighbors along the chain

are too distant the energy expenditure can be very high.

In addition, transmission energies are not evenly distributed

but depend on the actual distances between the nodes and

their neighbors, i.e., nodes with distant neighbors dissipate

more energy. PEGASIS can therefore be enhanced by not

allowing such nodes to become leaders, for example using a

threshold-based leader election policy. The main disadvantages

of PEGASIS are the necessity of having a complete view of the

network topology at each node for a proper chain construction

and that all nodes must be able to transmit directly to the sink.

This makes the scheme unsuitable for those networks with a

time-varying topology. In addition, link failures and packet

losses may affect the performance of this protocol. In fact, the

failure of any intermediate node compromises the delivery of

all data aggregated and sent by the previous nodes in the chain.

Hence, some improvements to the scheme may be needed in

order to increase its robustness.
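
The effect of an intermediate failure on chain-based gathering can be illustrated with a small sketch. It assumes MAX as the aggregation function (so the aggregate keeps a fixed size) and assumes that the node following a failed one simply starts a fresh aggregate. Neither assumption comes from the PEGASIS description, and the function `gather` and its parameters are hypothetical.

```python
def gather(chain, leader, readings, failed=frozenset()):
    """Pass a fixed-size aggregate (here the maximum) from both chain ends
    towards the leader and return what the leader sends to the sink."""
    k = chain.index(leader)

    def sweep(nodes):
        agg = None
        for node in nodes:
            if node in failed:
                agg = None       # everything aggregated so far is lost
                continue
            value = readings[node]
            agg = value if agg is None else max(agg, value)
        return agg

    left = sweep(chain[:k])        # from the first node towards the leader
    right = sweep(chain[:k:-1])    # from the last node back towards the leader
    parts = [v for v in (left, right, readings[leader]) if v is not None]
    return max(parts)


# Hypothetical chain and readings for one round.
chain = ["n1", "n2", "n3", "n4"]
readings = {"n1": 30, "n2": 21, "n3": 25, "n4": 19}
ok = gather(chain, "n3", readings)
with_failure = gather(chain, "n3", readings, failed={"n2"})
```

When `n2` fails, the reading of `n1` never reaches the leader, so the reported maximum drops from 30 to 25 even though `n1` itself is working.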

DB-MAC [7] - A different approach to route packets by

performing data aggregation is presented in [7], where the

routing and the MAC protocols are jointly designed. The

primary objective of the Delay Bounded Medium Access

Control (DB-MAC) [7] scheme is to minimize the latency

for delay bounded applications while taking advantage of

data aggregation mechanisms for increased energy efficiency.

DB-MAC adopts a CSMA/CA contention scheme based on

an RTS/CTS/DATA/ACK handshake. The protocol is most

suitable for those cases where different sources sense an event

almost at the same time and, due to the delay constraints, have

to send their measurements right away to the sink. In such

cases, the generated data flows can be dynamically aggregated

while routing them towards the sink. This gives rise to an

aggregation tree, which is built on the fly and without having any

knowledge about the network topology. The MAC protocol is

very similar to the IEEE 802.11 RTS/CTS Access [51] with

some minor modifications: RTS/CTS messages are exploited

to perform data aggregation and backoff intervals are

computed by taking into account the priorities assigned to different

transmissions. In particular, each node takes advantage of the

transmissions from other nodes by overhearing CTSs in order

to facilitate data aggregation. This leads to choosing the relay

node among those nodes that already have some packets to

transmit in their queue. This is implemented to promote data

aggregation with the information stored along the path.

As an example, refer to the scenario in Fig. 3. We have

two nodes S1 and S2 which want to transmit their packets

to the sink using one of their neighbors (R1 and R2 in the

figure) as the relay. At the beginning of the contention, a node

transmits a newly generated packet by setting its priority to the

maximum value. The packet priority is subsequently decreased

at each traversed node. Because P(S1) > P(S2), S1 wins the

contention for the medium and sends its packet to R1, which

decreases its priority by one unit. After this, P(R1) becomes

equal to the priority of the packet just transmitted, which is

now stored at node R1. If S2 is placed in the coverage area

of both S1 and R1, it can overhear all messages exchanged

between these two nodes (remember that the packet at S2 still

has to be forwarded). If this is the case, S2 may now want to

send its packet to R1 instead of R2 as it knows that R1 already

has one packet in its queue (the packet previously transmitted

by S1). This facilitates in-network aggregation. DB-MAC

gives an example of how routing and data aggregation may

influence each other, and shows that, in most cases, energy

efficient solutions are achieved only through a cross-layer

design. The advantage of this strategy is the flexible and

distributed procedure for the construction of aggregation trees,

which appears to be suitable for wireless networks with a

dynamic topology.
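
The relay choice that drives aggregation in DB-MAC can be sketched in a few lines. The fragment below only illustrates the idea of preferring a relay that is already known to hold packets; it does not model the RTS/CTS handshake or the backoff computation. The labels S1, S2, R1 and R2, the constant `MAX_PRIORITY` and both function names are hypothetical choices made for this example.

```python
MAX_PRIORITY = 8  # hypothetical priority of a newly generated packet


def forward(priority):
    """Priority of a packet after it has traversed one more node."""
    return priority - 1


def choose_relay(candidates, overheard_queue):
    """Return the first candidate known to hold queued packets, otherwise
    fall back to the sender's default (first) candidate."""
    for relay in candidates:
        if overheard_queue.get(relay, 0) > 0:
            return relay
    return candidates[0]


# S1 wins the contention and hands its new packet to R1.
overheard_queue = {"R1": 1}          # what S2 learns from the overheard exchange
p_r1 = forward(MAX_PRIORITY)         # priority of the packet now stored at R1

# S2 would use R2 by default, but R1 already holds a packet to aggregate with.
relay_for_s2 = choose_relay(["R2", "R1"], overheard_queue)
```

Because the decision uses only information overheard from neighbors, the aggregation tree emerges hop by hop without any node knowing the network topology.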

Further Algorithms - Regarding the tree-based approaches,

many additional solutions have been proposed to solve the

problem of efficiently constructing aggregation trees. The

authors in [36] define an efficient, distributed and energy-aware

heuristic (EADAT) to build the aggregation tree. A nice

feature of such an approach is that the tree construction process

only relies on a local knowledge of the network topology.

Hence, the costs incurred in updating the tree in response to

node mobility, device failures and duty cycles may be limited.

In addition, to further increase the energy savings, the scheme

in [36] uses an aggregation tree rooted at the sink where all

non-leaf sensors perform data aggregation while leaf nodes

can turn off their radios in order to save energy. In [52],

the problem of constructing the optimal aggregation tree is

treated from a game theoretic perspective. The authors develop

a framework including payoff functions that take into account
