手机轨迹分类中的Cell-ID相似度测量

研究论文

10 浏览量更新于2024-08-27 收藏 1.55MB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

"这篇研究论文探讨了如何利用手机的cell-id轨迹来衡量并分类移动电话用户的路径相似性，以此为基础进行路线分类。论文指出，基于物理位置的轨迹分类方法存在额外成本，如使用GPS导致的能耗增加。相比之下，通过识别连接的手机信号塔（cell-tower）的id（cell-id），可以无需额外硬件或网络服务就获取轨迹信息。" 本文重点讨论了cell-id轨迹相似度测量在移动电话路线分类中的应用。Cell-id轨迹是通过记录用户手机连接的基站ID来形成的，这种方法无需依赖精确的GPS坐标，因此降低了数据获取的成本，特别是在考虑到移动设备的能源效率时更为实用。作者 Mingqi Liu、Ling Chen、Yanbin Shen 和 Gencai Chen 提出，尽管cell-id轨迹不如物理位置轨迹精确，但它们能提供足够的信息来识别用户的常规行动模式和路线。文章强调了相似度测量的重要性，因为它是对轨迹进行聚类和分类的关键步骤。在没有精确地理位置的情况下，找到一种有效的方法来比较和匹配cell-id轨迹成为了研究的核心。论文可能涉及了多种相似度计算方法，如欧氏距离、余弦相似度或者其他专为此目的设计的定制距离度量。通过对cell-id轨迹的相似性分析，可以将移动电话用户的路径进行分类，例如，识别出用户的通勤路线、常去的地点等。这在位置感知应用中具有广泛的应用价值，如智能交通管理、个性化推荐系统、用户行为分析以及公共安全等领域。此外，文章还可能探讨了处理cell-id轨迹数据的挑战，包括cell-tower覆盖范围的变化、用户的室内定位问题以及如何处理由于网络切换引起的轨迹片段化。这些问题的解决对于提高分类准确性和路线预测的可靠性至关重要。最后，论文可能包含了实验部分，展示了提出的cell-id轨迹相似度测量方法在实际数据集上的性能，并与其他已有的基于位置的方法进行了对比。这些实验结果为cell-id轨迹在实际应用中的有效性提供了实证支持。这篇研究论文贡献了一种新的、成本效益高的途径来理解和分类移动电话用户的路径行为，这对于理解大规模移动数据和开发相关服务具有深远意义。

资源详情

资源推荐

trajectory classiﬁcation methods which deﬁne a distance function

to measure the similarity between trajectories also require that

the physical locations are available [18,22]. The popular distance

functions include Euclidean Distance [23], Hausdorff Distance

[24], Fréchet Distance [25] and Dynamic Time Wrapping Distance

(DTW) [26], etc.

For clustering and classifying cell-id trajectory data with no

regard to physical locations, the most challenging problem is to

deﬁne a similarity measure that can effectively capture the rela-

tions between cell-id trajectories. Görnerup [27] quantiﬁed the

similarity between two cell-id trajectories based on common sub-

set measure (i.e. the ratio of the number of common cell-ids shared

by the two cell-id trajectories to the total number of unique

cell-ids in the two cell-id trajectories). However, the common sub-

set measure does not account for the sequential nature of the

cell-id trajectories, basically reduced to an unordered set of

cell-ids to be calculated. Bayir et al. [28] formed cell tower clusters

by examining the oscillation patterns (oscillation is the phe-

nomenon that a mobile phone switches back and forth constantly

between multiple cell towers), and converted a cell-id trajectory to

a sequence of clusters. Then, the cell-id trajectory similarity could

be calculated by using such as Longest Common Subsequence

(LCSS) [29]. However, cell towers are almost uniformly distributed

in physical space, so it is always difﬁcult to determine the border of

a cluster. Laasonen [30] and Yavas et al. [31] treated a cell-id tra-

jectory as a string, and deﬁned the similarity measure based on

string alignment algorithms [32]. However, the similarity between

cell towers themselves is not taken into account. Thus, two cell-id

trajectories which differ from each other with respect to multiple

similar cell towers (cell towers which are spatially close to each

other) would be considered as dissimilar ones.

3. Method

3.1. Preliminary

The basic data explored in this paper are cell-id trajectories

deﬁned as follow. Fig. 1 gives an overview of the system architec-

ture of our method. The core of our method is a cell-id trajectory

similarity measure that takes into account the cell tower similarity

(which is explored based on handoff patterns implied in the cell-id

trajectory data), the order of cell-ids in the cell-id trajectory and

the time duration of the connected cell towers. The trajectory clus-

tering algorithm uses the similarity measure to divide mobile

phone users’ cell-id trajectories into different groups to form route

patterns, and the route classiﬁcation algorithm uses the similarity

measure to match the current cell-id trajectory of the mobile

phone user to one of the route patterns. The route pattern here

can be viewed as a representative cell-id trajectory, which reﬂects

the repeated moving behaviors of the mobile phone user.

Deﬁnition 1. A cell-id trajectory is a sequence of cell-id logs

CTraj = hCL

,...,CL

i. Cell-id log CL

is a pair CL

=(ID, Duration),

where ID and Duration are respectively identiﬁer and time duration

of the connected cell towers.

3.2. Cell tower similarity measure

Since a cell-id trajectory is represented by a sequence of

cell-ids, it could be processed as a string [30,31]. However, it

ignores the potential similarity between cell towers. As shown in

Fig. 2, three cell-id trajectories (i.e. Traj

, Traj

and Traj

) move

among the area covered by multiple cell towers. These three trajec-

tories have almost totally different string representation: Traj

cor-

responds to ‘‘ACFJOSV’’, Traj

corresponds to ‘‘AGKPTW’’ and Traj

corresponds to ‘‘ABEIN’’. However, Traj

should have much higher

similarity with Traj

than with Traj

since they pass through the

area covered by cell towers that are spatially close to each other.

However, the physical locations of the cell towers are not avail-

able, and it is impossible to capture the spatial closeness between

cell towers by only observing their identiﬁers. Thus, we try to

explore the similarity between cell towers by analyzing their hand-

off patterns. Given a pair of cell towers, we deﬁne their handoff

pattern in a cell-id trajectory as the mutual switches between each

other. This idea is based on the fact that handoff usually occurs

from one cell tower to its neighboring cell towers in order to guar-

antee the continuation of active calls [33], so switch between a pair

of cell towers with higher spatial closeness often occurs more fre-

quently in a shorter interval.

Given a cell tower identiﬁer pair (ID

, ID

) and a set of cell-id

trajectories TS ={T

,...,T

}, we calculate the similarity score of

and ID

taking into account both the frequency and interval of

their mutual switches as Eq. (1). In the equation, TS

is a subset

of TS in which each cell-id trajectory contains both ID

and ID

, TS

is another subset of TS in which each cell-id trajectory contains

either ID

or ID

. SN

is the number of mutual switches between

and ID

in T

, interval

is the number of other cell-ids between

and ID

in the jth switch in T

. SC(ID

, ID

) represents the ﬁnal

similarity score of ID

and ID

. The inverse tangent function is

used to ensure that the similarity score falls in the range of

[0,1]. In order to verify the effectiveness of Eq. (1), we analyze

the correlation between cell tower similarities (calculated based

on Eq. (1) ) and cell tower distances (calculated based on cell

towers’ physical locations which could be queried from online

war-drivin g databases). As shown in Fig. 3, the similarity

generally increases as the distance decreases. It veriﬁes the idea

behind Eq. (1), i.e. to give larger similarity for cell tower pairs that

are spatially closer to each other.

SCðID

; ID

Þ¼

 tan

1

jTS

i¼1

j¼1

1þinter

jTS

jþjTS

ð1Þ

Example 1. We give an example to explain the cell tower

similarity calculation process. Given a cell tower pair (c

, c

) and

TS ={T

, T

}, where T

= hc

, c

i, T

= hc

, c

i, T

= hc

, c

i, TS

={T

}, TS

={T

} (the connected time

durations of the cell towers are ignored here for simpliﬁcation),

there are three switches between c

and c

in T

(i.e. SN

= 3): The

ﬁrst switch occurs from c

(at index = 1) to c

(at index = 2), the

second switch occurs from c

(at index = 2) to c

(at index = 3), and

the third switch occurs from c

(at index = 5) to c

(at index = 8).

We do not force the switches to occur at consecutive positions

since the mobile phone might constantly dither between several

cell towers before transferring to a proper one due to the

oscillation problem [28]. Thus, the similarity score of (c

, c

)

in T

is 1/(1 + 0) + 1/(1 + 0) + 1/(1 + 2) = 2.33, and SC(c

, c

tan

1

(2.33/(1 + 1))  (2/

) = 0.55.

3.3. Cell-id trajectory similarity measure

Since the cellular handoff patterns are proven to be stable over

the same route [18], two cell-id trajectories are similar if they have

long time duration to connect to similar cell towers in similar

orders. Based on this intuition, we propose an optimal alignment

algorithm to ﬁnd the optimal alignment of two cell-id trajectories,

and then the similarity between the two cell-id trajectories is cal-

culated based on the optimal alignment. The proposed optimal

alignment algorithm is based on the concept of DNA alignment

[34]. However, the difference is that we should take the following

facts into account.

M. Lv et al. / Knowledge-Based Systems 89 (2015) 181–191

183

剩余10页未读，继续阅读

weixin_38738783

粉丝: 5
资源: 903

手机轨迹分类中的Cell-ID相似度测量

cell_id.rar_Cell-ID_cell id_cell id范围_cell id_leatherzrp

在ASP.Net中通过cell-id和LAC获取位置信息

已通过图片管理辽PD6885黄牌秦皇岛九福物流有限公司叶红建13842929049金海粮油2023-05-01当天有效x0jhly2023-04-29 21:09以上代码为网页源码，帮我写一段python程序从以上代码中找出drivernam和checkTim并保存数据库中

cell-class-name传参数

eltable 的header-cell-style

header-cell-style用法

怎么用cell-class-name自定义样式

element ui cell-click

el-table @cell-mouseenter

el-table cell-click

element-plus的cell-class-name方法

header-cell-style的属性写成函数

VUE 封装一个van-cell-group组件可以修改内容并在其他页面引用

cell-mouse-enter

eltable中有:header-cell-class-name="headerBg"吗

怎么给uview中的u-cell-item 设置点击事件

基于大模型技术的算力产业监测服务平台设计

This_honeypot_supports_Telnet_and_SSH_two_protocol_FF-Pot.zip

吉他谱_What I've Done - Linkin Park.pdf

最新资源

已通过
图片管理
辽PD6885
黄牌
秦皇岛九福物流有限公司
叶红建
13842929049
金海粮油
2023-05-01
当天有效
x
0
jhly
2023-04-29 21:09
以上代码为网页源码，帮我写一段python程序从以上代码中找出drivernam和checkTim并保存数据库中