云中高效隐私信息检索：支持关键词且降低成本

4 浏览量更新于2024-08-26 收藏 189KB PDF 举报

在云计算环境中，仅仅保护用户查询内容免受数据库服务器的窥探是远远不够的。隐私问题不仅涉及查询内容本身，还包括用户的访问模式。这些模式可以通过细致观察被泄露，因此，确保服务器对查询一无所知，包括访问模式，是至关重要的。这要求在提供服务的同时，确保不会带来过高的计算或通信成本。然而，现有的解决方案往往因为其实际通信和计算开销而效率低下，而且许多方案不支持关键词搜索，这限制了其实用性。本文的主要挑战是如何在保证用户隐私的同时，实现经济高效的私人信息检索（Private Information Retrieval，简称PIR）技术。PIR允许用户从大量数据中检索特定信息，而无需向服务器透露所查询的具体项。传统方法可能涉及到全量数据的传输，这显然不切实际。为解决这一问题，作者提出了一种名为KSPIR的新机制，它引入了定价策略来平衡隐私保护和成本效益。 KSPIR的核心思想在于设计一个机制，通过定价策略调整服务器的工作量，使其在处理查询时只处理必要的数据部分，从而降低通信量。这种设计旨在最小化通信成本，同时考虑到用户的查询需求，尤其是对于支持关键字搜索的需求。通过巧妙地利用分布式计算和加密技术，KSPIR能够在保护用户隐私的同时，提供一种既高效又能支持复杂搜索操作的解决方案。该论文深入探讨了KSPIR机制的具体实现细节，包括如何设计有效的密钥分配、数据编码和查询构造，以及如何根据查询频率动态调整价格，以鼓励服务器在合理成本范围内执行任务。此外，文章还分析了KSPIR在不同应用场景下的性能，并与其他现有PIR方法进行了比较，突显出其在实际云计算环境中的优势。这篇研究论文为云计算中的隐私保护提供了一个新的视角，通过引入定价机制，KSPIR在保证用户隐私的同时，实现了在大规模数据库中进行关键字搜索的实用性和经济性，这对于云计算服务的提供商和用户来说都是一个重要的进展。

Practical Private Information Retrieval Supporting

Keyword Search in the Cloud

Mengke Yu

∗

, Kaichen Yang

∗

, Lingbo Wei

†

and Jinyuan Sun

‡

∗

Key Laboratory of Electromagnetic Space Information

University of Science and Technology of China

Email: yumk@mail.ustc.edu.cn, ykcdxt@mail.ustc.edu.cn

†

Shanghai Jiao Tong University, Email: weilib@hotmail.com

‡

University of Tennessee, Email:jysun@utk.edu

Abstract—In cloud computing environment, just protecting

the contents of the queries from users to a large database server

is far away from enough. Because it does not protect the leak of

access patterns from careful observations. It is thus important to

make sure the server learning nothing about the queries including

access patterns. However, this implies an expensive computation

or communication cost of all the data on the server. Existing

solutions are not efﬁcient due to their impractical communication

and computation cost. Besides, most of them do not support

keyword search. In this paper, we introduce the mechanism

of pricing to solve the problem of impractical cost. Using our

scheme called KSPIR, we achieve the minimum communication

and computation cost according to the ﬂexible privacy and budget

speciﬁed by users. It is indeed a kind of tradeoff between the

cost of retrieval and the degree of privacy. It is worth noting

that it also supports keyword search. It allows users to retrieve

the data items containing the keywords they are interested in.

The experimental results conﬁrm the correctness and efﬁciency

of KSPIR.

I. INTRODUCTION

With the rapid development of cloud computing, out-

sourced storage architectures become more prevalent than ever.

Data owners do not have to purchase expensive hardware for

storing or managing a large amount of data locally. More and

more data owners, therefore, are willing to upload their data to

cloud servers. Users who are interested in the data can retrieve

them anywhere from terminals with great ﬂexibility.

But it also brings a threat to users’ privacy. Cloud servers

are untrusted and vulnerable to malicious attacks. Encryption

protects data privacy and prevents unauthorized accesses, but it

is far away from enough because it does not protect the leakage

of access patterns from careful observations. Trafﬁc analysis

techniques can reveal sensitive information against privacy

[1]. For example, when users are in personalized search and

recommendation services through service providers, such as

Google, access history could disclose users’ retrieval habits or

identities; the frequencies of requests can reveal the popularity

of data; accesses to the same data may indicate the relationship

between multiple users.

This work was supported in part by the Natural Science Foundation of

China (NSFC) under Grants 61202140 and 61328208, by the Program for

New Century Excellent Talents in University under Grant NCET-13-0548, by

the Innovation Foundation of the Chinese Academy of Sciences under Grand

CXJJ-14-S132, and by the Youth Innovation Promotion Association of the

Chinese Academy of Sciences.

Fortunately, Private Information Retrieval (PIR) is a tech-

nique allowing users’ retrieving, without the server learning

which data items have been retrieved. It not only protects the

contents of queries, but also protects users’ access patterns.

In the formal PIR setting, the database is modeled as a n-

bit string DB = m

, ..., m

. The user obtains a particular

bit m

, and the server does not learn i. It is the foundation

of many privacy-sensitive applications, including anonymous

email, patent databases, domain name registration, and anony-

mous communication networks .

Obviously, a trivial but information-theoretically secure

way for users is to download the entire database and make

requests after decrypting the whole database. After that, they

then encrypt and upload them to the server again. However,

the communication cost is too huge to be practical. There-

fore, researchers came up with a large quantity of non-trivial

solutions. The number of bits transferred between users and

the server has to be smaller than the size of the whole

database. Unfortunately, the deployment of these protocols

on real hardware is mostly orders of magnitude slower than

trivially transferring of the entire database [2].

According to the formal PIR setting, most existing PIR

solutions only allow the retrieval by the address i, but i is not

available to users in cloud computing environment. To tackle

this challenge, extensions of the basic PIR [3][4][5] supporting

keyword search are proposed. Our paper is inspired by Michael

et al. [3], because it is among the best to design the scheme

which can also support keyword search and protect access

patterns. However, the bad news is that the hard problem of

huge cost in communication and computation also exists here.

This is exactly the problem we are going to solve.

As we all know, applying PIR to the database consisting of

n data items can make sure that the server cannot recognize

retrieving requests among n items. But users do not always

have such strict requirements on privacy. Sometimes, users

may only expect their retrieving requests to be hidden among

k data items (k << n). A relate concept is k-Anonymity

in which the probability to recognize an individual correctly

is at most 1/k. We introduce the mechanism of pricing here

and propose a scheme called KSPIR that achieves the tradeoff

between the cost of retrieval and the degree of privacy. That

is to say, users can sacriﬁce their privacy to exchange for less

cost or achieve higher level of privacy at the expense of higher

cost. More speciﬁcally, users can design the level of privacy

they need and the money they can afford at most. Then the

2014 Sixth International Conference on Wireless Communications and Signal Processing (WCSP)

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38691194

粉丝: 5

云中高效隐私信息检索：支持关键词且降低成本

基于属性的数据检索和语义关键字搜索，用于电子医疗云

云计算中加密数据的模糊关键字搜索方法.pdf

通过云中的加密数据进行高效，富有表现力的关键字搜索

TMDS：支持云存储中关键字搜索的薄型数据共享方案

terraform-x2go-firefox:云中的私人浏览

加密云数据的两步式安全多关键字搜索

带有关键字搜索的量子后安全公钥广播加密

具有对云计算中的加密数据的访问控制的关键字搜索

云计算中多个数据所有者的安全联合多关键字搜索

信息检索文档

最新资源