云环境下的加密字符串最长公共子序列计算方法

需积分: 5 29 浏览量更新于2024-08-25 收藏 155KB PDF 举报

"这篇研究论文探讨了在加密字符串环境下计算和检索最长公共子序列的问题。作者包括 Minghao Zhao、Zhen Li、Yilei Wang 和 Qiuliang Xu，分别来自山东大学、山东财经大学、鲁东大学以及山东省软件工程重点实验室。论文提出了解决这一问题的新方法，考虑了云计算和外包计算对数据安全与隐私保护的需求。" 正文: 最长公共子序列(Longest Common Subsequence, LCS)是算法领域的一个基础问题，主要应用于信息处理和生物信息学等多个领域。LCS寻找两个或多个序列中的最长子序列，该子序列不必连续但必须保持原始顺序。这个问题是NP难的，通常采用动态规划方法求解，虽然速度相对较快，但需要较大的内存空间。随着云计算和外包计算的发展，处理大量数据的难题得到了缓解。用户可以将数据上传到云端，由云服务器进行处理。然而，数据安全和隐私保护成为关注焦点，用户希望在上传前对数据进行加密，同时允许云服务提供商在不解密的情况下对数据进行有效操作。论文针对这一挑战，提出了在加密字符串上进行LCS计算和检索的新策略。这种方法旨在保留LCS算法的效率，同时确保数据在传输和处理过程中的安全性。通过特定的加密技术和安全协议，可能实现了在加密数据上进行动态规划或其他算法的近似计算，从而降低了内存需求，并且保持计算的正确性。此外，论文可能还讨论了如何设计适应这种加密环境的索引结构，以便快速检索最长公共子序列。这可能涉及到高效的数据结构，如平衡查找树或哈希表，它们能够在加密状态下来支持必要的查找和比较操作。论文的贡献可能包括以下几点： 1. 设计了一种新的加密算法，使得可以在不暴露原始数据的情况下计算LCS。 2. 提出了一种内存效率更高的动态规划变体，适合处理加密数据。 3. 构建了适应加密环境的索引结构，提高了检索效率。 4. 对所提方法进行了安全性分析和性能评估，证明了其在实际应用中的可行性和优势。这篇研究论文为云计算环境下的数据安全处理提供了一种创新解决方案，特别是对于那些需要处理敏感信息的应用，如医疗记录或金融交易，具有重要的理论和实践价值。

展开

Longest Common Sub-sequence Computation and

Retrieve for Encrypted Character Strings

Minghao Zhao

, Zhen Li

1,2

, Yilei Wang

3,4

, Qiuliang Xu

School of Computer Science and TechnologyShandong University, Jinan, China

School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China

School of Information and Electric Engineering, Ludong University, Yantai, Shandong

Shandong Provincial Key Laboratory of Software Engineering, Jinan, China

Email: zhaominghao@hrbeu.edu.cn; xql@sdu.edu.cn

Abstract—Longest Common Sub-sequence is a basic

algorithm problem. It serves as a basic component for a variety

of applications in information processing and bioinformatics. It is

a NP-hard problem and often manipulated using dynamic

programming, which is relatively fast but involves large memory

space. Fortunately, cloud computing and outsourced computing

provides a practical method for overload alleviation. However,

for the security and privacy concern, clients hope to encrypt their

data before upload them to the cloud, meanwhile maintain the

ability for the cloud to process on the data. In this paper, we

propose a method to computing Longest Common Sub-sequence

using somewhat homomorphic encryption. Beyond that, we show

how to use our achievement into searchable encryption to achieve

rich expressiveness.

Keywords—homomorphic encryption, information retrieve,

longest Common Sub-sequence, searchable encryption

I. INTRODUCTION

With the proliferation of newly emerging technology in

multimedia, software and storage, we have stepped into the era

of big data, and data have become a torrent flowing into every

area of the global economy [1]. However, generally big data

processing involves tremendous overhead for storage and

computing (e.g. processor), which is a huge burden for

individuals and small or medium-sized enterprises. Fortunately,

cloud computing and outsourced computing provides a feasible

method for burden release for the client, and gets their

popularity nowadays. Having considered the fact that the

service provider is generally untrustworthy, individuals begin

to pay increasingly attention on data security and privacy. It is

desirable to provide a method that enables the cloud service

provider to process on the data, meanwhile prevent him to

acquire sensitive information about the data.

Generally, Full-homomorphic encryption (FHE) [2] and

secure multi-party computation (SMPC) [3-4] are generic

cryptographic achieving this target. Specifically, Full-

homomorphic encryption is an encryption scheme that enables

the cloud to perform any computation on the ciphertext, and

multi-party computation is a cryptographic protocol that

enables distributed parties to jointly compute functionality

without revealing each party’s input and output. However,

although after many years research, the low efficiency of FHE

and SMPC still prevent them for real application. Thus,

researchers are engaged in designing specific and appropriative

method for certain problems without using full-homomorphic

encryption and secure multiparty computation.

Longest Common Sub-sequence is a basic algorithm

problem which captures a wide range of application in

computational biology, computational linguistics and other

utilization of string manipulation. In this paper, we propose a

method to compute longest common sub-sequence use

somewhat homomorphic encryption. In addition, we will show

how our method can be used in searchable encryption to

achieve richer expensiveness.

II. PRELIMINERARISE

A. String Manipulation for Rich Expensiveness in Searchable

Encryption.

Searchable encryption is a cryptographic primitive that

enables a client to outsource his encrypted document to the

cloud, meanwhile maintains the ability to perform keyword

based search and retrieval the document. The security ensures

that only minimal information is leaked to the cloud. After

firstly proposed by song et al. [5], many scholars conduced to

researches on rich expensiveness, such as fuzzy search [6],

ranked search [7] and Boolean query [8].

In traditional searchable encryption, keyword is regarded as

the basic search unit, which indicates that, the search term is

restricted to a certain keyword (or logic combination of

multiple keywords). As they treat the keyword as an entire

component rather than stings, theme scheme do not support

wildcard search, sub-string search and regular expression query.

Generally, using keywords as search criteria is satisfiable

for a wide range of languages, especially for isolating language

and fusional language. But in terms of agglutinative language

(e.g. Finnish and German), in which a word is composed of

many morpheme. We seldom search for such a long word, but

instead of that, we tend to search for the morphemes. Existing

natural language parsing tools can be used for morpheme

division, but it is difficult to be absolutely accurate. Thus a

simple and method is to support any query text string, instead

of the whole word. Especially in CJK language (i.e. Chinese,

Japanese and Korean), the text is a sequence on a large

alphabet, and each symbol has an independent meaning. We

2016 19th International Conference on Network-Based Information Systems

DOI 10.1109/NBiS.2016.82

496

2016 19th International Conference on Network-Based Information Systems

DOI 10.1109/NBiS.2016.82

496

下载后可阅读完整内容，剩余3页未读，立即下载

身份认证购VIP最低享 7 折!

30元优惠券

weixin_38690275

粉丝: 7

云环境下的加密字符串最长公共子序列计算方法

ACM小组内部预定函数.pdf

计算机常用算法

【计算机】十大经典算法

公共子序列问题的算法分析

CSP-J字符串处理精要：高效解决字符串相关问题（字符串处理高手）

【CSP-S提高组字符串处理艺术】：字符串处理的高级技巧与方法

【解锁计算光刻神秘代码】：字符串指令的全面应用与优化

C语言字符串搜索：掌握技巧，绕过常见错误陷阱

【字符串相似度比较：Java实现回文检测与编辑距离】

【字符串操作进阶】：如何双倍提升代码效率与性能

最新资源