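Research and Implementation of a Confidential Information Search System: A Discussion of Security Detection Techniques for PDF and Office Documents.doc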

Updated 2024-02-23, 4.4 MB, DOC
ity in personal computers has grown steadily. At the same time, as the Internet has become widespread, people handle more and more information and therefore store more files on their personal computers. In government and other confidentiality-sensitive departments in particular, sensitive files kept on a host pose a serious security risk. This paper therefore designs a confidential information search system for PDF and Office documents. The system searches PDF files and Office documents for confidential keywords, flags files that may contain confidential information, helps uncover leakage risks promptly, and so helps protect the security of the country and the relevant government departments. Over sixteen weeks of study and research, I implemented this system; it performs keyword searches on PDF files and on Office Word, Excel, and PPT documents and completes detection tasks efficiently.
[Keywords] format parsing, content extraction, pattern matching algorithm
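The three keywords above (format parsing, content extraction, pattern matching) can be illustrated with a minimal sketch. It is not the system described in the paper: it assumes only the .docx case, treats the file as a ZIP archive holding `word/document.xml`, and uses a regular-expression alternation as a stand-in for whatever pattern matching algorithm the author chose. The function names `extract_docx_text`, `find_keywords`, and `make_sample_docx` are hypothetical and exist only for this illustration.

```python
import io
import re
import zipfile
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# WordprocessingML namespace used by paragraphs and text runs in a .docx body.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def extract_docx_text(data: bytes) -> str:
    """Format parsing and content extraction: return the text of a .docx."""
    # A .docx file is a ZIP archive; the main body lives in word/document.xml.
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # Join the text runs of one paragraph into a single line.
        paragraphs.append("".join(t.text or "" for t in para.iter(W_NS + "t")))
    return "\n".join(paragraphs)


def find_keywords(text: str, keywords: list[str]) -> dict[str, list[int]]:
    """Pattern matching: map each keyword found to its character offsets.

    Assumption: a regex alternation is enough here. Matches do not overlap.
    """
    hits: dict[str, list[int]] = {}
    if not keywords:
        return hits
    # Longest first so a longer keyword wins over one of its own prefixes.
    ordered = sorted(set(keywords), key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in ordered))
    for match in pattern.finditer(text):
        hits.setdefault(match.group(), []).append(match.start())
    return hits


def make_sample_docx(paragraphs: list[str]) -> bytes:
    """Build a stripped-down .docx in memory (hypothetical test fixture).

    It contains only word/document.xml, which is enough for the extractor
    above but not a complete file that Word would open.
    """
    ns = W_NS.strip("{}")
    body = "".join(
        "<w:p><w:r><w:t>%s</w:t></w:r></w:p>" % escape(p) for p in paragraphs
    )
    xml = '<w:document xmlns:w="%s"><w:body>%s</w:body></w:document>' % (ns, body)
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", xml)
    return buffer.getvalue()


if __name__ == "__main__":
    sample = make_sample_docx(["Top secret budget", "Nothing here"])
    print(find_keywords(extract_docx_text(sample), ["secret", "classified"]))
```

Excel and PPT files in the newer Office formats are also ZIP archives of XML parts, so the same idea would extend to them with different part names and namespaces. PDF files and the legacy binary .doc format need dedicated parsers and are outside this sketch.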