压缩屏幕内容图像超分辨率研究

需积分: 5 122 浏览量更新于2024-06-19 收藏 5.94MB PDF 举报

本文档《Compressed Screen Content Image Super Resolution》聚焦于当今高度互联世界中的关键领域——屏幕内容图像超分辨率处理。随着远程协作和通信（如虚拟会议与在线教育）的普及，屏幕上产生的数据量急剧增长，对屏幕内容的压缩技术的需求也随之增强，它是存储、传输和展示这些内容的基础工具。作者们，来自香港城市大学计算机科学系的 Mengwang 等人，以及来自字节跳动和抖音集团的 Jizheng Xu、Lizhang 和 Junruli，以及北京大学数字媒体研究所的 Kaizhang、Shiqi Wang 和 Siwei Ma，共同探讨了在压缩屏幕内容图像面临不同压缩失真级别的情况下，如何解决实际世界中的超分辨率问题。他们提出了一项专门针对此类挑战的新方法，即设计了一个压缩屏幕内容图像超分辨率的专用数据集，旨在提升在压缩过程中丢失细节的图像质量。该研究的核心在于探索如何通过多假设（multi-hypothesis）和高效的算法来恢复压缩图像的原始清晰度。这可能包括利用深度学习技术，比如卷积神经网络（CNN），来分析和解码压缩过程中的信息，同时处理各种类型的压缩失真，如JPEG压缩导致的块效应或量化噪声。通过训练模型以适应不同级别的压缩，研究人员试图最小化重建图像与原始图像之间的差距，以实现更真实的超分辨率效果。此外，文中可能还涵盖了压缩图像超分辨率技术的关键指标，如峰值信噪比（PSNR）、结构相似性指数（SSIM）或视觉信息量（VIF），用于评估算法性能。研究者可能还讨论了如何优化压缩率与图像质量之间的权衡，以及如何将这种方法应用到实际场景中，例如提高视频会议的清晰度或在线课程的视觉体验。这篇论文是IT领域的一个重要贡献，它不仅解决了压缩屏幕内容图像的超分辨率问题，还可能为其他涉及实时或大流量图像处理的应用提供新的解决方案和技术思路。对于从事图像处理、视频通信或者多媒体技术研究的人员来说，理解和应用其中的方法具有很高的价值。

209:4 M. Wang et al.

new standards to better support diverse application scenarios such as city security, online learning,

and cloud gaming.

Screen content coding tools are specically designed to facilitate the coding of computer-

generated content, orienting to the application of screen sharing, animation, gaming, and a mix-

ture of content. The new generation of video coding standards species the screen content coding

technologies as low-level coding tools, catering to the market regarding the increased capacity

of the non-camera captured content. In VVC, ve screen content coding tools are designed [30],

including the Transform Skip with Residual Coding (TSRC) [29], Block-based Dierential

Pulse-Coded Modulation (BDPCM) [2], Intra Block Copy (IBC) [50], Adaptive Color Trans-

form (ACT) [60] and the palette mode [36]. The IBC, palette mode, and ACT are inherited from

the predecessor HEVC screen content coding extensions. The VVC is anticipated to become the

main solution for screen content and mixture content coding, owing to the considerable coding

performance improvement over the previous standard.

2.2 Image Super Resolution and Restoration

Numerous eorts have been dedicated to the restoration of the high-quality images where the low-

quality to high-quality mapping relationship could be eectively represented through deep neural

networks. To be more specic, with deep neural networks, low-level features are extracted and

formulated from the low-quality input LR image, approaching the HR by feature accumulation and

reorganization. Learning-based SR schemes have successfully surpassed the traditional SR schemes

from the perspective of the quantitative and qualitative evaluations, as the intrinsic connections

between the LR and HR could be well understood by the neural network.

Beyond the classical SRCNN, Kim et al. [16] proposed to progressively increase the network

depth, with the goal of exploring the restoration capability of the large-scale model. The net-

work design exhibits to be eective in enhancing the performance of the learning-based SR, such

that bundles of works have been proposed, including the residual networks, densely connected

networks, attention-based mechanism, recursive learning, and transformer networks. Enhanced

Deep Super Resolution (EDSR) network was investigated by Lim et al. [25], wherein the struc-

ture of the residual net [15] was modied to better adapt to the low-level recovery task. In [61], the

attention mechanism is involved in the SR network wherein the feature channels are adaptively re-

scaled according to the attention allocation. Residual dense network [63] was proposed for image

SR which exploited the hierarchical features from the LR with densely connected convolutional

layers. Yang et al. [57] investigated a deep edge guided recurrent residual network to progres-

sively compensate the high-frequency information, which could well handle JPEG artifacts during

high resolution recovery. Anwar et al. [3]proposedtheDensely Residual Laplacian Network

(DRLN), which exploits features at multiple scales with cascading residual structure, densely con-

nected structure, and Laplacian attention model. In [20], symmetrical residual connection struc-

tures are explored, which employ symmetrical nested residual connections with multiple paths,

leading to the enhancement of the restoration performance and the increase of computing speed.

Kernel attention network [58] is investigated for single image super resolution wherein the net-

work could adjust the receptive eld size by changing the input scales and kernel selection. In this

way, the network can learn the distinguished features through multi-scale perceiving. Recently,

Ma et al. [27] propose a meta-learning-based fusion network, which is capable of generating an

HR image by fusing deterministic and stochastic images, resulting the improvement of the per-

ceptual quality. A sequential hierarchical SR network is studied which considers the correlations

among features with dierent scales [26]. Meanwhile, image restoration, especially in terms of

resolving the compression distorted images, has attracted much attention. The restoration could

be implemented as out-loop lters for the reconstructed images, aiming at eliminating the coding

ACM Trans. Multimedia Comput. Commun. Appl., Vol. 19, No. 6, Article 209. Publication date: July 2023.

剩余19页未读，继续阅读

qq_45493351

粉丝: 0
资源: 2

压缩屏幕内容图像超分辨率研究

# 加载原始图像和压缩后的图像 original_image = Image.open('0.jpg') compressed_image = Image.open('1.jpg')

Image and Video Processing in the Compressed Domain.pdf

Xampling-Compressed Sensing.pdf

Docker从入门到实践.compressed.pdf

Lecture6-compressed.pdf

video_20201227_160109 - Compressed with FlexClip (1).mp4

【船级社】 NK Guidelines for Compressed Natural Gas Carriers.pdf

Compressed Sensing of EEG using BSBL.rar_BLOCK SPARSE_BSBL_Bayes

Single Image Super Resolution Based on Multi-scale Self-Similarity Structure in The CS Frame

最新资源