深度学习模型压缩：约束优化策略与多类型融合

版权申诉

PDF格式 | 6.1MB | 更新于2024-07-06 | 115 浏览量 | 举报

本文档探讨了模型压缩作为一种约束优化方法在神经网络领域的应用，特别是在第五部分中关注于模型的综合压缩策略。标题"模型压缩作为约束优化，并应用于神经网络。第五部分：合并压缩"表明研究者Miguel A. Carreira-Perpiñán和Yerlan Idelbayev专注于深度学习模型的效率提升，通过将不同的压缩技术结合使用，以期实现更好的性能。模型压缩是近年来深度学习领域的一个关键课题，主要涉及的技术包括量化、低秩近似和剪枝。问题的核心在于，如何确定哪种类型的压缩方法最适合特定的神经网络模型，或者能否通过巧妙地组合这些压缩技术来进一步提高性能。作者将其视为一个优化问题，目标是在保持模型性能的同时，使权重参数等于各个独立压缩部分的加权和。这个优化过程不仅考虑了单个压缩技术的效果，还试图找到它们的最佳组合。为了实现这一点，论文提出了一种算法，该算法能够学习各个压缩部分的参数，使得整体模型在损失函数上的表现最优。实验结果展示了在深度神经网络中，通过这种方法，可以在误差与压缩程度的权衡下，发现显著优于单一压缩技术的模型。这表明，不同的压缩类型之间存在互补的优势，通过结合使用，可以显著提高模型的压缩效率和性能。总结来说，这篇论文的主要贡献在于提出了一个理论框架和实际算法，用于有效地整合各种模型压缩技术，以达到在深度学习模型中实现更高效、更优化的压缩效果。这对于减少模型大小、提升计算效率以及推动人工智能技术的实际应用具有重要意义。

Algorithm 1 Pseudocode (quadratic-penalty version)

input training data, neural net architecture with weight s w

w ← arg min

L(w) reference net

, θ

← arg min

,θ

kw − ∆

(θ

) − ∆

(θ

init

for

µ = µ

< µ

< · · · < ∞

w ← arg min

L(w) +

kw − ∆

(θ

) − ∆

(θ

L step

while

alternation does not converge

← arg min

k(w − ∆

(θ

)) − ∆

(θ







C step

← arg min

k(w − ∆

(θ

)) − ∆

(θ

if kw − ∆

(θ

) − ∆

(θ

)k is small enough then exit the loop

return

w, θ

, θ

problems involve discrete and continuo us variables and can be NP-hard (such a s quantization with an

adaptive codebook). Converge nc e can be established quite generally for convex functions [

4, 42]. For

nonconvex functions, convergence res ults are complex and mor e restrictive [38]. One simple case where

convergence occurs is if the objective in (3) (i.e., each ∆

) is continuously diﬀerentiable and it has a unique

minimizer over each θ

[

5, Proposition 2.7.1]. However, in certain cases the optimization ca n be solved

exactly without any alternation. We give a speciﬁc result next.

4.1 Exactly solvable C step

Solution of the C s tep (eq.

3) does not need to be an alternating optimization. Below we give an e xact

algorithm for the additive combination of ﬁxed codebook quantization (e.g., {−1, +1}, {−1, 0, +1}, etc.)

and sparse corrections .

Theorem 4.1 (Exac tly solvable C step for combination of ﬁxed codebook quantization + s parse correc tions).

Given a ﬁxed codebook C consider compression of the weights w

with an additive combinations of quantized

values q

∈ C and sparse corrections s

min

q,s

− (q

+ s

))

s.t. ksk

≤ κ, (5)

Then the following provides one optimal solution (q

∗

, s

∗

): ﬁrst s et q

∗

= closes t(w

) in codebook for each i,

then solve for s: min

− q

∗

− s

))

s.t. ksk

≤ κ.

Proof. Imag ine we know the optimal set of nonzeros o f the vector s, which we denote as N . Then, for the

elements not in N , the optimal s olution is s

∗

= 0 and q

∗

= closest (w

). For the elements in N , we can ﬁnd

their optimal solution by solving independently for each i:

min

− (q

+ s

))

s.t. q

∈ C.

The solution is s

∗

= w

− q

for arbitrary chosen q

∈ C. Using this, we can rew rite the e q.

5 as

i/∈N

− q

∗

)

This is minimized by taking as set N the κ largest in magnitude elements of w

− q

∗

(indexed over i).

Hence, the ﬁnal solution is: 1) Set the elements of N to be the κ largest in magnitude e lements of w

− q

∗

(there may be multiple such sets, any one is valid). 2) For each i in N : set s

∗

= w

− q

∗

, and q

∗

= a ny

element in C. For each i not in N: set s

∗

= 0, q

∗

= closest(w

) (there may be 2 closest values, any one is

valid). This c ontains multiple solutions. One particular one is as given in the theorem statement, where we

set q

∗

= closest (w

) for every i, which is practically more desirable because it leads to a smaller ℓ

-norm of

5 Experiments on CIFAR10

We evaluate the eﬀectiveness of additively combining compressio ns on deep nets of diﬀerent sizes on the

CIFAR10 (VGG16 a nd ResNets). We systematically study each combination of two o r three compressio ns

剩余28页未读，继续阅读

易小侠

粉丝: 6646

深度学习模型压缩：约束优化策略与多类型融合

BP_imagecompression_BP_图像压缩_BP神经网络_

model_compression:PyTorch模型压缩

ml.rar_DCT compression_DCT image_dct_image compression_压缩算法

image-compress.rar_BP 图像压缩_bp compression_image compression_neur

LZW.rar_LZW Compression Sour_delphi compression_lzw compression

RLE.rar_RLE_compression rle_rle compression_rle压缩

model_compression, 基于知识提取方法的模型压缩实现.zip

imagecompression.rar_imagecompression_matlab 图像压缩_压缩图像_图像 压缩_图像压

svd.rar_SVD_image compression_svd compression

fft.rar_audio compression_doc_fft_fft compression

最新资源

imagecompression.rar_imagecompression_matlab 图像压缩_压缩图像_图像压缩_图像压