
mention work in quantum information theory, constructing error-correcting codes using a collection of orthogonal bases with minimal coherence, obtaining similar bounds on the mutual coherence for amalgams of orthogonal bases [11].
The mutual coherence, which is relatively easy to compute, allows us to lower-bound the spark, which is often hard to compute.
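Concretely, computing $\mu(A)$ amounts to normalizing the columns and taking the largest off-diagonal magnitude of the Gram matrix. A minimal sketch, assuming NumPy and with the function name mutual_coherence of our own choosing:

    import numpy as np

    def mutual_coherence(A):
        # Normalize each column of A to unit l2 norm.
        A_tilde = A / np.linalg.norm(A, axis=0)
        # Gram matrix of the normalized columns.
        G = A_tilde.T @ A_tilde
        # The coherence is the largest off-diagonal magnitude.
        np.fill_diagonal(G, 0.0)
        return np.abs(G).max()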
LEMMA 4 (see [46]). For any matrix $A \in \mathbb{R}^{n \times m}$, the following relationship holds:

(7) $\mathrm{spark}(A) \geq 1 + \frac{1}{\mu(A)}.$
Proof. First, modify the matrix $A$ by normalizing its columns to unit $\ell_2$ norm, obtaining $\tilde{A}$. This operation preserves both the spark and the mutual coherence. The entries of the resulting Gram matrix $G = \tilde{A}^T \tilde{A}$ satisfy the following properties:

$\{ G_{k,k} = 1 : 1 \leq k \leq m \}$ and $\{ |G_{k,j}| \leq \mu : 1 \leq k, j \leq m, \; k \neq j \}$.
Consider an arbitrary minor from $G$ of size $p \times p$, built by choosing a subgroup of $p$ columns from $\tilde{A}$ and computing their sub-Gram matrix. From the Gershgorin disk theorem [91], if this minor is diagonally dominant, i.e., if $\sum_{j \neq i} |G_{i,j}| < |G_{i,i}|$ for every $i$, then this submatrix of $G$ is positive definite, and so those $p$ columns from $\tilde{A}$ are linearly independent. Since $G_{i,i} = 1$ and each off-diagonal entry is at most $\mu$ in magnitude, the condition $p < 1 + 1/\mu$ implies positive definiteness of every $p \times p$ minor. Taking $p$ to be the largest such integer, $\mathrm{spark}(A) \geq p + 1 \geq 1 + 1/\mu$. $\Box$
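To see the lemma in action on a small example, one can compute the spark by brute force and compare it with $1 + 1/\mu(A)$. The sketch below is our own illustration, reusing the mutual_coherence sketch above; the subset search is exponential in $m$ and suitable only for toy sizes:

    from itertools import combinations
    import numpy as np

    def spark(A, tol=1e-10):
        m = A.shape[1]
        for p in range(1, m + 1):
            for cols in combinations(range(m), p):
                # A rank-deficient subset of p columns means spark(A) = p.
                if np.linalg.matrix_rank(A[:, cols], tol=tol) < p:
                    return p
        return np.inf  # every column subset is linearly independent

    A = np.random.randn(4, 6)  # random 4 x 6 dictionary
    assert spark(A) >= 1 + 1.0 / mutual_coherence(A)  # Lemma 4

For such a Gaussian matrix the spark is almost surely $n + 1 = 5$, while the coherence-based lower bound is typically much smaller, foreshadowing the comparison after Theorem 5.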
We have the following analogue of Theorem 2.
THEOREM 5 (uniqueness: mutual coherence [46]). If a system of linear equations $Ax = b$ has a solution $x$ obeying $\|x\|_0 < \frac{1}{2}\left(1 + \frac{1}{\mu(A)}\right)$, this solution is necessarily the sparsest possible.
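In practice, Theorem 5 yields a simple a posteriori certificate: given any solution of $Ax = b$, counting its nonzeros against $\frac{1}{2}(1 + 1/\mu(A))$ proves global optimality for (P0). A hedged sketch, with names and tolerances of our own choosing and reusing mutual_coherence from above:

    def is_provably_sparsest(A, x, b, tol=1e-8):
        # Certificate from Theorem 5: a feasible x with
        # ||x||_0 < (1 + 1/mu(A)) / 2 is the unique sparsest solution.
        if not np.allclose(A @ x, b, atol=tol):
            return False
        sparsity = np.count_nonzero(np.abs(x) > tol)
        return sparsity < 0.5 * (1.0 + 1.0 / mutual_coherence(A))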
Compare Theorems 2 and 5. They are parallel in form, but with different assumptions. In general, Theorem 2, which uses the spark, is sharp and far more powerful than Theorem 5, which uses the coherence and hence only a lower bound on the spark. The coherence can never be smaller than $1/\sqrt{n}$, and, therefore, the cardinality bound of Theorem 5 is never larger than $\sqrt{n}/2$. However, the spark can easily be as large as $n$, and Theorem 2 then gives a bound as large as $n/2$.
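The $1/\sqrt{n}$ coherence floor is attained, for instance, by the two-ortho dictionary concatenating the identity with a normalized Hadamard matrix. A quick numerical check, assuming SciPy is available and reusing mutual_coherence from above:

    import numpy as np
    from scipy.linalg import hadamard

    n = 16
    # Two-ortho dictionary [I, H/sqrt(n)]: identity plus normalized Hadamard.
    A = np.hstack([np.eye(n), hadamard(n) / np.sqrt(n)])
    print(mutual_coherence(A), 1 / np.sqrt(n))  # both print 0.25

With $\mu = 1/4$, the cardinality bound of Theorem 5 is $\frac{1}{2}(1 + 4) = 2.5$, i.e., uniqueness is certified for solutions with at most two nonzeros, on the $\sqrt{n}/2$ scale discussed above.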
We have now given partial answers to the questions Q1 and Q2 posed at the start of this section. We have seen that any sufficiently sparse solution is guaranteed to be unique among sufficiently sparse solutions. Consequently, any sufficiently sparse solution is necessarily the global optimizer of (P0). These results show that searching for a sparse solution can lead to a well-posed question with interesting properties. We now turn to discuss Q3: practical methods for obtaining solutions.
2.2. Pursuit Algorithms: Practice. A straightforward approach to solving (P0) seems hopeless; we now discuss methods which, it seems, have no hope of working, but which, under specific conditions, will work.
2.2.1. Greedy Algorithms. Suppose that the matrix $A$ has $\mathrm{spark}(A) > 2$ and the optimization problem (P0) has value $\mathrm{val}(P_0) = 1$, so $b$ is a scalar multiple of some column of the matrix $A$. We can identify this column by applying $m$ tests, one per column of $A$. This procedure requires $O(mn)$ flops, which may be considered reasonable. Now suppose that $A$ has $\mathrm{spark}(A) > 2k_0$ and the optimization problem is known to have value $\mathrm{val}(P_0) = k_0$. Then $b$ is a linear combination of at most $k_0$ columns of $A$. Generalizing the previous solution, one might try to enumerate all $\binom{m}{k_0} = O(m^{k_0})$ subsets of $k_0$ columns from $A$ and then to test each one. Enumeration takes $O(m^{k_0} n k_0^2)$ flops, which seems prohibitively slow in many settings.
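For concreteness, the enumeration just described can be written down directly: loop over all supports of size $k_0$ and test each by least squares. The sketch below, with a function name and tolerance of our own choosing, is a faithful but hopelessly slow baseline:

    from itertools import combinations
    import numpy as np

    def solve_p0_by_enumeration(A, b, k0, tol=1e-8):
        m = A.shape[1]
        for cols in combinations(range(m), k0):  # O(m^{k0}) subsets
            sub = A[:, cols]
            # Least-squares fit on this support: O(n k0^2) flops per subset.
            coef, *_ = np.linalg.lstsq(sub, b, rcond=None)
            if np.linalg.norm(sub @ coef - b) < tol:
                x = np.zeros(m)
                x[list(cols)] = coef
                return x
        return None  # no k0-sparse representation of b was found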