Skip to content Skip to navigation

Connexions

You are here: Home » Content » Compressive Sensing

Navigation

Content Actions

  • Download module PDF
  • Add to ...
    Add the module to:
    • My Favorites
    • A lens
    • An external social bookmarking service
    • My Favorites (What is 'My Favorites'?)
      'My Favorites' is a special kind of lens which you can use to bookmark modules and collections directly in Connexions. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need a Connexions account to use 'My Favorites'.
    • A lens (What is a lens?)

      Definition of a lens

      Lenses

      A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

      What is in a lens?

      Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

      Who can create a lens?

      Any individual Connexions member, a community, or a respected organization.

    • External bookmarks
  • E-mail the authors

Recently Viewed

This feature requires Javascript to be enabled.

Compressive Sensing

Module by: Mark Davenport, Ronald DeVore, Chris Rozell

We now consider a different setting. Suppose xNxNx in ℝ^N and we wish to sample xxx, where taking a sample means the application of a linear functional λNλNλ in ℝ^N to xxx. Next, we prescribe a budget of nnn samples, and consider all linear encoders using nnn samples. We can write these nnn linear functionals as an n×Nn×Nn times N matrix Φ:NnΦ:NnΦ :ℝ^N rightarrow ℝ^n. We then consider a decoder Δ:nNΔ:nNΔ :ℝ^n rightarrow ℝ^N. Our approximation to xxx is thus Δ(Φ(x))Δ(Φ(x))Δ { \( {Φ { \( x \)}} \)}.

To make the problem precise, we first pick a measure for distortion: error = x - Δ ( Φ ( x ) ) p . error = x - Δ ( Φ ( x ) ) p . We next must make some assumption about xxx. For example, we can assume that x Σ k = { x : x i = 0 for i Λ , Λ k }, x Σ k = { x : x i = 0 for i Λ , Λ k }, x in Σ_k ={ lbrace {x : x_i = 0 "for" i notin Λ , ♯ Λ <= k} rbrace} or x τ x τ or x w τ x w τ

We recall our basic problem: a signal xNxNx in ℝ^N will be “sampled” or “sensed” by applying the nnn linear projections represented by the columns of the sampling matrix Φn×NΦn×NΦ_{n times N}. The resulting measurements are given in the vector ynyny in ℝ^n, where y=Φxy=Φxy =Φ x. We will assume that n<Nn<Nn <N, meaning that in addition to thinking of of the sampling operation ΦΦΦ as an encoder, we can also view it as a projection to a lower dimensional linear subspace. In either case, we would like the measurements yyy to preserve as much information about the signal xxx as possible. To proceed in finding optimal solutions, we must formalize this problem and define how we will measure this information loss.

Moving forward, a critical quantity for us will be the null space of the sampling matrix, N=N(Φ)={x:Φx=0}N=N(Φ)={x:Φx=0}N =N { \( Φ \)} ={ lbrace {x : Φ x = 0} rbrace}. Because we are trying to take as few measurements as possible, we will assume that we are taking measurements efficiently so that the rows of ΦΦΦ are all linearly independent. Therefore, assume that rank(Φ)=nrank(Φ)=n"rank" { \( Φ \)} =n, implying that NNN has dimension (Nn)(Nn) \( {N - n} \). The non-trivial null space of ΦΦΦ means that it is not an invertible mapping. For any measurement vector yyy we can define the class of all observable signals that would result in the same measurement, (y):={x:Φx=y}N(y):={x:Φx=y}Nℱ { \( y \)} :={ lbrace {x : Φ x = y} rbrace} subseteq ℝ^N. The class (y)(y)ℱ { \( y \)} can always be written as a sum of a vector in the class and a vector in the null space of the sampling matrix, ( y ) = x 0 + N ( y ) = x 0 + N , where x 0 ( y ) x 0 ( y ) x 0 in ℱ { \( y \)} . If we have two vectors x 0 , x 1 ( y ) x 0 , x 1 ( y ) , then by linearity we know that Φ ( x 1 x 0 ) = 0 Φ ( x 1 x 0 ) = 0 . This fact implies that ( x 1 x 0 ) N ( x 1 x 0 ) N , and consequently that x 1 x 0 + N x 1 x 0 + N .

Associated to our encoder ΦΦΦ, we shall have to describe a decoder ΔΔΔ, which is a (not necessarily linear) map Δ:nNΔ:nNΔ :ℝ^n rightarrow ℝ^N. This decoder will take the measurements yyy and try to recover xxx as closely as possible, x¯=Δ(Φx)x¯=Δ(Φx){overline x} =Δ { \( {Φ x} \)}. In order to design the best decoder ΔΔΔ, we must specify our metric for measuring the quality of the estimate x¯x¯overline x. For the moment we shall always think of taking an optimal decoder for the problem at hand. Later we shall discuss specific and concrete decoders.

To begin our discussion of the efficiency of the enoder ΦΦΦ we consider the following problem.

Problem:

We will fix nnn and NNN and try to find ΦΦΦ and a decoder ΔΔΔ such that for all input signals in a sparsity class x Σ k x Σ k x in Σ k we can get perfect reconstruction, Δ(Φx)=xΔ(Φx)=xΔ { \( {Φ x} \)} =x. We will be interested in determining what the largest value of kkk is that we can find such an encoder/decoder pair.

An important role in this problem and later problems of compressed sensing is played by certain submatrices of Φ Φ Φ . Given a set T{1,2,,N}T{1,2,,N}T subseteq { lbrace {1 , 2 , dotslow , N} rbrace}, representing a collection of column indices we define the matrix ΦTΦTΦ_T as the one formed from ΦΦΦ by using the columns from the set TTT. The matrix ΦTΦTΦ_T is a (n×#(T))(n×#(T)) \( {n times italic "#" { \( T \)}} \) matrix. But sometimes we will also use the same notation ΦTΦTΦ_T to denote the matrix obtained from ΦΦΦ by setting all entries not in the columns of TTT to zero.

We can now state the following theorem.

theorem 1

The following statements are all equivalent for any given ΦΦΦ:

  1. There exists a decoder ΔΔΔ such that Δ(Φx)=xΔ(Φx)=xΔ { \( {Φ x} \)} =x for all x Σ k x Σ k x in Σ k .
  2. N ( Φ ) Σ 2 k = { 0 } N ( Φ ) Σ 2 k = { 0 } N { \( Φ \)} intersection Σ 2 k ={ lbrace 0 rbrace} .
  3. Φ T Φ T has rank #(T)#(T)italic "#" { \( T \)} for all TTT with #(T)=2k#(T)=2kitalic "#" { \( T \)} =2 k.
  4. Φ T t Φ T Φ T t Φ T is non-singular (i.e., invertible) for all TTT with #(T)=2k#(T)=2kitalic "#" { \( T \)} =2 k.

Proof

The equivalence of b c d b c d is simple linear algebra. First let us prove that ababa rightarrow b. Assume aaa. Suppose that there was a vector η N Σ 2 k η N Σ 2 k η in N intersection Σ 2 k . We know that we can write η = x 0 x 1 η = x 0 x 1 η =x 0 - x 1 , where x0,x1x0,x1x 0 ,x 1 both have support less than kkk. We could write ηηη simply as a composite vector with support 2k2k2 k where the first half of ηηη is x0x0x 0 and the second half of ηηη is x1x1x 1, η = ( x 0 x 1 ) η = ( x 0 x 1 ) η ={ \( {x 0 \lline x 1} \)}. We know that ηNηNη in N, which implies that Φη=0Φη=0Φ η =0. It then follows that Φ x 0 = Φ x 1 Φ x 0 = Φ x 1 Φ x 0 =Φ x 1, and we have as a consequence that Δ Φ x 0 = Δ Φ x 1 Δ Φ x 0 = Δ Φ x 1 Δ Φ x 0 =Δ Φ x 1 and finally that x 0 = x 1 x 0 = x 1 x 0 =x 1. This proves that η=0η=0η =0 as desired.

Now let us prove that babab rightarrow a. Suppose that we have a measurement Φx=ynΦx=ynΦ x =y in ℝ^n, where x Σ k x Σ k x in Σ k. We will define the decoder Δ(y)Δ(y)Δ { \( y \)} to be the signal ̄ x (y) ̄ x (y){̄x} in ℱ { \( y \)} with smallest support. Since xxx has support kk <= k so will ̄ x ̄ x ̄x. We claim that there is no other x(y)Σkx(y)Σkx^′ in ℱ { \( y \)} intersection Σ k. Indeed, if xxx^′ existed, then xxNΣ2kxxNΣ2kx - x^′ in N intersection Σ 2 k. But, the only vector in NΣ2kNΣ2kN intersection Σ 2 k is zero, implying that x=xx=xx =x^′. This finally gives us that Δ(Φx)=xΔ(Φx)=xΔ { \( {Φ x} \)} =x. □

Using the previous theorem, we can turn to the question of finding good encoder/decoder pairs.1 Given a fixed NNN, how large can kkk be, and what is the best ΦΦΦ? Given a fixed nnn, the largest kkk is k=n2k=n2k ={⌊ n over 2 ⌋}. Alternatively, we can say that given a fixed kkk, we need at least n=2kn=2kn =2 k measurements. Another way to say this is that there exist encoding matrices Φ2k×NΦ2k×NΦ_{2 k times N} such that any selection of 2k2k2 k columns are linearly independent. Examples are the DFT matrix or the Vandermonde matrix corresponding to interpolation at distinct points z1,,zNz1,,zNz_1 , dotslow ,z_N.

Footnotes

  1. A question about whether we will ever really find natural signals in ΣkΣkΣ k brings to ming a story... Once upon a time, a man was floating over the countryside in a hot air balloon. The man in the balloon yelled down to a stranger on the ground and asked “Where am I?” The man on the ground thought for about 5 minutes and then answered “You’re in a hot air balloon.” The man in the balloon responded with “You must be a mathematician,” to which the man on the ground answered “Yes, how did you know?” “Because,” replied the man in the balloon, “you had to think a long time before you answered, your answer was very precise, and your answer was completely useless!” So, yes, we may be dealing with a limited model, but we have to crawl before we can walk.

Comments, questions, feedback, criticisms?

Send feedback