<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE document PUBLIC "-//CNX//DTD CNXML 0.5 plus MathML//EN" "http://cnx.rice.edu/cnxml/0.5/DTD/cnxml_mathml.dtd">
<document xmlns="http://cnx.rice.edu/cnxml" xmlns:m="http://www.w3.org/1998/Math/MathML" xmlns:md="http://cnx.rice.edu/mdml/0.4" xmlns:bib="http://bibtexml.sf.net/" id="m10423">

  <name>Sub-Band Processing</name>

  <metadata>
  <md:version>2.14</md:version>
  <md:created>2001/12/31</md:created>
  <md:revised>2005/10/05 15:26:10.668 GMT-5</md:revised>
  <md:authorlist>
      <md:author id="schniter">
      <md:firstname>Phil</md:firstname>
      
      <md:surname>Schniter</md:surname>
      <md:email>schniter@ee.eng.ohio-state.edu</md:email>
    </md:author>
  </md:authorlist>

  <md:maintainerlist>
    <md:maintainer id="jgrab">
      <md:firstname>Jacob</md:firstname>
      
      <md:surname>Grabczewski</md:surname>
      <md:email>jgrab@owlnet.rice.edu</md:email>
    </md:maintainer>
    <md:maintainer id="mjhaag">
      <md:firstname>Michael</md:firstname>
      
      <md:surname>Haag</md:surname>
      <md:email>mjhaag@rice.edu</md:email>
    </md:maintainer>
  </md:maintainerlist>
  
  <md:keywordlist>
    <md:keyword>analysis</md:keyword>
    <md:keyword>constant-Q</md:keyword>
    <md:keyword>critically sampled</md:keyword>
    <md:keyword>perfect reconstrucion</md:keyword>
    <md:keyword>sub-band</md:keyword>
    <md:keyword>synthesis</md:keyword>
  </md:keywordlist>

  <md:abstract>This module covers the fundamentals of Filterbanks and explains why the are very useful in filter design.</md:abstract>
</metadata>


  <content>
    <section id="sec1">
      <name>Why Filterbanks?</name>
      
      <section id="s1_sub1">
	<name>Sub-band Processing</name>
	<para id="p1_s1s1">
	  There exist many applications in modern signal processing
	  where it is advantageous to separate a signal into different
	  frequency ranges called <term>sub-bands</term>.  The
	  spectrum might be partitioned in the uniform manner
	  illustrated in <cnxn strength="8" target="fig1"/>, where the
	  sub-band width
	  <m:math>
	    <m:apply>
	      <m:eq/>
	      <m:ci><m:msub>
		<m:mi>Δ</m:mi>
		<m:mi>k</m:mi>
	      </m:msub></m:ci>
	      <m:apply>
		<m:divide/>
		<m:apply>
		  <m:times/>
		  <m:cn>2</m:cn>
		  <m:pi/>
		</m:apply>
		<m:ci>M</m:ci>
	      </m:apply>
	    </m:apply>
	  </m:math> 
	  is identical for each sub-band and the band centers are
	  uniformly spaced at intervals of
	  <m:math>
	    <m:apply>
	      <m:divide/>
	      <m:apply>
		<m:times/>
		<m:cn>2</m:cn>
		<m:pi/>
	      </m:apply>
	      <m:ci>M</m:ci>
	    </m:apply>
	  </m:math>.
	</para>
	
	<figure id="fig1">
	  <media type="image/png" src="wf_f1.png"/>
	</figure>

	<para id="p1a_s1s1">
	  Alternatively, the sub-bands might have a logarithmic
	  spacing like that shown in <cnxn strength="8" target="fig2"/>.
	</para>
	
	<figure id="fig2">
	  <media type="image/png" src="wf_f2.png"/>
	</figure>

	<para id="p1b_s1s1">
	  For most of our discussion, we will focus on uniformly
	  spaced sub-bands.
	</para>

	<para id="p2_s1s1">
	  The separation into sub-band components is intended to make
	  further processing more convenient.  Some of the most
	  popular applications for sub-band decomposition are audio
	  and video source coding (with the goal of efficient storage
	  and/or transmission).
	</para>

	<para id="p3_s1s1">
	  <cnxn target="fig3" strength="8"/> illustrates the use of
	  sub-band processing in MPEG audio coding.  There a
	  psychoacoustic model is used to decide how much quantization
	  error can be tolerated in each sub-band while remaining
	  below the hearing threshold of a human listener.  In the
	  sub-bands that can tolerate more error, less bits are used
	  for coding.  The quantized sub-band signals can then be
	  decoded and recombined to reconstruct (an approximate
	  version of) the input signal.  Such processing allows, on
	  average, a 12-to-1 reduction in bit rate while still
	  maintaining "CD quality" audio.  The psychoacoustic model
	  takes into account the <term>spectral masking</term>
	  phenomenon of the human ear, which says that high energy in
	  one spectral region will limit the ear's ability to hear
	  details in nearby spectral regions.  Therefore, when the
	  energy in one sub-band is high, nearby sub-bands can be
	  coded with less bits without degrading the perceived quality
	  of the audio signal.  The MPEG standard specifies
	  32-channels of sub-band filtering.  Some psychoacoustic
	  models also take into account "temporal masking" properties
	  of the human ear, which say that a loud burst of sound will
	  temporarily overload the ear for short time durations,
	  making it possible to hide quantization noise in the time
	  interval after a loud sound burst.
	</para>

	<figure id="fig3">
	  <media type="image/png" src="wf_f3.png"/>
	</figure>

	<para id="p4_s1s1">
	  In typical applications, non-trivial signal processing takes
	  place between the bank of analysis filters and the bank of
	  synthesis filters, as shown in <cnxn target="fig4" strength="8"/>.  We will focus, however, on filterbank
	  design rather than on the processing that occurs between the
	  filterbanks.
	</para>

	<figure id="fig4">
	  <media type="image/png" src="wf_f4.png"/>
	</figure>

	<para id="p5_s1s1">
	  Our goals in filter design are:

	  <list id="list1" type="enumerated">
	    <item>
	      Good sub-band frequency separation
	      (<foreign>i.e.</foreign>, good "frequency selectivity").
	    </item>
	    <item>
	      Good reconstruction (<foreign>i.e.</foreign>, 
	      <m:math>
		<m:apply>
		  <m:approx/>
		  <m:apply>
		    <m:ci type="fn" class="discrete">y</m:ci>
		    <m:ci>n</m:ci>
		  </m:apply>
		  <m:apply>
		    <m:ci type="fn" class="discrete">x</m:ci>
		    <m:apply>
		      <m:minus/>
		      <m:ci>n</m:ci>
		      <m:ci>d</m:ci>
		    </m:apply>
		  </m:apply>
		</m:apply>
	      </m:math> for some integer delay
	      <m:math><m:ci>d</m:ci></m:math>) when the sub-band
	      processing is lossless.
	    </item>
	  </list>
	</para>		      
	
	<para id="p6_s1s1">
	  The first goal is driven by the assumption that the sub-band
	  processing works <emphasis>best</emphasis> when it is given
	  access to cleanly separated sub-band signals, while the
	  second goal is motivated by the idea that the sub-band
	  filtering should not limit the reconstruction performance
	  when the sub-band processing (<foreign>e.g.</foreign>, the
	  coding/decoding) is lossless or nearly lossless.
	</para>
      </section>
      
    </section>
    
  </content>
</document>
