Skip to content Skip to navigation

Connexions

You are here: Home » Content » Fixed-Point Quantization

Navigation

Content Actions

  • Download module PDF
  • Add to ...
    Add the module to:
    • My Favorites
    • A lens
    • An external social bookmarking service
    • My Favorites (What is 'My Favorites'?)
      'My Favorites' is a special kind of lens which you can use to bookmark modules and collections directly in Connexions. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need a Connexions account to use 'My Favorites'.
    • A lens (What is a lens?)

      Definition of a lens

      Lenses

      A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

      What is in a lens?

      Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

      Who can create a lens?

      Any individual Connexions member, a community, or a respected organization.

      What are tags? tag icon

      Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

    • External bookmarks
  • E-mail the author
  • Rate this module (How does the rating system work?)

    Rating system

    Ratings

    Ratings allow you to judge the quality of modules. If other users have ranked the module then its average rating is displayed below. Ratings are calculated on a scale from one star (Poor) to five stars (Excellent).

    How to rate a module

    Hover over the star that corresponds to the rating you wish to assign. Click on the star to add your rating. Your rating should be based on the quality of the content. You must have an account and be logged in to rate content.

    (0 ratings)

Lenses

What is a lens?

Definition of a lens

Lenses

A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual Connexions member, a community, or a respected organization.

What are tags? tag icon

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

This content is ...

In these lenses

  • Real-Time DSP with MATLAB display tagshide tags

    This module is included inLens: DSP with MATLAB lens
    By: Bhaskar BhattacharyaAs a part of collection:"Digital Signal Processing Laboratory (ECE 420 55x)"

    Comments:

    "Real-Time DSP with MATLAB"

    Click the "Real-Time DSP with MATLAB" link to see all content selected in this lens.

    Click the tag icon tag icon to display tags associated with this content.

Recently Viewed

This feature requires Javascript to be enabled.

Tags

(What is a tag?)

These tags come from the endorsement, affiliation, and other lenses that include this content.

Fixed-Point Quantization

Module by: Douglas L. Jones

Summary: Finite word lengths introduce quantization error in fixed-point systems. Truncation quantization causes a larger maximum error and a negative bias compared to rounding, but is easier to implement in hardware. Similarly, wraparound overflow is typically worse than saturation, but also requires more hardware.

Note: Your browser may not currently support MathML. See our browser support page for additional details. You can always view the correct math in the PDF version.

The fractional BB-bit two's complement number representation evenly distributes 2B 2 B quantization levels between -1 -1 and 12-(B1) 1 2 B 1 . The spacing between quantization levels is then 22B=2-(B1) Δ B 2 2 B 2 B 1 Δ B Any signal value falling between two levels is assigned to one of the two levels.

X Q =Qx X Q Q x is our notation for quantization. e=Qxx e Q x x is then the quantization error.

One method of quantization is rounding, which assigns the signal value to the nearest level. The maximum error is thus Δ B 2=2-B Δ B 2 2 B .

Figure 1
(a) (b)
Figure 1(a) (subfig1aFixed-PointQuant.png)Figure 1(b) (subfig1bFixed-PointQuant.png)

Another common scheme, which is often easier to implement in hardware, is truncation. Qx Q x assigns xx to the next lowest level.

Figure 2
(a) (b)
Figure 2(a) (subfig2aFixed-PointQuant.png)Figure 2(b) (subfig2bFixed-PointQuant.png)
The worst-case error with truncation is Δ=2-(B1) Δ 2 B 1 , which is twice as large as with rounding. Also, the error is always negative, so on average it may have a non-zero mean (i.e., a bias component).

Overflow is the other problem. There are two common types: two's complement (or wraparound) overflow, or saturation overflow.

Figure 3
wraparoundsaturation
(a) (b)
wraparound (subfig3aFixed-PointQuant.png)saturation (subfig3bFixed-PointQuant.png)
Obviously, overflow errors are bad because they are typically large; two's complement (or wraparound) overflow introduces more error than saturation, but is easier to implement in hardware. It also has the advantage that if the sum of several numbers is between -11 -1 1 , the final answer will be correct even if intermediate sums overflow! However, wraparound overflow leaves IIR systems susceptible to zero-input large-scale limit cycles, as discussed in another module. As usual, there are many tradeoffs to evaluate, and no one right answer for all applications.

Comments, questions, feedback, criticisms?

Send feedback