Skip to content Skip to navigation

Connexions

You are here: Home » Content » Fixed-Point Number Representation

Navigation

Content Actions

  • Download module PDF
  • Add to ...
    Add the module to:
    • My Favorites
    • A lens
    • An external social bookmarking service
    • My Favorites (What is 'My Favorites'?)
      'My Favorites' is a special kind of lens which you can use to bookmark modules and collections directly in Connexions. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need a Connexions account to use 'My Favorites'.
    • A lens (What is a lens?)

      Definition of a lens

      Lenses

      A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

      What is in a lens?

      Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

      Who can create a lens?

      Any individual Connexions member, a community, or a respected organization.

      What are tags? tag icon

      Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

    • External bookmarks
  • E-mail the author
  • Rate this module (How does the rating system work?)

    Rating system

    Ratings

    Ratings allow you to judge the quality of modules. If other users have ranked the module then its average rating is displayed below. Ratings are calculated on a scale from one star (Poor) to five stars (Excellent).

    How to rate a module

    Hover over the star that corresponds to the rating you wish to assign. Click on the star to add your rating. Your rating should be based on the quality of the content. You must have an account and be logged in to rate content.

    (0 ratings)

Lenses

What is a lens?

Definition of a lens

Lenses

A lens is a custom view of Connexions content. You can think of it as a fancy kind of list that will let you see Connexions through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to Connexions materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual Connexions member, a community, or a respected organization.

What are tags? tag icon

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

This content is ...

In these lenses

  • Real-Time DSP with MATLAB display tagshide tags

    This module is included inLens: DSP with MATLAB lens
    By: Bhaskar BhattacharyaAs a part of collection:"Digital Signal Processing Laboratory (ECE 420 55x)"

    Comments:

    "Real-Time DSP with MATLAB"

    Click the "Real-Time DSP with MATLAB" link to see all content selected in this lens.

    Click the tag icon tag icon to display tags associated with this content.

Recently Viewed

This feature requires Javascript to be enabled.

Tags

(What is a tag?)

These tags come from the endorsement, affiliation, and other lenses that include this content.

Fixed-Point Number Representation

Module by: Douglas L. Jones

Summary: Specialized DSP hardware typically uses fixed-point number representations for lower cost and complexity and greater speed. Interpretation of two's-complement binary numbers as signed fractions between -1 and 1 allows integer arithmetic to be used for DSP computations, but introduces quantization and overflow errors.

Note: Your browser may not currently support MathML. See our browser support page for additional details. You can always view the correct math in the PDF version.

Fixed-point arithmetic is generally used when hardware cost, speed, or complexity is important. Finite-precision quantization issues usually arise in fixed-point systems, so we concentrate on fixed-point quantization and error analysis in the remainder of this course. For basic signal processing computations such as digital filters and FFTs, the magnitude of the data, the internal states, and the output can usually be scaled to obtain good performance with a fixed-point implementation.

Two's-Complement Integer Representation

As far as the hardware is concerned, fixed-point number systems represent data as BB-bit integers. The two's-complement number system is usually used: k=binary integer representationif0k2B11bit-by-bit inverse-k+1if-2B1k0 k binary integer representation 0 k 2 B 1 1 bit-by-bit inverse k 1 2 B 1 k 0

Figure 1
Figure 1 (fig1FixedPoint.png)
The most significant bit is known at the sign bit; it is 0 when the number is non-negative; 1 when the number is negative.

Fractional Fixed-Point Number Representation

For the purposes of signal processing, we often regard the fixed-point numbers as binary fractions between -11 -1 1 , by implicitly placing a decimal point after the sign bit.

Figure 2
Figure 2 (fig2FixedPoint.png)
or x=- b 0 +i=1B1 b i 2-i x b 0 i B 1 1 b i 2 i This interpretation makes it clearer how to implement digital filters in fixed-point, at least when the coefficients have a magnitude less than 1.

Truncation Error

Consider the multiplication of two binary fractions

Figure 3
Figure 3 (fig3FixedPoint.png)
Note that full-precision multiplication almost doubles the number of bits; if we wish to return the product to a BB-bit representation, we must truncate the B1 B 1 least significant bits. However, this introduces truncation error (also known as quantization error, or roundoff error if the number is rounded to the nearest BB-bit fractional value rather than truncated). Note that this occurs after multiplication.

Overflow Error

Consider the addition of two binary fractions;

Figure 4
Figure 4 (fig4FixedPoint.png)
Note the occurence of wraparound overflow; this only happens with addition. Obviously, it can be a bad problem.

There are thus two types of fixed-point error: roundoff error, associated with data quantization and multiplication, and overflow error, associated with data quantization and additions. In fixed-point systems, one must strike a balance between these two error sources; by scaling down the data, the occurence of overflow errors is reduced, but the relative size of the roundoff error is increased.

Note:

Since multiplies require a number of additions, they are especially expensive in terms of hardware (with a complexity proportional to B x B h B x B h , where B x B x is the number of bits in the data, and B h B h is the number of bits in the filter coefficients). Designers try to minimize both B x B x and B h B h , and often choose B x B h B x B h !

Comments, questions, feedback, criticisms?

Send feedback