Skip to content Skip to navigation

Connexions

You are here: Home » Content » Delay and Sum Beamforming with a 2D Array: Introduction

Navigation

Lenses

What is a lens?

Definition of a lens

Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual member, a community, or a respected organization.

What are tags? tag icon

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

This content is ...

Affiliated with (What does "Affiliated with" mean?)

This content is either by members of the organizations listed or about topics related to the organizations listed. Click each link to see a list of all content affiliated with the organization.
  • Rice University ELEC 301 Projects

    This module is included inLens: Rice University ELEC 301 Project Lens
    By: Rice University ELEC 301As a part of collection: "ELEC 301 Projects Fall 2005"

    Click the "Rice University ELEC 301 Projects" link to see all content affiliated with them.

  • Rice Digital Scholarship

    This module is included in aLens by: Digital Scholarship at Rice UniversityAs a part of collection: "ELEC 301 Projects Fall 2005"

    Click the "Rice Digital Scholarship" link to see all content affiliated with them.

Also in these lenses

  • Lens for Engineering

    This module is included inLens: Lens for Engineering
    By: Sidney BurrusAs a part of collection: "ELEC 301 Projects Fall 2005"

    Click the "Lens for Engineering" link to see all content selected in this lens.

Recently Viewed

This feature requires Javascript to be enabled.
 

Delay and Sum Beamforming with a 2D Array: Introduction

Module by: Sara Joiner, Austin Bratton, Ray Hwong, Jeanne Guillory. E-mail the authors

Summary: In this module we discuss the theory behind delay and sum beamforming with a two-dimensional array, the background of our ELEC 301 project.

1. Introduction

Beamforming is the discipline that takes a set of microphones, usually in an array, and a set of point source signals (in a space that we assumed to be R3 or very nearly so) and tries to focus on one signal source to the exclusion of both noise and other signal sources. In this project, we assume a single source and use the old and powerful technique of delay and sum beamforming implemented over a 2-dimensional array arranged as a 3 by 3 square with the center missing. Having a 2-dimensional array allows the location of a source to be determined up to an ambiguity of reflection across plane of the array.

2. Delay and Sum Beamforming

Delay and sum beamforming is quite true to its name as it merely takes the set of signals, delays and maybe weights them by varying amounts, and then adds them all together. The size of the delays is determined by the direction (for farfield) or point (for nearfield) at which the set of microphones is aimed. Despite its simplicity, delay and sum manages to achieve optimal noise suppression for the case of a point source in a background of white noise. Of course, normal signal processing applies, and one can do better than just delay and sum if information about the signal other than location is known a priori. For example, if it is known that the signal is bandlimited and baseband, then a suitable lowpass filter can be applied to further suppress noise.

2.1 Nearfield Processing

Though not implemented, nearfield calculations are both more computationally intensive and accurate. If it is assumed that the microphones have some sort of center for distance, then the center can be designated as the origin for the coordinate system. A point source at a point (xs, ys, zs) would then emit a signal s(t). A microphone at a point (xm, ym, zm) would then receive a signal m(t). Assuming that signal propogates uniformly with speed v and that signal strength is equal to the original signal strength divided by the square of the distance, we can conclude that the received signal is:

Figure 1
 (equation.jpg)

2.2 Farfield Processing

In this project, it was assumed that the array was always operating in farfield, an approximation in which the source is assumed to be far enough away that the spherical waves it emits can be approximated with plane waves. It is accurate in the limit where the distance between the microphones and the source is large enough so that the angle between the source and each microphone does not change significantly.

3. Complications

3.1 Time Quantization

Since all of the processing is done in a digital environment, we must work with samples of the signals and not the signals themselves. Because of this, it is not possible to implement an arbitrary time shift as any shift must be done in increments of the sample period. To remedy this, the signals were interpolated digitally by upsampling them and then putting them through a lowpass filter with cutoff corresponding to the amount of upsampling. An equiripple filter was chosen for the lowpass filter as there appears to be no constraints as to the exact shape of the filter and because an equiripple filter would avoid the Gibb’s phenomena found in a direct approximation of an ideal lowpass filter. Using this interpolation, greater resolution can be achieved in the time shifts, though the drawback is the large amount of additional data that must now also be processed. In fact, even though the concept of delay and sum is incredibly simple, the amount of computation that must be done because of the upsampling is often prohibitively high. It is impossible for the amount of interpolation to be too high, but if it is too low, then it is entirely possible that the direction of the source will be inaccurate or entirely wrong as the algorithm will be unable to shift the signals to where they match enough.

3.2 Aliasing, Resolution, and Sampling Frequency

Given a fixed sampling frequency, there is always the ”normal” aliasing associated the Nyquist Theorem, restricting the fully reconstructable signals to those that are bandlimited to half of the sampling frequency. Something similar occurs with array spacing, and if proper care is not taken, aliasing may occur in spatial dimensions. Using the spatial analogue of the Nyquist Theorem, the minimum spacing between microphones must be at most half the wavelength corresponding to highest frequency present. Thus, to achieve any resolution at all for higher frequency signals, smaller arrays must be used; however, with a smaller array, the precision with which a direction can be determined is diminished. It appears that there is an uncertainty principal at odds with beamforming in its spatial dimensions.

3.3 Unknown Source Location

This is the main focus of the project: to try to locate a source using an array of microphones and then focus the array in the direction of the source, obtaining greater suppression of noise than would be possible using only one microphone. Since the direction of the source is unknown, we decided to scan for the source by sweeping all possibilities. This is where the far field approximation significantly reduces computational complexity. Using nearfield, any algorithm would be forced to evaluate all possible combinations of three coordinates. With farfield, there are only two angles to deal with as opposed to three coordinates so there is far less to compute.

Due to lack of computing power, we were forced to make a few, less-than-desirable assumptions in order to make the algorithm run at all without crashing. One of these simplifications was using only three of the microphones to perform the sweep of possible angles. A further simplification was to assume that the three microphones could be broken into two pairs in the calculations for determining the pair of angles from which the maximum was coming from. Further, hardware and computer limitations limited sampling to a rate of 8000 Hz from each of eight microphones and made the processing cost of upsampling prohibitive beyond a factor of around 10.

Content actions

Download module as:

PDF | EPUB (?)

What is an EPUB file?

EPUB is an electronic book format that can be read on a variety of mobile devices.

Downloading to a reading device

For detailed instructions on how to download this content's EPUB to your specific device, click the "(?)" link.

| More downloads ...

Add module to:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

Definition of a lens

Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual member, a community, or a respected organization.

What are tags? tag icon

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks