Skip to content Skip to navigation

Connexions

You are here: Home » Content » Focus Detection

Navigation

Recently Viewed

This feature requires Javascript to be enabled.
 

Focus detection

Focus Detection:

One important aspect of images is focus. While qualitatively deciding whether an image is in focus or not is relatively easy, quantitatively it can be quite difficult. One way to detect whether or not an image is in focus is by examining its power spectrum.

Power Spectrum and Focus

It is generally assumed that natural images are made up of fractals, and it can be shown that the power spectrum (power as a function of frequency) of a natural image should fall off as

1 f 2 1 f 2 size 12{ { {1} over {f rSup { size 8{2} } } } } {}
(1)

where f is the frequency.

As an image goes out of focus, it becomes blurred. That is to say that the edges are less sharp. If an image contains less sharp edges, its power spectrum will contain less high-frequency power. The power spectrum of an out-of-focus image should, therefore, fall off faster than an in-focus image.

So by calculating the power spectrum and examining its linear regression on a loglog plot (log[power] vs log[frequency]), we can get an indicator of focus.

Calculating the Power Spectrum

The power specturm is simply the square of the two dimensional Fourier transform:

P k x , k y = F k x , k y 2 P k x , k y = F k x , k y 2 size 12{P left (k rSub { size 8{x} } ,k rSub { size 8{y} } right )= lline F left (k rSub { size 8{x} } ,k rSub { size 8{y} } right ) rSup { size 8{2} } rline } {}
(2)

where the two dimensional Fourier transform is given by:

F k x , k y = x = 0 N 1 y = 0 N 1 f k x , k y 2 e j2π N xk x + yk y F k x , k y = x = 0 N 1 y = 0 N 1 f k x , k y 2 e j2π N xk x + yk y size 12{F left (k rSub { size 8{x} } ,k rSub { size 8{y} } right )= Sum cSub { size 8{x=0} } cSup { size 8{N - 1} } { Sum cSub { size 8{y=0} } cSup { size 8{N - 1} } {f left (k rSub { size 8{x} } ,k rSub { size 8{y} } right ) rSup { size 8{2} } } } e rSup { size 8{ { { - j2π} over {N} } left ( ital "xk" rSub { size 6{x} } + ital "yk" rSub { size 6{y} } right )} } } {}
(3)

Note that denotes an individual image pixel. You may have noticed that the above equations define a square image. While a non-symmetric two dimensional Fourier transform exists, using square images eases the process.

Because whether or not an image is in focus depends on the magnitude of power as a function of frequency, once the two dimensional power spectrum is computed as above, we radially average the spectrum. That is, the average of the values which lie on a circle a distance R from the origin is taken. Because frequency increases linearly in all directions from the origin, radially averaging the power spectrum gives the average power at one frequency, effectively collapsing the two dimensional spectrum to one dimension. It should be noted that Fkx,kyFkx,ky size 12{F left (k rSub { size 8{x} } ,k rSub { size 8{y} } right )} {} has been centered around baseband, meaning the frequency of the rotionally averaged power spectrum extends from 0 to N/2 -1.

The power spectrum’s falloff on a loglog plot can now be examined to determine focus.

Illustrative Example of Focus Analysis on Entire Image

The following images show the results of a linear regression of the power spectrum on a loglog plot for an in-focus image and an out-of-focus image.

Figure 1
Focus analysis of an in-focus image
Focus analysis of an in-focus image (graphics1.png)
Figure 2
Focus analysis of an out-of-focus image
Focus analysis of an out-of-focus image (graphics2.png)

As expected, the out-of-focus image yielded a linear regression with a slope of -3.3, while the in-focus image yielded a linear regression with a slope of -2.3, indicating that the out-of-focus image has fewer high frequency components.

Determining Regions of Focus

Because frequency and power should be related exponentially as stated before, the loglog plot should display a linear relationship. Taking the linear regression of the loglog plot leads to an estimate of the frequency fall off. For example, if the linear regression where to return a slope of -2, we know that the power spectrum falls off as 1f21f2 size 12{ { {1} over {f rSup { size 8{2} } } } } {}.

The same principles used to determine whether or not an image is in focus can be used to determine what region of an image is in focus. Because cameras can only focus on one spatial plane, in a single picture certain objects will be more in focus than others. To determine which region of an image is in focus, one simply has to divide the image into separate spatial region and then use the methods described above on each region. The region whose power spectrum conforms most closely to the 1f21f2 size 12{ { {1} over {f rSup { size 8{2} } } } } {} fall off can be considered the center of focus in the image.

Content actions

Download module as:

PDF | EPUB (?)

What is an EPUB file?

EPUB is an electronic book format that can be read on a variety of mobile devices.

Downloading to a reading device

For detailed instructions on how to download this content's EPUB to your specific device, click the "(?)" link.

| More downloads ...

Add module to:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

Definition of a lens

Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual member, a community, or a respected organization.

What are tags? tag icon

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks