Share on Facebook Share on Twitter Email
Answers.com

Fourier optics

 
Photography Encyclopedia: Fourier optics

The Fourier model for image formation provides a simple and elegant explanation of the limits of resolution of an optical system and of the nature of the image itself. The underlying principle is the same as that used for the analysis of sounds or electronic signals. The whole of sound reproduction theory is based on the principle that any sound, no matter how complex, is made up of sinusoidal (wave-form) components of various amplitudes, frequencies, and phases. The converse, that any sound pattern can be built up from an appropriate spectrum of sinusoids, is equally true. This is called respectively Fourier analysis and Fourier synthesis, after Joseph Fourier (1768-1830), who first worked out the mathematical principles.

The precise optical analogy is that the light beam emitted (reflected or transmitted) by any object, no matter how complex, is identical with the pattern of diffraction that would be transmitted by a spectrum of sinusoidal diffraction gratings of various transmittances, spatial frequencies, and spatial phases, and, for a two-dimensional object, orientations. Similarly, any desired light beam can be constructed by superimposing appropriate sinusoidal gratings on a plane wavefront. One small extension is necessary. Because (unlike a sonic or electrical signal) a transmittance cannot be negative, there is always a ‘d.c.’ component present in an optical grating, equal to half the amplitude of the grating.

The Fourier approach treats imaging as a diffraction phenomenon, and a lens as a device for performing an optical Fourier analysis (in two dimensions) of the sinusoidal functions that together create the shape of the wavefront from the object. The principle is approached by considering a simple sinusoidal grating (Fig. 1). This produces a single pair of diffracted plane wavefronts. If the grating is situated at the front focal plane of the lens, these wavefronts will converge towards the rear focal plane, forming two spots; this can be readily demonstrated using laser light. The electrical field in this plane is the Fourier transform (i.e. the spatial frequency spectrum) of the object grating, and represents—as it must—a single spatial frequency.

When the object is more complicated, the diffraction pattern at the rear focal plane is also more complicated. But it still consists of pairs of spots. Each pair indicates a specific amplitude, spatial frequency, and orientation of the originating sinusoidal grating by (respectively) their intensity, separation, and angle. Indeed, simple objects can be identified from their diffraction patterns. The letter V of (Fig. 2) shows the characteristic pattern of two rectangles at an angle to one another. One thing the visible pattern does not show is the spatial phases of the gratings; but they are nevertheless present in the electric field.

A fundamental property of a Fourier transform is that it obeys the commutative law. This implies that if a Fourier transform is carried out on a Fourier transform, the result is just the original function. So a second lens positioned with its front focal plane in the rear focal plane of the first will produce a replica of the original object at its own rear focal plane (Fig. 3). This, of course will not be news to a photographer: it is part of traditional geometrical optics. What was not generally appreciated, and not actually seen before lasers appeared, was the presence of the diffraction pattern that proved the existence of the optical Fourier transform in the principal focal plane of the lens. (It is, of course, present even with white light, but is smeared out by dispersion and lack of coherence.)

One important outcome of the Fourier principle is that it indicates the absolute limit of resolution of a lens. Because the highest spatial frequencies in the object wavefront are diffracted to the largest angle, they may miss the lens altogether (Fig. 4). Closing the lens aperture down progressively cuts off the higher spatial frequencies, just as applying a high-frequency blocking filter removes the highest sound frequencies from a hi-fi amplifier. The effect was first noticed in the early 1950s, when it was seen that the image of a Sayce grating (a rectangular intensity-profile grating, Fig. 5a) became markedly sinusoidal in profile at the fine-detail end as the aperture of the lens under test was closed down, indicating that the higher spatial frequencies present in the bars were not being recorded (Fig. 5b).

Fig.-1
Fig.-1


Fig.-2
Fig.-2


Fig.-3
Fig.-3


Fig.-4
Fig.-4


Fig.-5a
Fig.-5a


Fig.-5b
Fig.-5b

— Graham Saxby

Bibliography

  • Steward, E. G., Fourier Optics: An Introduction (1989).
  • Saxby, G., The Science of Imaging: An Introduction (2002)
Search unanswered questions...
Enter a question here...
Search: All sources Community Q&A Reference topics
Wikipedia: Fourier optics
Top

Fourier optics is the study of classical optics using techniques involving Fourier transforms and can be seen as an extension of the Huygens-Fresnel principle. The underlying idea is that any "wide" wave which moves forward (like an ocean wave coming towards the shore) can actually be thought of as an infinite amount of "wave points", all of which could (when they collide with something in the way) move relatively independently of each other. The theorem basically says square objects can be made by combining an infinite amount of curved objects. If a wave is far enough away from something that it can be simplified to a square block moving forward, a Fraunhofer diffraction would be created. When the wave is close enough that more attention must be paid to the individual wave points and the wave can only be simplified to a round ball instead of a square block, a Fresnel diffraction would be created. These mathematical simplifications and calculations are the realm of Fourier analysis and synthesis – together, they can describe what happens when light passes through various slits or mirrors curved one way or the other, or is fully or partially reflected. Fourier optics forms much of the theory behind image processing techniques, as well as finding applications where information needs to be extracted from optical sources such as in quantum optics. To put it in a slightly more complex way, similar to the concept of frequency and time used in traditional Fourier transform theory, Fourier optics makes use of the spatial frequency domain (kx, ky) as the conjugate of the spatial (x,y) domain. Terms and concepts such as transform theory, spectrum, bandwidth, window functions and sampling from one dimensional signal processing are commonly used.

Contents

Propagation of light through a homogeneous, source-free medium

Light can be described as a waveform propagating through free space (vacuum) or a material medium (such as air or glass). Mathematically, the amplitude of the wave is represented by a scalar wave function u that depends on both space and time:

 u = u(\mathbf{r},t)

where

 \mathbf{r} = (x,y,z)

represents position in three dimensional space, and t represents time.

The Wave Equation in the Time Domain

Fourier optics begins with the homogeneous, scalar wave equation (valid in source-free regions):


\left(\nabla^2-\frac{1}{c^2}\frac{\partial^2}{\partial{t}^2}\right)u(\mathbf{r},t)=0.

where u(r,t) is a real number, Cartesian component of an electromagnetic wave propagating through free space.

The Helmholtz Equation in the Frequency Domain

If it is assumed that the solution of this equation takes a time-harmonic form:

u(\mathbf{r},t) = \mathrm{Re} \left\{  \psi(\mathbf{r}) e^{j\omega t} \right\}

where

\psi(\mathbf{r}) = a(\mathbf{r}) e^{j \phi (\mathbf{r}) }

and this expression is substituted into the wave equation, the time-independent form of the wave equation may be derived, also known as the Helmholtz equation:

\left(\nabla^2+ k^2 \right) \psi (\mathbf{r})=0

where

k = { \omega \over c} = { 2 \pi \over \lambda }

is the wave number, j is the imaginary unit, and ψ(r) is the time-independent, complex-valued component of the propagating wave. Note that the propagation constant, k, and the frequency, ω, are linearly related to one another, a typical characteristic of transverse electromagnetic (TEM) waves.

Paraxial plane waves (Optic axis is assumed z-directed)

As will be shown rigorously in the following section, an elementary product solution to this equation takes the form:

\psi(\mathbf{r}) = A(\mathbf{k}) e^{-j \mathbf{k} \cdot \mathbf{r}}

where

\mathbf{k} = k_x \mathbf{x} + k_y \mathbf{y} + k_z\mathbf{z}

is the wave vector, and

 k = \|\mathbf{k}\| = \sqrt{k_x^2 + k_y^2  + k_z^2} = {\omega \over c}

is the wave number. Next, using the paraxial approximation, it is assumed that

k_x^2 + k_y^2 \ll k_z^2

or equivalently,

\sin \theta \approx \theta

where θ is the angle between the wave vector k and the z-axis.

As a result,

 k_z = k \cos \theta \approx k ( 1 - \theta^2 / 2)

and

\psi(\mathbf{r}) \approx A(\mathbf{k}) e^{-j(k_x x+k_y y)} e^{jkz \theta^2/2 }  e^{-jk z}

The paraxial wave equation

Substituting this expression into the Helmholtz equation, the paraxial wave equation is derived:

\nabla_T^2 A - 2jk { \partial A \over \partial z} = 0

where

\nabla_T^2 = {\partial^2 \over \partial x^2} + {\partial^2 \over \partial y^2}

is the transverse Laplacian operator, shown here in Cartesian coordinates.

The plane wave spectrum: the foundation of Fourier optics

The plane wave spectrum concept is the basic foundation of Fourier Optics. The plane wave spectrum is a continuous spectrum of uniform plane waves, and there is one plane wave component in the spectrum for every tangent point on the far-field phase front. The amplitude of that plane wave component would be the amplitude of the optical field at that tangent point. Again, this is true only in the far field, defined as: Range = 2 D2 / λ where D is the maximum linear extent of the optical sources and λ is the wavelength (Scott [1998]). The plane wave spectrum is often regarded as being discrete for certain types of periodic gratings, though in reality, the spectra from gratings are continuous as well, since no physical device can have the infinite extent required to produce a true line spectrum.

The plane wave component propagating parallel to the optic axis is analogous to the DC component of an electrical signal. Bandwidth in electrical signals relates to the difference between the highest and lowest frequencies present in the spectrum of the signal. For optical systems, bandwidth is a measure of how far a plane wave is tilted away from the optic axis, so for this reason, this type of bandwidth is often referred to as angular bandwidth or spatial bandwidth. It takes more frequency bandwidth to produce a short pulse in an electrical circuit, and more angular (or, spatial frequency) bandwidth to produce a sharp spot in an optical system (see discussion related to Point spread function).

The plane wave spectrum arises naturally as the eigenfunction solution to the homogeneous electromagnetic wave equation in rectangular coordinates (see also Electromagnetic radiation, which derives the wave equation from Maxwell's equations in source-free media, or Scott [1998]). In the frequency domain, the homogeneous electromagnetic wave equation in the frequency domain, or the Helmholtz equation, assumes the form:

  \nabla^2 E_u + k^2E_u = 0  ~~~~~~~~~~~~~~(2.0)

where u = x, y, z and k = 2π/λ, the wavenumber of the medium.

Eigenfunction solutions

In the case of differential equations, as in the case of matrix equations, whenever the right-hand side of an equation is zero (i.e., the forcing function / forcing vector is zero), the equation may still admit a non-trivial solution, known as an eigenfunction solution. In matrix algebra, this occurs when the determinant of the matrix is zero, i.e. when the matrix has no inverse. There is a similarity between eqn. (2.0) above, which may be written

  (\nabla^2 + k^2)~f = 0 ,

and the usual equation for the eigenvalues/eigenvectors of a square matrix, A,

  (\bold A - \lambda \bold I) ~ \bold x = 0  .

Both the scalar Laplacian, :   \nabla^2  and the matrix, A are linear operators. Finite matrices have only a finite number of eigenvalues/eigenvectors, whereas linear operators can have a countably infinite number of eigenvalues/eigenfunctions (in confined regions) or uncountably infinite (continuous) spectra of solutions, as in unbounded regions.

In certain physics applications, it is often the case that the elements of a matrix will be functions of frequency and wavenumber, and the matrix will be non-singular for most combinations of frequency and wavenumber, but will also be singular for certain other combinations. By finding which combinations of frequency and wavenumber drive the determinant of the matrix to zero, the propagation characteristics of the medium may be determined. Relations of this type, between frequency and wavenumber, are known as dispersion relations and some physical systems may admit many different kinds of dispersion relations. An example from electromagnetics is the ordinary waveguide, which may admit numerous dispersion relations, each associated with a unique mode of the waveguide. Each propagation mode of the waveguide is known as an eigenfunction solution (or eigenmode solution) to Maxwell's equations in the waveguide. Free space also admits eigenmode solutions, but with the distinction that for any given frequency, free space admits a continuous modal spectrum, whereas waveguides have a discrete mode spectrum.

Separation of variables

Solutions to the homogeneous wave equation (2.0) may be found in rectangular coordinates by using the principle of separation of variables for partial differential equations. This principle says that in separable orthogonal coordinates, an elementary product solution may be constructed to this wave equation of the following form:

  E_u(x, y, z) = f_x(x) \times f_y(y) \times f_z(z)

i.e., a solution which is expressed as the product of a function of x, times a function of y, times a function of z. If this elementary product solution is plugged into the wave equation, using the scalar Laplacian (aka, Laplace operator) in rectangular coordinates

 \nabla^2 E_u = \frac{\partial^2 E_u}{\partial x^2} + \frac{\partial^2 E_u}{\partial y^2} + \frac{\partial^2 E_u}{\partial z^2}

this is obtained

f''_x(x)f_y(y)f_z(z) + f_x(x)f''_y(y)f_z(z) + f_x(x)f_y(y)f''_z(z) + k^2f_x(x)f_y(y)f_z(z)=0  \,

which may be rearranged into the form:

   \frac{f''_x(x)}{f_x(x)}+ \frac{f''_y(y)}{f_y(y)} + \frac{f''_z(z)}{f_z(z)} + k^2=0

It may now be argued that each of the quotients in the equation above must, of necessity, be constant. For, say the first quotient is not constant, and is a function of x. None of the other terms in the equation has any dependence on the variable x. Therefore, the first term may not have any x-dependence either; it must be constant. The constant is referred to as -kx². Reasoning in a similar way for the y and z quotients, three ordinary differential equations are obtained for the fx, fy and fz, along with one separation condition:

\frac{d^2}{dx^2}f_x(x) + k_x^2 f_x(x)=0
\frac{d^2}{dy^2}f_y(y) + k_y^2 f_y(y)=0
\frac{d^2}{dz^2}f_z(z) + k_z^2 f_z(z)=0
k_x^2+k_y^2+k_z^2= k^2

Each of these 3 differential equations has the same solution, a complex exponential, so that the elementary product solution for Eu is:

E_u(x,y,z)=e^{-j(k_x x + k_y y + k_z z)}
=e^{-j(k_x x + k_y y)} e^{-j k_z z}
=e^{-j(k_x x + k_y y)} e^{\pm j z \sqrt{k^2-k_x^2-k_y^2} }

which represents a propagating or exponentially decaying uniform plane wave solution to the homogeneous wave equation. The - sign is used for a wave propagating/decaying in the +z direction and the + sign is used for a wave propagating/decaying in the -z direction (this follows the engineering time convention, which assumes an ejωt time dependence). This field represents a propagating plane wave when the quantity under the radical is positive, and an exponentially decaying wave when it is negative (in passive media, the root with a non-positive imaginary part must always be chosen, to represent uniform propagation or decay, but not amplification).

The superposition integral

A general solution to the homogeneous electromagnetic wave equation in rectangular coordinates is formed as a weighted superposition of all possible elementary plane wave solutions as:

E_u(x,y,z)=\int\!\!\!\int E_u(k_x,k_y) ~ e^{j(k_x x + k_y y)} ~ e^{\pm j z \sqrt{k^2-k_x^2-k_y^2} } ~ dk_x dk_y ~~~~~~~~~~~~~~~~~~(2.1)

where the integrals extend from minus infinity to infinity.

This plane wave spectrum representation of the electromagnetic field is the basic foundation of Fourier Optics (this point cannot be emphasized strongly enough), because when z=0, the equation above simply becomes a Fourier transform (FT) relationship between the field and its plane wave content (hence the name, "Fourier optics").

All spatial dependence of the individual plane wave components is described explicitly via the exponential functions. The coefficients of the exponentials are only functions of spatial wavenumber kx, ky, just as in ordinary Fourier analysis and Fourier transforms.

Free space as a low-pass filter

When

   k_x^2+k_y^2 > k^2

the plane waves are evanescent (decaying), so that any spatial frequency content in an object plane transparency which is finer that one wavelength will not be transferred over to the image plane, simply because the plane waves corresponding to that content cannot propagate. In connection with lithography of electronic components, this phenomenon is known as the diffraction limit and is the reason why light of progressively higher frequency (smaller wavelength) is required for etching progressively finer features in integrated circuits.

The far field approximation and the concept of angular bandwidth

The equation above may be evaluated asymptotically in the far field (using stationary phase approximation) to show that the field at the point (x,y,z) is indeed due solely to the plane wave component (kx, ky, kz) which propagates parallel to the vector (x,y,z), and whose plane is tangent to the phasefront at (x,y,z). The mathematical details of this process may be found in Scott [1998] or Scott [1990]. The result of performing a stationary phase integration on the expression above is the following expression,

E_u(r,\theta,\phi)~=~2 \pi j~ (k~\cos\theta)~ \frac{e^{-jkr}}{r}~ E_u(k~\sin\theta~\cos\phi,k~\sin\theta~\sin\phi) ~~~~~~~~~~~~(2.2)

which clearly indicates that the field at (x,y,z) is directly proportional to the spectral component in the direction of (x,y,z), where,

 x = r ~ \sin \theta ~ \cos \phi
 y = r ~ \sin \theta ~ \sin \phi
 z = r ~ \cos \theta ~

and

 k_x = k ~ \sin \theta ~ \cos \phi
 k_y = k ~ \sin \theta ~ \sin \phi
 k_z = k ~ \cos \theta ~

Stated another way, the radiation pattern of any planar field distribution is the FT of that source distribution (see Huygens-Fresnel principle, wherein the same equation is developed using a Green's function approach). Note that this is NOT a plane wave, as many might think. The  \frac{e^{-jkr}}{r} radial dependence is a spherical wave - both in magnitude and phase - whose local amplitude is the FT of the source plane distribution at that far field angle. The plane wave spectrum has nothing to do with saying that the field behaves something like a plane wave for far distances.

Equation (2.2) above is critical to making the connection between spatial bandwidth (on the one hand) and angular bandwidth (on the other), in the far field. Note that the term "far field" usually means we're talking about a converging or diverging spherical wave with a pretty well defined phase center. The connection between spatial and angular bandwidth in the far field is essential in understanding the low pass filtering property of thin lenses. See section 5.1.3 for the condition defining the far field region.

Once the concept of angular bandwidth is understood, the optical scientist can "jump back and forth" between the spatial and spectral domains to quickly gain insights which would ordinarily not be so readily available just through spatial domain or ray optics considerations alone. For example, any source bandwidth which lies past the edge angle to the first lens (this edge angle sets the bandwidth of the optical system) will not be captured by the system to be processed.

K-Space

The separation condition,

k_x^2+k_y^2+k_z^2=k^2

which so closely resembles the equation for the Euclidian norm of a vector in terms of its rectangular components, suggests the notion of k-vector, or wave vector, defined (for propagating plane waves) in rectangular coordinates as

\bold k = k_x \hat \bold x + k_y \hat \bold y + k_z \hat \bold z

and in the spherical coordinate system as

 k_x = k ~ \sin \theta ~ \cos \phi
 k_y = k ~ \sin \theta ~ \sin \phi
 k_z = k ~ \cos \theta ~

We'll make use of these spherical coordinate system relations in the next section.

Fourier's theorem: Fourier transform pairs

The two-dimensional Fourier transform pairs

Analysis Equation:

F(k_x,k_y) = \int_{-\infty}^{\infty}  \int_{-\infty}^{\infty} f(x,y) e^{-j(k_x x + k_y y)} dx dy

Synthesis Equation:

 f(x,y) = \frac{1}{4\pi^2}\int_{-\infty}^{\infty}  \int_{-\infty}^{\infty} F(k_x,k_y) e^{j(k_x x + k_y y)} dk_x dk_y

note: the normalizing factor of:\frac{1}{4\pi^2} is present whenever angular frequency (radians) is used, but not when ordinary frequency (cycles) is used.

Optical systems: General overview and analogy with electrical signal processing systems

An optical system consists of an input plane, and output plane, and a set of components that transforms the image f formed at the input into a different image g formed at the output. The output image is related to the input image by convolving the input image with the optical impulse response, h (known as the point-spread function, for focused optical systems). The impulse response uniquely defines the input-output behavior of the optical system. By convention, the optic axis of the system is taken as the z-axis. As a result, the two images and the impulse response are all functions of the transverse coordinates, x and y.

The impulse response of an optical imaging system is the output plane field which is produced when an ideal mathematical point source of light is placed in the input plane (usually on-axis). In practice, it is not necessary to have an ideal point source in order to determine an exact impulse response. This is because any source bandwidth which lies outside the bandwidth of the system won't matter anyway, so therefore it's not necessary in determining the impulse response. The source only needs to have as much (angular) bandwidth as the optical system.

Optical systems typically fall into one of two different categories. The first is the ordinary focused optical imaging system, wherein the input plane is called the object plane and the output plane is called the image plane. The field in the image plane is desired to be a high-quality reproduction of the field in the object plane. In this case, the impulse response of the optical system is desired to approximate a 2D delta function, at the same location (or a scaled location) in the output plane corresponding to the location of the impulse in the input plane. The actual impulse response typically resembles an Airy function, whose radius is on the order of the wavelength of the light used. In this case, the impulse response is typically referred to as a point spread function, since the mathematical point of light in the object plane has been spread out into an Airy function in the image plane.

The second type is the optical image processing system, in which a significant feature in the input plane field is to be located and isolated. In this case, the impulse response of the system is desired to be a close replica (picture) of that feature which is being searched for in the input plane field, so that a convolution of the feature against the input plane field will produce a bright spot at the feature location in the output plane.

Input plane

The input plane is defined as the locus of all points such that z = 0. The input image f is therefore

f(x,y) = U(x,y,z)\big|_{z=0}

Output plane

The out plane is defined as the locus of all points such that z = d. The output image g is therefore

g(x,y) = U(x,y,z)\big|_{z=d}

The 2D convolution of input function against the impulse response function

 g(x,y) ~ = ~ h(x,y) * f(x,y)

i.e.,

 g(x,y)= \int_{-\infty}^{\infty}  \int_{-\infty}^{\infty}  h(x-x', y-y') ~ f(x',y') ~ dx' dy'   ~~~~~~(4.1)  ~

The alert reader will note that the integral above tacitly assumes that the impulse response is NOT a function of the position (x',y') of the impulse of light in the input plane (if this were not the case, this type of convolution would not be possible). This property is known as shift invariance (Scott [1998]). No optical system is perfectly shift invariant: as the ideal, mathematical point of light is scanned away from the optic axis, aberrations will eventually degrade the impulse response (known as a coma in focused imaging systems). However, high quality optical systems are often "shift invariant enough" over certain regions of the input plane that the equation above can be used with impunity for certain calculations.

Also, this equation assumes unit magnification. If magnification is present, then eqn. (4.1) becomes

 g(x,y)= \int_{-\infty}^{\infty}  \int_{-\infty}^{\infty}  h_M(x-Mx', y-My') ~ f(x',y') ~ dx' dy'   ~~~~~~(4.2)  ~

which basically translates the impulse response function, hM(), from x' to x=Mx'. In (4.2), hM() will be a magnified version of the impulse response function h() of a similar, unmagnified system, so that hM(x,y) =h(x/M,y/M).

Derivation of the convolution equation

The extension to two dimensions is trivial, except for the difference that causality exists in the time domain, but not in the spatial domain. Causality means that the impulse response h(t - t') of an electrical system, due to an impulse applied at time t', must of necessity be zero for all times t such that t - t' < 0.

Obtaining the convolution representation of the system esponse requires representing the input signal as a weighted superposition over a train of impulse functions by using the sifting property of Dirac delta functions.

  f(t) = \int_{-\infty}^{\infty}    \delta(t-t') f(t') dt'

It is then presumed that the system under consideration is linear, that is to say that the output of the system due to two different inputs (possibly at two different times) is the sum of the individual outputs of the system to the two inputs, when introduced individually. Thus the optical system may contain no nonlinear materials nor active devices (except possibly, extremely linear active devices). The output of the system is then simplified to a single delta function input, which would be the impulse response of the system, h(t - t'). Thus, the output of the linear system to a general input function f(t) is shown to be:

  Output(t) = \int_{-\infty}^{\infty}    h(t-t') f(t') dt'

where h(t - t') is the (impulse) response of the linear system to the delta function input δ(t - t'), applied at time t'. This is where the convolution equation above comes from. The convolution equation is useful because it is often much easier to find the response of a system to a delta function input - and then perform the convolution above to find the response to an arbitrary input - than it is to try and find the response to the arbitrary input directly. Also, the impulse response (in either time or frequency domains) usually yields insight to relevant figures of merit of the system. In the case of most lenses, the point spread function (PSF) is a pretty common figure of merit for evaluation purposes.

The same logic is used in connection with the Huygens-Fresnel principle, or Stratton-Chu formulation, wherein the "impulse response" is referred to as the Green's function of the system. So the spatial domain operation of a linear optical system is analogous in this way to the Huygens-Fresnel principle.

System transfer function

If the last equation above is Fourier transformed, it becomes:

  Output(\omega) ~ = ~  H(\omega)~ F(\omega)

where

 Output(\omega) ~ is the spectrum of the output signal
 H(\omega) ~ is the system transfer function
 F(\omega) ~ is the spectrum of the input signal

In like fashion, (4.1) may be Fourier transformed to yield:

 G(k_x,k_y) ~ = ~ H(k_x,k_y) ~ F(k_x,k_y)

Once again it may be noted from the discussion on the Abbe sine condition, that this equation assumes unit magnification.

This equation takes on its real meaning when the Fourier transform, : ~G(k_x,k_y) is associated with the coefficient of the plane wave whose transverse wavenumbers are: :~ (k_x,k_y). Thus, the input-plane plane wave spectrum is transformed into the output-plane plane wave spectrum through the multiplicative action of the system transfer function.

Applications of Fourier optics

Fourier optics is used in the field of optical information processing, the staple of which is the classical 4F processor.

The Fourier transform properties of a lens provide numerous applications in optical signal processing such as spatial filtering, optical correlation and computer generated holograms.

Fourier optical theory is used in interferometry, optical tweezers, atom traps, and quantum computing. Concepts of Fourier optics are used to reconstruct the phase of light intensity in the spatial frequency plane (see adaptive-additive algorithm).

Fourier transforming property of lenses

If a transmissive object is placed one focal length in front of a lens, then its Fourier transform will be formed one focal length behind the lens. Consider the figure to the right (click to enlarge)

On the Fourier transforming property of lenses

In this figure, a plane wave incident from the left is assumed. The transmittance function in the front focal plane (i.e., Plane 1) spatially modulates the incident plane wave in magnitude and phase, like on the left-hand side of eqn. (2.1) (specified to z=0), and in so doing, produces a spectrum of plane waves corresponding to the FT of the transmittance function, like on the right-hand side of eqn. (2.1) (for z>0). The various plane wave components propagate at different tilt angles with respect to the optic axis of the lens (i.e., the horizontal axis). The finer the features in the transparency, the broader the angular bandwidth of the plane wave spectrum. We'll consider one such plane wave component, propagating at angle θ with respect to the optic axis. It is assumed that θ is small (paraxial approximation), so that

\frac{k_x}{k} = \sin \theta \cong \theta

and

\frac{k_z}{k} = \cos \theta \cong 1 - \frac{\theta^2}{2}

and

 \frac{1}{\cos \theta} \cong \frac{1}{1 - \frac{\theta^2}{2}} \cong 1 + \frac{\theta^2}{2}

In the figure, the plane wave phase, moving horizontally from the front focal plane to the lens plane, is

 e^{j k f \cos \theta} \,

and the spherical wave phase from the lens to the spot in the back focal plane is:

 e^{j k f / \cos \theta} \,

and the sum of the two path lengths is f (1 + θ2/2 + 1 - θ2/2) = 2f i.e., it is a constant value, independent of tilt angle, θ, for paraxial plane waves. Each paraxial plane wave component of the field in the front focal plane appears as a Point spread function spot in the back focal plane, with an intensity and phase equal to the intensity and phase of the original plane wave component in the front focal plane. In other words, the field in the back focal plane is the Fourier transform of the field in the front focal plane.

All FT components are computed simultaneously - in parallel - at the speed of light. As an example, light travels at a speed of roughly 1 ft (0.30 m). / ns, so if a lens has a 1 ft (0.30 m). focal length, an entire 2D FT can be computed in about 2 ns (2 x 10−9 seconds). If the focal length is 1 in., then the time is under 200 ps. No electronic computer can compete with these kinds of numbers or perhaps ever hope to, although new supercomputers such as the petaflop IBM Roadrunner may actually prove faster than optics, as improbable as that may seem. However, their speed is obtained by combining numerous computers which, individually, are still slower than optics. The disadvantage of the optical FT is that, as the derivation shows, the FT relationship only holds for paraxial plane waves, so this FT "computer" is inherently bandlimited. On the other hand, since the wavelength of visible light is so minute in relation to even the smallest visible feature dimensions in the image i.e.,

 k^2 >>  k_x ^2 + k_y ^2

(for all kx, ky within the spatial bandwidth of the image, so that kz is nearly equal to k), the paraxial approximation is not terribly limiting in practice. And, of course, this is an analog - not a digital - computer, so precision is limited. Also, phase can be challenging to extract; often it is inferred interferometrically.

Optical processing is especially useful in real time applications where rapid processing of massive amounts of 2D data is required, particularly in relation to pattern recognition.

Object truncation and Gibbs phenomenon

Spatially modulated electric field, shown on the left-hand side of eqn. (2.1), typically only occupies a finite (usually rectangular) aperture in the x,y plane. The rectangular aperture function acts like a 2D square-top pulse function, where the field is assumed to be zero outside this 2D rectangle. The spatial domain integrals for calculating the FT coefficients on the right-hand side of eqn. (2.1) are truncated at the boundary of this aperture. This step truncation can introduce inaccuracies in both theoretical calculations and measured values of the plane wave coefficients on the RHS of eqn. (2.1).

Whenever a function is discontinuously truncated in one FT domain, broadening and rippling are introduced in the other FT domain. A perfect example from optics is in connection with the Point spread function, which for on-axis plane wave illumination of a quadratic lens (with circular aperture), is an Airy function, J1(x)/x. Literally, the point source has been "spread out" (with ripples added), to form the Airy point spread function (as the result of truncation of the plane wave spectrum by the finite aperture of the lens). This source of error is known as Gibbs phenomenon and it may be mitigated by simply ensuring that all significant content lies near the center of the transparency, or through the use of window functions which smoothly taper the field to zero at the frame boundaries. By the convolution theorem, the FT of an arbitrary transparency function - multiplied (or truncated) by an aperture function - is equal to the FT of the non-truncated transparency function convolved against the FT of the aperture function, which in this case becomes a type of "Greens function" or "impulse response function" in the spectral domain. Therefore, the image of a circular lens is equal to the object plane function convolved against the Airy function (the FT of a circular aperture function is J1(x)/x and the FT of a rectangular aperture function is a product of sinc functions, sin x/x).

Fourier analysis and functional decomposition

Even though the input transparency only occupies a finite portion of the x-y plane (Plane 1), the uniform plane waves comprising the plane wave spectrum occupy the entire x-y plane, which is why (for this purpose) only the longitudinal plane wave phase (in the z-direction, from Plane 1 to Plane 2) must be considered, and not the phase transverse to the z-direction. It is of course, very tempting to think that if a plane wave emanating from the finite aperture of the transparency is tilted too far from horizontal, it will somehow "miss" the lens altogether but again, since the uniform plane wave extends infinitely far in all directions in the transverse (x-y) plane, the planar wave components cannot miss the lens.

This issue brings up perhaps the predominant difficulty with Fourier analysis, namely that the input plane function, defined over a finite support (i.e., over its own finite aperture), is being approximated with other functions (sinusiods) which have infinite support (i.e., they are defined over the entire infinite x-y plane). This is unbelievably inefficient computationally, and is the principal reason why wavelets were conceived, that is to represent a function (defiined on a finite interval or area) in terms of oscillatory functions which are also defined over finite intervals or areas. Thus, instead of getting the frequency content of the entire image all at once (along with the frequency content of the entire rest of the x-y plane, over which the image has zero value), the result is instead the frequency content of different parts of the image, which is usually much simpler. Unfortunately, wavelets in the x-y plane don't correspond to any known type of propagating wave function, in the same way that Fourier's sinusoids (in the x-y plane) correspond to plane wave functions in three dimensions. However, the FTs of most wavelets are well known and could possibly be shown to be equivalent to some useful type of propagating field.

On the other hand, Sinc functions and Airy functions - which are not only the point spread functions of rectangular and circular apertures, respectively, but are also cardinal functions commonly used for functional decomposition in interpolation/sampling theory [Scott 1990] - do correspond to converging or diverging spherical waves, and therefore could potentially be implemented as a whole new functional decomposition of the object plane function, thereby leading to another point of view similar in nature to Fourier optics. This would basically be the same as conventional ray optics, but with diffraction effects included. In this case, each point spread function would be a type of "smooth pixel," in much the same way that a soliton on a fiber is a "smooth pulse."

Perhaps a lens figure-of-merit in this "point spread function" viewpoint would be to ask how well a lens transforms an Airy function in the object plane into an Airy function in the image plane, as a function of radial distance from the optic axis, or as a function of the size of the object plane Airy function. This is kind of like the Point spread function, except now we're really looking at it as a kind of input-to-output plane transfer function (like MTF), and not so much in absolute terms, relative to a perfect point. Similarly, Gaussian wavelets, which would correspond to the waist of a propagating Gaussian beam, could also potentially be used in still another functional decomposition of the object plane field.

Far-field range and the 2D2 / λ criterion

In the figure above, illustrating the Fourier transforming property of lenses, the lens is in the near field of the object plane transparency, therefore the object plane field may be regarded as being a superposition of plane waves, each one of which propagates to the lens. This is defined via the far-field criterion as: Range = 2 D2 / λ where D is the maximum linear extent of the optical sources and λ is the wavelength (Scott [1998]). D of the transparency is on the order of cm (10−2 m) and the wavelength of light is on the order of 10−6 m, therefore D/λ is on the order of 104. This times D is on the order of 102 m, or hundreds of meters. On the other hand the far field distance from a PSF spot is on the order of λ. This is because D for the spot is on the order of λ, so that D/λ is on the order of unity. One times D (i.e., λ) is on the order of λ (10−6 m).

Since the lens is in the far field of any PSF spot, the field incident on the lens from the spot may be regarded as being an asymptotic spherical wave, as in eqn. (2.2), not as a plane wave spectrum, as in eqn. (2.1). On the other hand, the lens is in the near field of the entire input plane transparency, therefore eqn. (2.1) - the full plane wave spectrum - accurately represents the field incident on the lens from that larger, extended source.

Lens as a low-pass filter

A lens is basically a low-pass plane wave filter (see Low-pass filter). Consider a "small" light source located on-axis in the object plane of the lens. It is assumed that the source is small enough that, by the far-field criterion, the lens is in the far field of the "small" source. Then, the field radiated by the small source is a spherical wave which is modulated by the FT of the source distribution, as in eqn. (2.2), Then, the lens passes - from the object plane over onto the image plane - only that portion of the radiated spherical wave which lies inside the edge angle of the lens. In this far-field case, truncation of the radiated spherical wave is equivalent to truncation of the plane wave spectrum of the small source. So, the plane wave components in this far-field spherical wave, which lie beyond the edge angle of the lens, are not captured by the lens and are not transferred over to the image plane. Note: this logic is valid only for small sources, such that the lens is in the far field region of the source, according to the 2 D2 / λ criterion mentioned previously. If an object plane transparency is imagined as a summation over small sources (as in the Whittaker-Shannon interpolation formula, Scott [1990]), each of which has its spectrum truncated in this fashion, then every point of the entire object plane transparency suffers the same effects of this low pass filtering.

Loss of the high (spatial) frequency content causes blurring and loss of sharpness (see discussion related to Point spread function). Bandwidth truncation causes a (fictitious, mathematical, ideal) point source in the object plane to be blurred (or, spread out) in the image plane, giving rise to the term, "point spread function." Whenever bandwidth is expanded or contracted, image size is typically contracted or expanded accordingly, in such a way that the space-bandwidth product remains constant, by Heisenberg's principle (Scott [1998] and Abbe sine condition).

Coherence and Fourier transforming

While working in the frequency domain, with an assumed ejωt (engineering) time dependence, coherent (laser) light is implicitly assumed, which has a delta function dependence in the frequency domain. Light at different (delta function) frequencies will "spray" tha plane wave spectrum out at different angles, and as a result these plane wave components will be focused at different places in the output plane. The Fourier transforming property of lenses works best with coherent light, unless there is some special reason to combine light of different frequencies, to achieve some special purpose.

4F Correlator

When the plane wave spectrum representation of the electric field (section 2) is combined with the Fourier transforming property of quadratic lenses (section 5.1), it leads naturally to the development of numerous 2D image processing devices (this section). One of the primary applications of Fourier Optics is in the mathematical operations of cross-correlation and convolution, and these have historically been done with a device known as a 4F correlator, shown in the figure below (click to enlarge).

4F Correlator

The 4F correlator is based on the convolution theorem from Fourier transform theory, which states that convolution in the spatial (x,y) domain is equivalent to direct multiplication in the spatial frequency (kx, ky) domain. Once again, a plane wave is assumed incident from the left and a transparency containing one 2D function, f(x,y), is placed in the input plane of the correlator, located one focal length in front of the first lens. The transparency spatially modulates the incident plane wave in magnitude and phase, like on the left-hand side of eqn. (2.1), and in so doing, produces a spectrum of plane waves corresponding to the FT of the transmittance function, like on the right-hand side of eqn. (2.1). That spectrum is then formed as an "image" one focal length behind the first lens, as shown. A transmission mask containing the FT of the second function, g(x,y), is placed in this same plane, one focal length behind the first lens, causing the transmission through the mask to be equal to the product, F(kx,ky) x G(kx,ky). This product now lies in the "input plane" of the second lens (one focal length in front), so that the FT of this product (i.e., the convolution of f(x,y) and g(x,y)), is formed in the back focal plane of the second lens.

If an ideal, mathematical point source of light is placed on-axis in the input plane of the first lens, then there will be a uniform, collimated field produced in the output plane of the first lens. When this uniform, collimated field is multiplied by the FT plane mask, and then Fourier transformed by the second lens, the output plane field (which in this case is the impulse response of the correlator) is just our correlating function, g(x,y). In practical applications, g(x,y) will be some type of feature which must be identified and located within the input plane field (see Scott [1998]). In military applications, this feature may be a tank, ship or airplane which must be quickly identified within some more complex scene.

The 4F correlator is an excellent device for illustrating the "systems" aspects of optical instruments, alluded to in section 4 above. The FT plane mask function, G(kx,ky) is the system transfer function of the correlator, which we'd in general denote as H(kx,ky), and it is the FT of the impulse response function of the correlator, h(x,y) which is just our correlating function g(x,y). And, as mentioned above, the impulse response of the correlator is just a picture of the feature we're trying to find in the input image. In the 4F correlator, the system transfer function H(kx,ky) is directly multiplied against the spectrum F(kx,ky) of the input function, to produce the spectrum of the output function. This is how electrical signal processing systems operate on 1D temporal signals.

Afterword: Plane Wave Spectrum Within the Broader Context of Functional Decomposition

Electrical fields are really just particular types of mathematical functions and, as such, may often be represented in many different ways. In the Huygens-Fresnel or Stratton-Chu viewpoints, the electric field is represented as a superposition of point sources, each one of which gives rise to a Green's function field. The total field is then the weighted sum of all of the individual Greens function fields. That seems to be the most natural way of viewing the electric field for most people - no doubt because most of us have, at one time or another, drawn out the circles with protractor and paper, much the same way Thomas Young did in his class paper on the Double-slit experiment. However, it is by no means the only way to represent the electric field, which may also be represented as a spectrum of sinusoidally varying plane waves. In addition, Frits Zernike proposed still another functional decomposition based on his Zernike polynomials, defined on the unit disc. The third-order (and lower) Zernike polynomials correspond to the normal lens aberrations. And still another functional decomposition could be made in terms of Sinc functions and Airy functions, as in the Whittaker-Shannon interpolation formula and the Nyquist-Shannon sampling theorem. All of these functional decompositions have utility in different circumstances. The optical scientist having access to these various different representational forms has available a richer insight to the nature of these marvelous fields and their properties. Embrace these different ways of looking at the field, rather than viewing them as being in any way conflicting or contradictory.

Functional Decomposition and Eigenfunctions

The twin subjects of eigenfunction expansions and functional decomposition, both briefly alluded to in this Wikipedia article, are not completely independent. The eigenfunction expansions to certain linear operators defined over a given domain, will often yield a countably infinite set of orthogonal functions which will span that domain. Depending on the operator and the dimensionality (and shape, and boundary conditions) of its domain, many different types of functional decompositions are, in principle, possible.

See also

References

  • Goodman, Joseph (2005). Introduction to Fourier Optics (3rd ed, ed.). Roberts & Co Publishers. ISBN 0974707724.  or online here
  • Hecht, Eugene (1987). Optics (2nd ed. ed.). Addison Wesley. ISBN 0-201-11609-X. 
  • Wilson, Raymond (1995). Fourier Series and Optical Transform Techniques in Contemporary Optics. Wiley. ISBN 0471303577. 
  • Scott, Craig (1998). Introduction to Optics and Optical Imaging. Wiley. ISBN 0-7803-3440-X. 
  • Scott, Craig (1990). Modern Methods of Reflector Antenna Analysis and Design. Artech House. ISBN 0-89006-419-9. 
  • Scott, Craig (1989). The Spectral Domain Method in Electromagnetics. Artech House. ISBN 0-89006-349-4. 

-


 
 

 

Copyrights:

Photography Encyclopedia. The Oxford Companion to the Photograph. Copyright © 2005 by Oxford University Press. All rights reserved.  Read more
Wikipedia. This article is licensed under the Creative Commons Attribution/Share-Alike License. It uses material from the Wikipedia article "Fourier optics" Read more