| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Biophys J, May 2001, p. 2455-2470, Vol. 80, No. 5
Institute for Hearing and Communication Research, Karolinska Institute, Karolinska Hospital, SE-171 76 Stockholm, Sweden
| |
ABSTRACT |
|---|
|
|
|---|
Deconvolution algorithms have proven very effective in conventional (wide-field) fluorescence microscopy. Their application to confocal microscopy is hampered, in biological experiments, by the presence of important levels of noise in the images and by the lack of a precise knowledge of the point spread function (PSF) of the system. We investigate the application of wavelet-based processing tools to deal with these problems, in particular wavelet denoising methods, which turn out to be very effective in application to three-dimensional confocal images. When used in combination with more classical deconvolution algorithms, these methods provide a robust and efficient restoration scheme allowing one to deal with difficult imaging conditions. To make our approach applicable in practical situations, we measured the PSF of a Biorad-MRC1024 confocal microscope under a large set of imaging conditions, including in situ acquisitions. As a specific biological application, we present several examples of restorations of three-dimensional confocal images acquired inside an intact preparation of the hearing organ. We also provide a quantitative assessment of the gain in quality achieved by wavelet-aided restorations over classical deconvolution schemes, based on a set of numerical experiments that we performed with test images.
| |
INTRODUCTION |
|---|
|
|
|---|
Thanks to its optical sectioning properties,
laser scanning confocal microscopy is an invaluable tool for the
visualization of complex biological structures in three dimensions (for
a general reference, see Pawley, 1995
). To take the example that will
be considered in this article, the mammalian hearing organ, the organ of Corti, is enclosed inside the cochlea, a tube-like structure divided
into different compartments, which in turn is embedded in the temporal
bone. Confocal microscopy has proven extremely useful to bypass the
difficulty of access inside the temporal bone and to study this system
under conditions close to normal living conditions (Flock et al., 1998
,
1999
; Ulfendahl et al., 2000
). The limited access inside a nearly
intact inner ear, however, constrains us to use suboptimal imaging
conditions. When focusing deep inside the organ of Corti, much of the
light is lost due to scattering by surrounding structures, and various
distortions may occur due to variations of the optical index inside the
specimen. A subsequent processing of the images is required to partly
remedy these problems.
Different image restoration methods have been designed over the years. A popular method is based on the application of deconvolution algorithms, allowing one to compensate for the blurring caused by out-of-focus contributions to the measured intensity at each pixel of the image. Such a blurring effect occurs in any imaging process to some extent, but it is particularly important to consider in biological microscopy, where the interesting details are often of a size comparable to the system's resolution. For this reason, deconvolution has become increasingly popular in fluorescence microscopy, with the advent of laser scanning systems and of personal computers capable of processing huge three-dimensional (3D) images.
Although deconvolution algorithms have proven very effective in
conventional (wide-field) fluorescence microscopy, their application to
biological confocal microscopy is hampered by two main difficulties. One is that images acquired in a confocal setting are quantum limited:
Each pixel receives only a limited and random number of photons per
unit time, with most of the out-of-focus light being rejected by the
small confocal aperture. As a consequence, noise problems are usually
important in confocal microscopy and must be treated with particular
care (Pawley, 1995
). A second, even more important difficulty is due to
the lack of a precise control on the imaging conditions in biological
experiments. When focusing light deep inside a complex specimen, one
can usually hope for only an approximate knowledge of the point spread
function (PSF) of the system, which is the crucial information needed
for deconvolution. These problems, together with a common belief that confocal images, being already deblurred, cannot be further improved, seem to have limited the use of deconvolution in confocal microscopy. This method is more commonly applied in conventional microscopy, where
intensity levels are significantly higher and accurate measurements of
the PSF are easier to obtain and more stable from one sample to another.
This article is concerned with the processing methods that can be used to improve on the limits imposed by the two problems mentioned above. We shall focus on methods based on wavelet analysis, in particular wavelet denoising algorithms, which appear to be most relevant to 3D fluorescence microscopy. We put an emphasis on their use in laser fluorescence confocal microscopy, but we stress that the processing tools that we describe are not restricted to this particular type of microscopy. They are in particular relevant to two-photon and to wide-field imaging systems as well. These methods are of real biological interest, because they allow one to obtain results of a quality that would not be possible without increasing the specimen exposure to radiation.
The paper is organized as follows. In the next section, we explain the
experimental and theoretical basis for deconvolution applied to
confocal imaging systems. We describe different ways to determine the
PSF of a confocal microscope and report on a thorough experimental
investigation that we have performed with a Biorad-MRC1024 confocal
microscope. We then review the Bayesian approach to deconvolution in
the form suitable for confocal microscopy. Such methods are effective
when the PSF of the system is perfectly known but appear to be
seriously limited when this is not the case. In the third section we
describe and discuss the application of wavelet denoising (Donoho and
Johnstone, 1994
), which can be used as an efficient regularization of
deconvolution and lead to significant improvement of the restorations
in practical situations. The important question of a quantitative
assessment of these methods is discussed in the fourth section, where
we report on a set of experiments performed on test images, clearly
demonstrating the improvement achieved by wavelet-aided restorations
over more classic methods. We finally conclude by pointing out some of
the issues remaining to be solved, together with some directions of
research that seem to us promising for the future.
| |
DECONVOLUTION FOR CONFOCAL MICROSCOPY |
|---|
|
|
|---|
Determination of the point spread function of a confocal microscope
Before one can use deconvolution algorithms, it is crucial to get a reasonably precise knowledge of the imaging system's PSF. One must make a distinction between the PSF measured under imaging conditions close to optimal with respect to the system's design, and the PSF measured in situ, i.e., under the imaging conditions of biological experiments. The former PSF is a reproducible characteristic of the microscope, which can be determined in principle from a precise knowledge of the system's optical setup. The latter PSF varies from sample to sample and should be measured on a case-by-case basis. We shall discuss mainly the optimal PSF, as it is much easier to obtain with current systems and it can be considered a reasonable working approximation.
By definition, the PSF of the system is the image that it produces of a
point-like source placed at the origin of sample space. Under an ideal
confocal setting, this image can be written as the product of two
wide-field PSFs, which can in turn be computed for an aberration-free
"aplanatic" lens using diffraction theory (Bertero et al., 1990
,
and references therein). This theoretical model is suitable for
describing the bulk PSF of a real confocal microscope, but it does not
allow one to capture details important for the sake of deconvolution,
such as asymmetries in the tails of the PSF. As these details depend on
many parameters, some of which are not easily accessible
experimentally, it is important to determine the PSF of a real confocal
microscope also by direct measurement.
We used a common experimental approach to measuring PSFs, which consists of imaging fluorescent microspheres of sub-resolution size. With a laser excitation wavelength around 500 nm, beads of 50-nm diameter would provide a safe approximation of point-like sources. However, the images obtained with such small beads have a very poor signal-to-noise ratio (SNR) unless the system is specially optimized. We obtained a better compromise with the Biorad-MRC1024 confocal microscope, by using micro-beads of ~100-nm diameter.
We performed several test experiments to determine the PSF for different sizes of the confocal aperture and under different imaging conditions. Our main results are reproduced in Figs. 1-4. Except for Fig. 4, the beads were immersed in an agarose gel preparation of moderate concentration. Details on our experimental procedure and on the image acquisition settings we used are given at the end of this section. (Further details may be obtained upon request to the authors.)
Fig. 1 shows maximum projections of the PSF and its intensity profile along the optical axis for a theoretical model assuming an ideal confocal setting (with no aberration) and for measurements performed in the 488/585LP excitation/emission filter mode of the system (see the Experimental Methods below for an explanation). Each measurement represents an average over 17-18 different bead images extracted from the raw acquisition, for a given value of the confocal pinhole radius.
|
We define the full-half-maximum-widths (FHMWs) of the PSF in the
standard way, as the intervals
x1/2,
y1/2, and
z1/2, (along the transverse focal
axes Ox, Oy, and the optical axis Oz,
respectively), where the PSF intensity is more than half of its maximum
value. Fig. 2 shows a plot of
x1/2,
y1/2, and
z1/2, measured as a function of the
depth of focus below the surface of the gel, keeping the same
acquisition settings at different depths. No significant variation of
these FHMWs is observed up to ~300 µm below the surface of the gel,
and the shape of the PSF was not varying significantly either. At a
depth of ~300 µm,
z1/2 displays
a significantly smaller value, but this is an artifact caused by a
movement of the gel during the acquisition. Due to this movement, the
PSF appeared to be abnormally tilted along the Ox direction,
which explains why
x1/2 appears
larger, whereas
y1/2 is not
significantly changed.
|
The most important properties of an imaging system as regards to
deconvolution are linearity and translation invariance. Linearity (the
requirement that a superposition of fluorophore distributions f1(x) + f2(x) should produce a
superposition of images
I1(x) + I2(x),
Ii(x) being the image of
fi(x), i = 1,2) is not easy to check experimentally because there is no obvious
way to superpose two different samples without disturbing them.
Although there are known effects limiting the validity of linearity in
fluorescence confocal microscopy (mostly absorption in thick specimens,
together with saturation and photodegradation of the fluorophore
(Pawley, 1995
)), it is generally assumed that this property holds to a good accuracy under normal working conditions. Most laser scanning systems used in biological applications operate in a partial or full
beam-scanning mode. In the Biorad system the specimen is physically
translated along the direction of the optical axis, while the focus
point is scanned throughout the focal plane with an arrangement of
movable mirrors that slightly deflect the laser beam from the optical
axis. Small spatial variations of the PSF are inevitably introduced in
this way. However, in most of our experiments we could assume
translation invariance to be verified within measurement errors.
Significant variations of the PSF occurred only at very high beam
deflections, which are rarely explored in practice. Together with the
results of Fig. 2, this observation implies that aberrations induced by
the objective were very small in our experiments, which is not
surprising in view of the high quality of the lens used.
Fig. 3 shows the behavior of the FHMWs of
the PSF as the pinhole radius is increased. From this figure and Fig. 1
it is clear that the experimental PSFs are neither symmetric with
respect to the focal plane, nor cylindrically symmetric around the
optical axis, as
x1/2 appears
systematically larger than
y1/2. We
observed this slight, but clear astigmatism in all our experiments,
regardless of excitation wavelength and emission filter used in the
acquisition. This effect is most certainly system dependent, and
probably reflects non-exact confocality of the excitation spot with
respect to the pinhole aperture, due to a slight misalignment of the
scanning mirrors of the system. Another possibility would be the
presence of slight aberrations or a misalignment of the objective lens. This possibility has to be ruled out, however, because the objective lens was removed and replaced several times during our experiments, and
once in place, it could be freely rotated around the optical axis. It
is thus clear that the position of the lens was not the same in all our
acquisitions; nevertheless, we always observed the same astigmatism.
|
Finally, Fig. 4 reproduces a limited set
of acquisitions obtained by imaging beads attached just above the
Reissner's membrane, inside an in vitro preparation of the hearing
organ (Ulfendahl et al., 1989
). Each PSF in this figure is from a
different bead, and the acquisition was performed with a large pinhole
aperture (6 mm) to produce an image with a reasonable SNR. One sees in this example that the shape of the PSF is significantly modified by the
presence of complex surrounding structures. In particular, the PSF
appears tilted along the Ox axis. This was not an artifact caused by a movement of the beads, as was checked by repeating the
acquisition several times at the same location. The measured FHMWs of
this PSF also appear significantly increased compared with the
corresponding optimal PSF. Some of the main features of Figs. 1 and 3
remain; in particular, the same astigmatism effect was observed, with
x1/2 being larger than
y1/2 with the same order of
discrepancy. Note that the translation invariance of the PSF is still
respected to a fairly good approximation in this example.
|
| |
EXPERIMENTAL METHODS |
|---|
|
|
|---|
Image acquisition and PSF measurements
The Biorad-MRC1024 confocal microscope utilizes a 15-mW krypton/argon laser providing excitation lines at 488 nm and 568 nm, which are transmitted selectively to the specimen through narrow-band (~10-nm) excitation filters. (The system also allows one to exploit the 647-nm line of the laser, but we did not use this line.) For collecting the fluorescence light, we used different spectral emission filters provided by Biorad: a long-pass filter transmitting all wavelengths above 585 nm (referred to as the 585LP filter), a band-pass filter transmitting within the interval 605 nm ± 16 nm (referred to as the 605DF32 filter), and occasionally a band-pass filter adjusted to the interval 522 nm ± 16 nm (referred to as the 522DF32 filter). The confocal aperture of the system consists of a circular pinhole of variable radius (ranging between 0.7 and 8 mm), placed in front of a detection photomultiplier (PMT). The size and shape of the PSF of a confocal microscope is primarily determined, for a given objective lens, by the spectrums of excitation and emission wavelengths and by the pinhole radius. Parameters such as the excitation laser intensity and the gain of the PMTs are important to optimize the SNR and dynamic range of the images, but under normal working conditions they do not affect the shape of the PSF significantly.
In experiments, the value of the pinhole radius is usually set by hand (together with the other parameters) to optimize the contrast of the images visually. The PSF of the system must therefore be measured for each excitation/emission filter mode and each value of the pinhole radius used in practice. To keep this feasible we restricted our acquisitions to a representative set of values ranging from 2 to 6 mm by increments of 0.5 mm. When investigating the effect of some other parameter than the pinhole radius, we used a medium radius of 4 mm, as representative of the most common imaging situation for our studies.
Our PSF measurements were performed with Molecular Probes (Eugene, OR)
T-8873 TransFluoSpheres, which are microspheres of diameter 0.1 µm,
coated with a dye whose excitation peak matches the 488-nm spectral
line of the argon laser and whose fluorescence emission is maximal
around 605 nm. The beads were imaged with a Zeiss Achroplan
40×NA0.75 water-immersion lens. A common experimental procedure for
PSF acquisition consists of imaging the beads on a glass slide. Because
this method introduces a significant background in the measurements due
to reflections by the glass slide, we used another approach where the
beads are immersed in an agarose gel of moderate concentration. This
allows one to avoid much of the background light and usually leads to
better measurements, without greatly complicating the procedure of
extraction of the PSF from the images. A slight mismatch exists in this
setting between the optical index of the gel and the design index of
the objective lens, which might cause aberration effects (Gibson and Lanni, 1991
; Hell et al., 1993
). These effects should, however, not be
significant with a water-immersion lens and if the gel is of reasonably
low concentration. For the type of dyes and samples studied in our
experiments, we found that the PSF did not depend much on the
excitation wavelength or the type of emission filter used. The main
parameter determining the size and shape of the PSF was the value of
the pinhole radius.
PSF extraction and averaging
Due to the quantum-limited nature of the light collected from a single bead, it is necessary to average the PSF over several different beads to get an accurate measurement. The averaging procedure we used involved four steps: 1) extracting a stack of single-bead images from the raw acquisition, 2) selecting out the bead images representing outliers, which should not be taken into account in the averaging (typical outliers are aggregates of several beads glued together, or couples of different, but close beads that were not separated by the extraction procedure), 3) aligning the selected beads together, and 4) computing the averaged PSF. All these steps may be performed in a large part automatically by using standard image-processing methods. The selection phase is the more involved, however, and it had to be carried out partly by visual inspection. Some care is needed in the alignment procedure, because the center of a given bead has typically non-integer pixel coordinates. (This problem is usually significant only along the optical axis where the resolution is coarsest.) Standard cubic-spline interpolation was used to match the centers of the different beads at half of the pixel resolution, the center of an interpolated bead being determined as the pixel of maximum spatial correlation with some fixed reference PSF. Step 4 is not entirely trivial, because after being selected and aligned properly, the different bead images still have different SNRs and should be weighted accordingly when taking the average. We used the technique described in the Appendix to estimate the SNR for each bead image. Essentially the same results were obtained by simply weighting the bead images by their root-mean-square values, in good agreement with the Poisson nature of the images' statistics. Depending on the beads' density inside the gel, the number of bead images contained in a typical acquisition (of size ~60 µm × 60 µm × 12 µm) ranged between a few dozens to several hundreds. Due to the selection step, however, the number of beads contributing to the final averaged PSF ranged typically between 10 and 30.
Imaging settings in Figs. 1-4
The results shown in this section correspond to 8-bit images acquired in the normal scanning mode of the Biorad system (corresponding to a pixel dwell time a little less than 4 µs), using 10% of the total excitation light intensity of the 15-mW krypton/argon laser, and a sixfold Kalman averaging. Except for Fig. 4, all our acquisitions had format 512 × 512 × 31 with a pixel size of 0.1247 µm × 0.1247 µm × 0.4 µm, corresponding to a 63.8-µm × 63.8-µm × 12.4-µm region scanned inside the agarose gel. The pixel size of the images shown in Fig. 1 has been divided by two, however, due to the interpolation involved in averaging the PSF. Fig. 4 was extracted from a 512 × 512 × 28 image of pixel size 0.075 µm × 0.075 µm × 0.6 µm, corresponding to a 38.4-µm × 38.4-µm × 16.8-µm region scanned inside the hearing organ preparation. The measurements of Fig. 1 correspond to identical imaging settings, except for the size of the pinhole radius, which are 2.5 mm, 4 mm, and 6 mm for the small, the medium, and the large aperture, respectively. We used the 488/585LP filter mode, with a PMT gain of 1100 V (this gain is the voltage applied across the dynode chain of the PMT, which can be set, on the Biorad system, between 0 and 1500 V). For the PSF measurements of Fig. 2, we used the 488/605DF32 filter mode, together with a pinhole opening of 4 mm, and a PMT gain of 1050 V. For the measurements shown in Fig. 3 we used the same settings as for Fig. 2, except for the pinhole radius varying from 2 to 6 mm by steps of 0.5 mm. Finally, the images of Fig. 4 were taken in the 488/585LP mode with a large pinhole aperture (7 mm) and a high PMT gain of 1450 V.
Bayesian approach to deconvolution
Commonly used deconvolution algorithms in 3D microscopy include
iterative methods, such as the Jansson-van Cittert algorithm (Jansson
et al., 1970
); algorithms based on Fourier filtering, such as the
Wiener filter method and classical least-squares regularization methods
based on the minimization of a Tikhonov functional (Tikhonov and
Arsenin, 1977
; Miller, 1970
); and algorithms based on Bayesian inference, the most popular being the Richardson-Lucy algorithm (Richardson, 1972
; Lucy, 1974
).
Bayesian deconvolution
Both from a theoretical and practical point of view, the
Bayesian approach appears the most appropriate for confocal microscopy; it can be based on a physically well founded model of the imaging process and leads to algorithms that are both efficient and easy to
implement. We shall briefly review the main ideas underlying this
method. First of all, the image restoration problem considered here can
be cast as a convolution I(x) = f
* p(x), where f(x) represents the local concentration of fluorophore inside the specimen, p(x) is the PSF of the system, and
I(x) is the light intensity collected in the
image. (In the following, we shall not make an explicit distinction
between a volume element (x,
x) of the sample and the corresponding volume element in image space. This amounts to
adopting for image space a coordinate system that is conjugate to the
one used for sample space, as is usual in microscopy.) This form of the
imaging transformation assumes that the system is linear and
translation invariant. The PSF is band-limited, so that inversion of
the imaging transformation is an ill-posed problem: direct Fourier
inversion leads to huge amplification of the low-level, high-frequency
components induced by noise in the image. As already pointed out, noise
is inevitable in confocal microscopy, and some regularization is needed
to avoid deconvolving the noise rather than the signal.
Bayesian restoration makes use of a stochastic model of the imaging
process, which specifies the probability
P(I|f) to obtain an image
I(x) given that the sample fluorophore
distribution is f(x). The conditional probability
P(f|I) of f given
I, also called the posterior probability distribution, is
then defined with Bayes' formula
P(f|I) = P(I|f)P(f)/P(I).
In this formula P(I) serves as a normalization
factor (the image I is fixed), and
P(f) is the unconditional, or prior, probability
distribution of the object f. A certain freedom is usually
left on the choice of P(f), but it is an
important ingredient of the approach, as it introduces a stochastic
constraint on the reconstructed object, which will provide
regularization. Once P(I|f) and
P(f) are specified, one is interested in deducing
some useful information about the object f from the
corresponding posterior probability. The maximum a posteriori (MAP)
estimate fMAP(x) is
obtained by maximization of P(f|I)
over all possible values of f, for a given image
I. Another natural estimate (more in the spirit of the
statistical physics of image restoration (Geman and Geman, 1984
; Pryce
and Bruce, 1995
)) is the posterior mean
E(f(x)|I), which is the
point-wise average of f(x) over the posterior
distribution P(f|I) (formally defined as the integral
E(f(x)|I) =
f(x)P(f|I)Df
as a function of x). For the purposes of image restoration,
the posterior mean has been recognized to be more natural than the MAP
estimate, and it often leads to significantly better results (Marroquin et al., 1987
; Pryce and Bruce, 1995
; Nishimori and Wong, 1999
). We
shall, however, restrict ourselves to the MAP estimate in the following. The main reason for this is that
fMAP(x) can be computed efficiently by using iterative optimization algorithms, whereas computing E(f(x)|I) involves
sampling from the posterior distribution and leads to computationally
much more demanding algorithms. Moreover, for the stochastic models
considered in this paper, it may be argued that the typical differences
between the MAP estimate and the posterior mean are not significant for
all practical purposes.
The Richardson-Lucy algorithm
In a laser fluorescence system, the light emission is well
approximated by a Poisson point process, where a given volume element
x of the specimen, centered at x, emits light
with an intensity proportional to
f(x)
x. The image is then also a
Poisson point process, the photon count I(x)
inside (x,
x) being on average proportional to
f * p(x)
x. In
other words, the likelihood in this model takes the form:
|
(1) |
|
(2) |

x),
and we assume that normalized units are taken where
x = 1 and
x
p(x) = 1. The expression f * p(x) can then be identified with the discrete convolution f * p(x) =
y
f(y)p(x
y),
taken as a valid approximation when the pixels are small enough. In
addition to its nice physical interpretation, the RL algorithm has a
number of robustness properties explaining its popularity in many
applications other than confocal microscopy. (It is, for example,
widely used in astronomy.) It is, however, a nonregularized algorithm,
so that taking the limit k
in Eq. 2 would not lead to
any meaningful result. It is desirable in practice to work with a
deconvolution algorithm that actually converges to something
meaningful. This can be done by choosing the prior distribution
P(f) so as to impose a smoothing constraint on
the restored image, as is described in the next paragraph.
Introducing a temperature
In the present case it turns out to be useful to impose a
maximum-entropy constraint on f. This amounts to taking
P(f) in the form P(f) = Const exp(TS(f)), where the entropy
functional S(f) is defined by
S(f) =

x f(x)(ln
f(x)
1), and T is a positive parameter interpreted as an effective temperature of the image. Besides
being empirically relevant (Gull and Daniell, 1978
), this choice of
P(f) is a natural one when the images are
constrained to be positive (Csiszàr, 1991
). Maximization of the
corresponding posterior distribution
P(f|I) can be performed iteratively
with a straightforward modification of the RL algorithm:
|
(3) |
for the normalization chosen here.) It is
intuitive that a given reasonable measure of the quality of the
restoration will be optimized for a certain temperature
Topt depending on the problem at hand.
Depending on the quality of the data (SNR of the image, accuracy of the
PSF used, fineness of the pixel sampling, etc.), more or less
regularization is needed. The better the quality, the lower the optimal
temperature should be. It is clearly important to dispose of
statistical tests of the restoration to perform this choice in a
systematic way. We shall return to this point in the fourth section.
| |
WAVELET DENOISING |
|---|
|
|
|---|
When we apply an iterative deconvolution algorithm to real 3D
images, artifacts eventually develop for several reasons: at some point
we begin to deconvolve the noise; we begin to see that we are using a
slightly wrong PSF; distortions associated with sampling problems
appear, etc. However these artifacts appear in different ways at
different scales within the image. In particular, the fine details of
an image are more sensitive to noise, mismatches in the PSF,
undersampling, etc., than the larger features. It should therefore be
advantageous to treat the different scales separately in the
restoration process. Wavelet analysis allows one in some sense to do
this, by providing efficient tools for analyzing and processing a
signal in a position- and scale-dependent way (for a recent general
reference, see Mallat, 1999
). Effective algorithms based on the
discrete wavelet transform (DWT) have been designed in recent years for
estimating a signal from blurred and noisy data (Donoho and Johnstone,
1994
; Donoho, 1995
; Starck and Bijaoui, 1994
). Although the field of
application of wavelet-based processing methods is enlarging rapidly,
to date their use in image restoration has mostly concerned 1D and 2D
situations, e.g., in the analysis of spectra (Fligge and Solanski,
1997
), in the restoration of astronomical images (Starck and Bijaoui,
1994
), or in application to medical images (Strömberg, 1997
). It
is quite clear, however, that these methods are of great potential also
to 3D microscopy and more generally as basic tools in the analysis of
complex biological structures containing details at many different
scales. Here we investigate the use of wavelet denoising algorithms in
application to deconvolution for confocal microscopy.
The discrete wavelet transform and wavelet denoising
A basic scheme for multiscale analysis consists in repeated
applications of two conjugate scaling transformations
I
L.I and I
H.I, where
L is a low-pass filter producing a coarse-grained version of
an image I(x), and H is high-pass and
is formally defined so that H.I(x) = I(x)
L.I(x).
Repeated applications of L and H allow one to
work at coarser and coarser resolutions, while keeping track of the
information when passing from one scale of resolution to the coarser
scales. The DWT allows one to implement the above general scheme in a
nonredundant way, meaning that the information extracted at different
scales and different positions represent independent components of the
image (in the same sense that, in a Fourier transform, independent
components are extracted from a signal at different frequencies).
Wavelet denoising (Donoho and Johnstone, 1994
) exploits the properties
of the DWT to recover a signal that has been degraded by the presence
of noise. This method consists of applying a DWT to an image,
performing some thresholding in the wavelet domain to retain only
significant wavelet coefficients, and applying the inverse DWT, thereby
producing a denoised image. Noise is present at all scales of the DWT
decomposition with essentially the same magnitude in absolute value. In
relative value, however, noise is mostly concentrated in the finest
scales where it can be easily thresholded out. (More precisely, when
the resolution scale is increased by a factor of l, the SNR
is increased by a factor of ld/2 for a
d-dimensional signal.) Significant data, including sharp structures in the image, typically involve sets of high-level wavelet
coefficients that are mostly unaffected by the thresholding. In effect,
the method produces an adaptive smoothing of the image in which noise
is strongly suppressed while sharp features are preserved. Moreover,
this is done in a computationally very efficient way.
An important body of results initiated by Donoho and Johnstone (1994)
and described in Mallat (1999)
shows that in many cases the
thresholding procedure described above greatly outperforms more
classical denoising methods based on linear filtering. This will be so
particularly if one can find a wavelet decomposition that provides a
well-adapted, or sparse, representation of the class of signals under
study, meaning that a given signal can be approximated accurately with
a small number of the building blocks entering in the decomposition. It
is still unknown how to design representations that are optimally
adapted to the complex geometry of typical images in two and higher
dimensions. We may thus expect that the performance of wavelet
denoising on such images is not yet the best of what is theoretically
possible. However, in practice this method often provides the state of
the art, both in effectiveness and in computational efficacy. In our experiments we used the SURE wavelet shrinkage procedure of Donoho et
al. (1995)
, which we describe in the Appendix. (All the codes used in
this article have been written in the Matlab language (The MathWorks,
Natick, MA) and are available from the first author.) Fig. 10 in the
next section shows the gain in SNR achieved by this method in a set of
numerical experiments performed on the test image of Fig. 9.
Application of wavelet denoising to deconvolution
The image restoration scheme that we investigate in this article
is based on applying wavelet denoising to the images before a more
classical deconvolution algorithm. This approach has several motivations. General multiscale approaches for solving inverse linear
problems have been proposed (Donoho, 1995
; see also Nowak and Kolaczyk,
2000
, which considers inverse problems in the Poisson statistics case).
The application of these methods to deconvolution is best understood
for 1D signals, wavelets being particularly effective in that case
(Mallat, 1999
). Significant progress has been made recently on the case
of 2D images (Candès and Donoho, 2000
); however, this case is
still in development, and the situation in 3D and higher dimensions
remains to our knowledge largely unexplored. More importantly for our
purposes, to date no systematic theory has been proposed to deal with
inverse problems for which the inverse kernel is known only
approximately, the situation routinely encountered in 3D microscopy.
This theoretical difficulty has an important practical consequence:
working with an approximate PSF makes it unavoidable to use iterative
deconvolution algorithms. For this reason, it is for the moment
difficult to propose a wavelet deconvolution scheme that is
computationally acceptable for processing huge 3D images. Wavelet
denoising, on the other hand, is better understood theoretically and
leads to more practical algorithms that achieve very reliable and
robust improvements. In view of this state of affairs, it seems to us
reasonable to separate the denoising part from the deconvolution part
of the restoration. This approach, although not definitive, has the
advantage of being immediately applicable and of providing significant
results. In effect, wavelet denoising acts as a very efficient
regularization scheme for deconvolution. This method does not allow one
to achieve a higher resolution in the sense that the frequency
bandwidth of the final restored image is not increased. (Achieving this would require using a finer pixel sampling and a more accurate PSF.)
Rather, it suppresses many of the artifacts that would be present at a
given step of deconvolution without denoising.
Figs.
5-8
show examples of applications of the above restoration scheme to
confocal images acquired inside an in vitro preparation of the mouse
inner ear (Le Calvez and Ulfendahl, 2000
). These images were acquired
with the Biorad-MRC1024 system, using the same objective lens (Zeiss
40×NA0.75 water immersion) as in the PSF experiments described in this
paper. Their pixel format, acquisition settings, and biological content
are indicated in the legends. To each image, the WD procedure was
applied, followed by a MAP deconvolution using the algorithm defined by
Eq. 3. The PSFs used for the different cases were determined from a
library of pre-measured PSFs obtained from bead acquisitions as
described above. (In each case, we used a PSF acquired for the same
excitation/emission filter mode and the same pinhole radius as for the
image.) Note that some of the images (e.g., the image of Fig. 5) are
presumably slightly undersampled along the optical axis. The possible
aliasing effects due to undersampling are usually avoided by filtering the analog image above the pixel sampling frequency during the digitalization (Mallat, 1999
). This is done at the level of the detector and amounts to imposing that the array of sensors defining the
physical pixels does not leave holes in space. Performing deconvolutions of slightly undersampled, but properly filtered images
is perfectly sensible; however, this comes at the price of a loss of
resolution in the restoration. Despite this, and the fact that our
measured PSFs can at best be considered as estimates of the in situ
PSFs, our restorations lead to a significant increase of detail. The
wavelet denoising regularization proves very effective in each case, as
the corresponding deconvolutions performed with maximum-entropy
regularization alone (not shown) lead to visually much poorer
restorations. This will be confirmed on a more quantitative basis in
the next section.
|
|
|
|
Quantitative assessment of image restoration:experiments
An important problem in image restoration is to assess quantitatively the degree of restoration achieved with a deconvolution algorithm. One would like to be able to measure the gain in resolution and reliability in the restored image, as compared with the original acquisition, and to decide what is the best restoration feasible with a given algorithm. The main obstacle to doing this is that the degraded (blurred and noisy) image is the only information available, together with an approximate knowledge of the imaging system's PSF. There is no obvious way of measuring the quality of a proposed restoration unless we have already a strong a priori knowledge of the solution, which we rarely have. To date the only way around this problem has been to perform experiments on images blurred with a known realistic model of the imaging system. The study of such artificial cases often leads to insights that turn out to be very useful in practice to decide what are the optimal parameters (optimal temperature, number of iterations, etc.) under given circumstances.
We investigated these questions using a set of test images such as the one shown in Fig. 9. Fig. 9 A represents sections through a 3D sample of pixel format X × Y × Z = 64 × 64 × 16, taken as the undegraded image f(x). The imaging process of a confocal microscope is simulated by writing an image in the form In(x) = I0(x) + n(x), where I0(x) = f * p(x) is obtained by blurring the test sample with a known PSF p(x) (see Fig. 11) and n(x) is a noisy component simulating Poisson statistics. In other words, for each pixel x, In(x) is a Poisson random variable of mean value I0(x). Fig. 9, B and C, shows sections of I0(x) and In(x), respectively. Fig. 9 D shows the result of applying wavelet denoising to In(x) and should be compared with Fig. 9 B.
|
Effectiveness of wavelet denoising
Fig. 10 shows the results of a set
of denoising experiments performed on the degraded test image (cf. Fig.
9 C) at various levels of noise. The wavelet denoising
procedure we have applied is described in the Appendix. For each level
of noise, the gain in SNR of the denoising was measured as the ratio
(S/N)WD/(S/N), where S/N and
(S/N)WD denote the SNRs of
the image before and after denoising, respectively. Fig. 10
A shows a line-plot of the gain against
S/N for a representative set of
S/N values. It is clear from Fig. 10 A
that the effectiveness of wavelet denoising decreases as
S/N increases, in the sense that the gain
decreases (from a value of ~9.6 at S/N = 1 (0 dB) to a value of ~2.9 at S/N
29.5 (30 dB)). In other words, lowering the level of noise makes it more
difficult to distinguish from the signal, which is not surprising. The
behavior of the quality factor Q2 = 1
(S/N)/(S/N)WD
is also of interest. (Up to a normalization factor, the quantity
(1
Q2)2
corresponds to the mean-square risk used in the statistical
literature.) As shown in Fig. 10 B,
Q2 has a near-linear behavior at small
S/N. Linear extrapolation leads to
Q2
0.9 at
S/N = 0, which corresponds to a maximum gain
of ~10. We point out, however, that this extrapolation breaks down at
very small S/N and must be considered valid in the present case only for S/N larger than
~0.5 (see the remarks at the end of the Appendix).
|
Experiments with the MAP estimate
Figs. 12 and
13 and Table
1 reproduce the results of experiments
combining wavelet denoising (WD) and Bayesian deconvolution. Figs. 12
and 13 correspond to two different sets of restorations, applied again
to the test image of Fig. 9, at different levels of noise, namely, for
S/N = 4.3, 6.2, 10.7, and 13.8. For the first set (Fig. 12), the correct PSF p(x) was
used, whereas for the second set (Fig. 13) we used a slightly different
function papp(x) (see Fig.
11). For each value of
S/N we applied the MAP deconvolution algorithm
defined by Eq. 3 to A) the nonprocessed image
In(x) (see Fig. 9
C) and B) the image obtained by applying WD to
In(x) (see Fig. 9
D). We shall refer to the restored images obtained in cases
A and B as the MAP estimate and the WD+MAP
estimate, respectively. In all cases, the MAP or WD+MAP estimate was
obtained for a representative set of temperatures around the optimum
Topt. Convergence of the deconvolution
algorithm was checked by measuring the ratio
k
x(fk+1(x)
fk(x))2/
x
fk(x)2)1/2
at each iteration k. The condition
k < 10
4 was used as a
stopping criterion. We assessed the performance of the restoration by
measuring the following root-mean-square quality ratio as a function of
T:
|
(4) |
|
|
|
|
|
Wavelet denoising versus maximum-entropy
The nonregularized Richardson-Lucy algorithm (obtained by setting T = 0 in Eq. 3) does not converge to a stable result, but it does lead to a stable improvement during the first few iterations. In particular, after a certain number of iterations, the quality ratio Q2 defined by Eq. 4 (replacing fT(x) by the RL estimate) will achieve a maximum value, so that it makes sense to speak of the best RL iteration in the mean-square sense. This is illustrated in Fig. 15, which shows the evolution of Q2 as a function of the number of iterations for RL iterations (A) and the estimate obtained by applying WD before RL iterations (B; which we refer to as the WD+RL estimate). We see that the WD+RL estimate allows one to achieve a significantly higher Q2 than RL iterations alone (the difference being quite dramatic at low SNR), and it leads also to a more stable result, in the sense that Q2 has a much less pronounced maximum after WD.
|
It is of interest to compare the mean-square performance of the four
estimates defined above (namely, the MAP, WD+MAP, RL, and WD+RL
estimates). This comparison is made in Table 1, where the quality
ratios achieved by the different estimates in the various cases are
regrouped. We see that when applied without denoising, the
maximum-entropy constraint enhances the quality of the restoration.
More precisely, the quality ratio of the best RL iteration is
significantly outperformed by that of the optimal MAP estimate (at
least when one is not using a too wrong PSF at a too high SNR). In
contrast, the WD+RL estimate appears to achieve a slightly higher
quality ratio than does the WD+MAP estimate. The above comparison
indicates that what limits the quality of the restoration after WD is
not the residual level of noise remaining in the image (note that this
noise is no longer Poissonian) but rather other factors to which the
maximum-entropy constraint is much less relevant. The level of accuracy
of the PSF measurement is probably the most limiting factor, but other
factors such as the fineness of pixel sampling and boundary problems
may also significantly affect the optimal quality of the restoration.
Note that in most of the cases considered, the gain achieved by the WD+RL estimate over the MAP estimate is significant. This clearly supports the claim that WD provides a more effective (and more adaptive
(Donoho et al., 1995
)) regularization than does the maximum-entropy constraint. Note finally that, even though the WD+RL estimate seems to
do a better job than the WD+MAP estimate, the differences are not
dramatic, and in fact the two methods lead to nearly equivalent results
visually. We find the WD+MAP method more convenient, because it appears
to us to be easier in practice to optimize a Bayesian parameter, such
as the temperature, than to determine an optimal iteration number.
Choice of the restoration temperature
Some statistical criterion should ideally be used to check the
reliability of the restored estimate and to optimize the different restoration parameters in a systematic way. There is a natural
2 function for our problem, defined to be the
sum of squares of the residuals (Hunt, 1973
):
|
(5) |
2 stands for the overall noise
variance, defined by
2 = 1/Npix
xVar(In(x)),
which can be accurately estimated from the data (as described in the
Appendix). An ideal, error-free estimate for f would
correspond to a value of
2 equal to the total
number Npix of pixels in the image, up
to a fluctuation of order
O(
2
Npix as a criterion to choose the
value of T. This choice is somewhat naive, as it does not
take into account the fact that fT(x) depends upon
In(x) in a deterministic
way. Although this criterion leads to a significant oversmoothing in
the context of least-squares regularization (Galatsanos and
Katsaggelos, 1992
2 as a
function of T for our different sets of restorations. Some remarks are in order. First, the maximum-entropy constraint introduces a slight bias on the MAP estimate
fT(x), which must be
compensated for when measuring
2: with our
normalization, the average intensity
fT(x)
= N
x fT(x) is slightly compressed
toward the value 1. Ideally, the deconvolved estimate should have the
same average intensity as the original image. It is not difficult to
check that this is indeed the case at T = 0. In Figs.
11 C and D, and 12, C and
D, we therefore computed
2 not for
fT(x) but for the adjusted
estimate fT(x)
fT(x)
+
In(x)
. Second,
although the determination of Topt
appears to be pretty accurate for the bare MAP estimate, it is less
accurate after WD has been applied. The temperature that is selected
for the WD+MAP estimate leads to an oversmoothed estimate. However, in
practice the above criterion has the advantage of being simple to use,
and it provides a useful check of the solution.
| |
CONCLUDING REMARKS |
|---|
|
|
|---|
It is clear from the previous sections that the two problems discussed in the Introduction concerning the application of deconvolution to confocal microscopy (inevitable presence of noise in the image and limited knowledge of the PSF) do not prevent one from achieving significant restorations. These two problems can be reduced by combining appropriate processing methods. Bayesian inference offers a powerful approach to deconvolution based on a physically sound model of the microscope's imaging process. In this context, the use of a maximum-entropy constraint provides an effective regularization. Maximum-entropy has its limits, however, both theoretical and