
Chau Chemometrics From Basics to Wavelet Transform
.pdf
resolution enhancement |
209 |
Figure 5.41. The first- (left column) and second- (right column) order derivatives of the simulated noisy signal calculated by CWT with the Haar wavelet and the scale parameter a = 1 (a), 20 (b), and 50 (c).
Figure 5.42. Relationship between the relative increase of the peak width in the calculated derivative of a Gaussian peak and the value of the parameter a.

210 |
application of wavelet transform in chemistry |
Figure 5.43. An experimental photoacoustic spectrum (a) and its first- (b) and second-
(c) order derivatives calculated by CWT with a = 100 [From Fresenius J. Anal. Chem. 367:525--529 (2000)].
5.4.3.Comparison between Wavelet Transform and other
Numerical Differentiation Methods
Here, we compared the derivatives obtained using different methods, including numerical differentiation, the Fourier transform method, the Savitsky--Golay method, the DWT method, the DWT method with the improved WT algorithm, and the CWT method with a = 50. The firstorder derivatives of the above two simulated signals by these methods are shown in Figure 5.44, respectively. From the figure it can be seen that, for the signal without noise, all the results are almost the same. But for the noisy signal, only the last two methods can obtain smooth derivatives.
Table 5.4 lists the parameters describing these results in the left column of Figure 5.44. The definition of the peak width and peak position is depicted in Figure 5.44, curve (f). From the parameters in the table, we can find that all the methods give the same peak position, but the peak width is slightly different. Because it is difficult to determine the exact beginning and ending points of the peaks, there is a large error in the peak width in the table. But it is evident that the peak width in the results using the CWT method is comparatively greater than from the other methods.

resolution enhancement |
211 |
Figure 5.44. Comparison of the first derivatives of the clean (left column) and noisy (right column) simulated signals calculated by numerical differentiation (a), Fourier transform method (b), Savitsky--Golay method (c), DWT method (d), DWT method with the improved WT algorithm (e), and the CWT method (f).
Table 5.4. Comparison of Peak Position and Width of Calculated Derivatives by Different Derivative Calculation Methods
|
Gaussian peak |
|
Lorentzian peak |
|
Sigmoid peak |
|||
|
|
|
|
|
|
|
|
|
Method |
Position |
Width |
|
Position |
Width |
|
Position |
Width |
|
|
|
|
|
|
|
||
Simulated |
334 |
239 |
1000 |
596 |
1666 |
253 |
||
Numerical |
333 |
269 |
1000 |
381 |
1667 |
215 |
||
FT |
334 |
272 |
1000 |
370 |
1667 |
270 |
||
Savitsky--Golay |
168 × 2 |
135 × 2 |
500 × 2 |
199 × 2 |
834 × 2 |
125 × 2 |
||
DWT |
167 × 2 |
135 × 2 |
500 × 2 |
200 × 2 |
833 × 2 |
113 × 2 |
||
Improved DWT |
333 |
275 |
1000 |
381 |
1665 |
241 |
||
CWT |
334 |
330 |
1000 |
415 |
1667 |
318 |
||
|
|
|
|
|
|
|
|
|
212application of wavelet transform in chemistry
5.4.4.Resolution Enhancement
According to the characteristics of the WT, a signal can be decomposed into its contributions by the MRSD method to obtain discrete details dj and discrete approximations cj , which represent the different components of the signal at different frequencies. If an overlapping or low-resolution signal is decomposed into its contributions, there must be discrete details that represent the information at the frequency lower than noise and higher than the original signal. Therefore, it is not difficult to select a detail at medium scale to obtain the high-resolution information of the signal. Furthermore, if we amplify one or several of these selected details and then perform the inverse transform, that is, reconstruct the original signal with the amplified contributions, we can also obtain a signal with high resolution.
Therefore, there are generally two methods using WT for resolution enhancement of analytical signals:
1.Decompose the analyzing signal c0 into its approximations cj and details dj using Equations (5.32) and (5.33).
2.Inspect the cj and dj , then select either (a) a detail component as the resolved result for further studies (method A), or (b) one or more detail components that represent the high-resolution information of the analyzing signal (method B).
3.Multiply the selected dj by a factor k whose value is bigger than 1.0 and reconstruct the signal by using Equation (5.34) (method B).
Example 5.12: Resolution Enhancement of an Overlapping Chromatogram Using Method A. Curve (c0) in Figure 5.45 is an experimental chromatogram, and curves (d4), (d5), and (d6) are its detail components obtained by using the improved WT algorithm in Equations (5.32) and (5.33) with Symmlet (L = 4) wavelet at scale parameter j = 4, 5, and 6, respectively.
As discussed in the WT denoising section, the frequency of the detail components dj obtained by a WT decomposition decreases with the increase of the scale parameter j . The detail components at low scales are generally composed of noise. From Figure 5.45, it is clear that d4 is still composed mainly of noise, but d5 and d6 are composed of chromatographic information whose frequency is higher than that of noise and lower than that of the original signal. That is to say, the d5 and d6 are the desired highresolution part of the analytical signal. Therefore, in the detail components on medium scale, we can find one or more components which represent the high-resolution information of the analytical signal. If we further compare d5 and d6, it is easy to find that d5 is the better one for representing

resolution enhancement |
213 |
Figure 5.45. An experimental chromatogram (c0) and its detail components obtained by WT with j = 4(d4), 5(d5), and 6(d6).
the high-resolution information, which can be used for further studies, such as determination of peak position and quantitative calculation.
Computational Details of Example 5.12
1.Load the experimental chromatogram [Fig. 5.45 (c0)].
2.Extend the chromatographic data for avoiding the edge effect.
3.Make a wavelet filter---Symmlet4.
4.Set resolution level J = 6.
5.Perform WT to obtain the c and d components with the improved algorithm.
6.Display Figure 5.45.
Because this method can extract the high-resolution information from a low-resolution or overlapping analytical signal, a method for determination of the component number in overlapping chromatograms was proposed. Figure 5.46 shows four experimental chromatograms with different intensities. It is impossible to obtain the correct component number from such chromatograms. Figure 5.47 shows the d5 component of the four chromatograms in Figure 5.46 obtained with Symmlet wavelet filters (L = 4). The dotted line in the figure indicates the position of zero in the magnitude axis. From Figure 5.47, it is clear that there are five components in the chromatograms, and all four chromatograms of different magnitude give us the same result.

214 |
application of wavelet transform in chemistry |
Figure 5.46. Four experimental chromatograms with different amplitudes [from Chemometr. Intell. Lab. Syst. 43:147--155 (1998)].
Because the WT is a linear transform, the high-resolution information should be used for quantitative calculation. An example of quantitative determination of the components in overlapping chromatographic peaks was published in Analytical Chemistry, [69:1722--1725 (1997)]. Figure 5.48 shows the chromatograms of five mixed samples of benzene, methyl benzene, and ethyl benzene. It is difficult to perform quantitative calculation
Figure 5.47. The detail component d5 obtained from the four chromatograms in Figure 5.46 by WT decomposition [from Chemometr. Intell. Lab. Syst. 43:147--155 (1998)].

resolution enhancement |
215 |
Intensity
120000 |
|
|
|
|
|
100000 |
|
|
|
|
|
80000 |
|
|
|
|
|
60000 |
|
|
|
|
|
40000 |
|
|
|
|
|
20000 |
|
|
|
|
|
0 |
|
|
|
|
|
-20000 |
|
|
|
|
|
1.2 |
1.6 |
2.0 |
2.4 |
2.8 |
3.2 |
Retention Time / min
Figure 5.48. Experimental chromatograms of 5 three-component samples [from Anal. Chem. 69:1722--1725 (1997)].
|
200000 |
|
|
|
|
|
|
150000 |
|
|
|
|
|
|
100000 |
|
|
|
|
|
Intensity |
50000 |
|
|
|
|
|
0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
-50000 |
|
|
|
|
|
|
-100000 |
|
|
|
|
|
|
-150000 |
|
|
|
|
|
|
1.2 |
1.6 |
2.0 |
2.4 |
2.8 |
3.2 |
Retention Time / min
Figure 5.49. Wavelet coefficients d3 obtained from the five chromatograms in Figure 5.48 by WT decomposition [from Anal. Chem. 69:1722--1725 (1997)].

216 |
application of wavelet transform in chemistry |
Intensity
250000
200000
150000
100000
50000
0 |
|
|
|
|
|
-50000 |
|
|
|
|
|
1.2 |
1.6 |
2.0 |
2.4 |
2.8 |
3.2 |
Retention Time / min
Figure 5.50. Baseline-corrected wavelet coefficients after subtracting the estimated baseline by linking the minimum point at both sides of the peak in Figure 5.49 [from Anal. Chem. 69:1722--1725 (1997)].
of the components by directly using the chromatograms because of overlapping of the three peaks. Figure 5.49 shows the d3 component obtained by WT decomposition with Haar wavelet. It is clear that the information of the three peaks is resolved. In order to calculate the peak area, we can estimate a baseline by simply linking the minimum point at both sides of a peak. Figure 5.50 shows the results after subtracting such a baseline. Figure 5.51 shows the relationship between the area and the concentration. It can be seen that a very good linearity of the signals in the wavelet coefficients is kept.
Example 5.13: Resolution Enhancement of an Overlapping NMR Spectrum Using Method B. In Figure 5.52, spectrum (a) shows a simulated NMR spectrum by the Lorentzian equation in (5.77). From left to right, the peaks are doublet, triplet, quartet, and quintet. Spectrum (b) in the figure shows the reconstructed spectrum by multiplying the d1 and d2 by k1 = k2 = 55. Figure 5.53 shows the detail coefficients d1 to d4 obtained by WT decomposition of spectrum (a) with Symmlet (L = 4) wavelet filters. From Figure 5.53 it can be seen that, except for discrete detail d4, d1 through d3 all represent the resolved information of the peaks in the overlapping spectrum, but from d1 to d3 the resolution decreases. Therefore, if we amplify the details d1 and d2, and then perform reconstruction, the

resolution enhancement |
217 |
Peak Area
30000
(c)
25000
(b)
20000 |
|
15000 |
(a) |
|
10000
5000
0 |
|
|
|
|
|
|
0.0 |
2.0 |
4.0 |
6.0 |
8.0 |
10.0 |
12.0 |
Concentration / µl.ml-1
Figure 5.51. Calibration curves obtained from the peak area in Figure 5.50 and concentrations of the three components [from Anal. Chem. 69:1722--1725 (1997)].
resolution of the reconstructed spectrum will be improved. From spectrum
(b) in Figure 5.52, it is clear that all four groups of peak are well resolved.
Selection of the details undergoing amplification is generally by visual inspection on the decomposed details as shown in Figure 5.53. It will be
Figure 5.52. A simulated NMR spectrum (a) and the reconstructed spectrum (b) by multiplying the d1 and d2 by k1 = k2 = 55.

218 |
application of wavelet transform in chemistry |
Figure 5.53. Detail coefficients of the simulated NMR spectrum obtained by WT decomposition.
difficult to select the appropriate detail coefficients when the noise level in the original signal is significant because the noise will also be decomposed into those low-scale detail coefficients for amplification. Figure 5.54, curve
(a) shows a simulated noisy NMR spectrum. The detail coefficients are shown in Figure 5.55. It can be seen that the d1 and d2 coefficients are noisy. If we multiply d1 and d2 by k1 = k2 = 55 as we did above, the noise level of the reconstructed spectrum will be increased as well, as is shown in Figure 5.54, curve (b). In such cases, we can only multiply the d2 or d3
Figure 5.54. A simulated noisy NMR spectrum (a) and its reconstructed spectra by multiplying d1 and d2 with k1 = k2 = 55 (b), d2 with k2 = 60 (c), and d3 with k3 = 10 (d), respectively.