Добавил:

fench Опубликованный материал нарушает ваши авторские права? Сообщите нам.

Вуз:

Казанский национальный исследовательский технологический университет

Предмет:

Химия

Файл:

Chau Chemometrics From Basics to Wavelet Transform

.pdf

Скачиваний:

119

Добавлен:

15.08.2013

Размер:

2.71 Mб

Скачать

☆

<<< < Предыдущая 9 10 11 12 13 14 15 16 17 18 19 20 21 22 2324 / 3324 25 26 27 28 29 30 31 32 33 > Следующая >>>

resolution enhancement

219

Figure 5.55. Detail coefﬁcients of the simulated noisy NMR spectrum obtained by WT decomposition.

in order to avoid the effect of noise. Figure 5.54, curves (c) and (d) are obtained by multiplying d2 and d3, respectively, with k2 = 60 and k3 = 10. It is clear that the SNR of the results is improved.

Computational Details of Example 5.13

1.Generate the signal with 700 data points [Figure 5.52 (a)] using Lorentzian equations.

2.Make a wavelet ﬁlter---Symmlet4.

3.Set resolution level J = 4.

4.Perform WT to obtain the c and d components with the improved algorithm.

5.Perform reconstruction with multiplying the d1 and d2 with a factor 55.0.

6.Display Figures 5.52 and 5.53.

A more detailed discussion of Example 5.13 can be found in a paper published in Applied Spectroscopy [54:731--738 (2000)]. In this paper, an experimental NMR spectrum was also processed by the method. Figure 5.56 shows two enlarged parts of the experimental and the reconstructed spectra. It is clear that the resolution of the spectra is greatly improved by this method.

220	application of wavelet transform in chemistry

Figure 5.56. An experimental NMR spectrum (a) and the reconstructed spectrum by multiplying d1 and d2 with k1 = k2 = 10 (b).

5.4.5.Resolution Enhancement by Using Wavelet Packet Transform

WPT differs from WT with respect to the decomposition tree. In WT, only the approximation coefﬁcients on each scale are used for further decomposition, but in WPT, the further decomposition is applied to both the approximation and detail coefﬁcients. Therefore, for resolution enhancement, the resolving ability of WPT should be stronger than that of WT, because there will be more decomposed components representing the information with different frequencies. Consequently, it is easy for us to select a component that represents the desired high-resolution information.

The procedures for resolution enhancement of analytical signals are almost the same as in methods A and B proposed above. The only difference is to use the Equations (5.42)--(5.44) instead of Equations (5.32)-- (5.34), for decomposition and reconstruction computation.

Figure 5.57 shows the experimental chromatograms of six samples containing six rare-earth ions. Concentrations of the samples are listed in Table 5.5. In order to extract the chromatographic information of each component from the overlapping chromatograms in Figure 5.57, we can subject them to WPT decomposition and obtain all wjp ﬁrst. Then we can select a coefﬁcient component to represent the desired high-resolution information. Figure 5.58 shows the selected w63 coefﬁcients of the six chromatograms. Finally, we can estimate a baseline by linking the minimum point at both sides of every peak. After subtracting the baseline, we can obtain the results shown in Figure 5.59. It can be seen that all six peaks in the six chromatograms are well resolved except for the second peak in

resolution enhancement

221

Figure 5.57. Experimental chromatograms of mixed rare-earth solution samples.

chromatogram (d), which is slightly distorted. Using the resolved peaks, we can investigate the linearity of the extracted signals by examining the relationship between the peak area and concentration. Figure 5.60 shows the calibration curves of the six components. It can be seen that all six curves are satisfactory.

5.4.6. Comparison between Wavelet Transform and Fast Fourier Transform for Resolution Enhancement

As we have seen, deconvolution by Fourier transform can also be used for resolution enhancement of analytical signals. The procedures can be

Table 5.5. Compositions and Concentrations (ppm) of Mixed

Rare-Earth Solution Samples

Number	Lu	Yb	Tm	Er	Ho	Tb

1	15.0	10.0	30.0	25.0	38.0	21.2
2	25.0	30.0	15.0	18.0	22.0	21.2
3	40.0	12.0	25.0	15.0	15.0	21.2
4	16.0	38.0	40.0	18.0	26.0	21.2
5	10.0	20.0	35.0	32.0	24.0	21.2
6	34.0	34.0	16.0	30.0	35.0	21.2

222	application of wavelet transform in chemistry

Figure 5.58. WPT coefﬁcients w63 of the chromatograms in Figure 5.57 obtained by WPT decomposition.

summarized as follows:
1.	Transform the signal f (t ) to the Fourier domain: f (t ) fˆ(ω).
2.	ˆ	ˆ	ˆ	= ˆ	ˆ
	Multiply f (ω) with a window function h(ω) : g(ω)			f (ω)h(ω).

3.Perform inverse transform to obtain the deconvoluted signal: gˆ (ω) g(t ).

Figure 5.59. Baseline-corrected WPT coefﬁcients after subtracting the estimated baseline by linking the minimum point at both sides of the peak in Figure 5.58.

resolution enhancement

223

Figure 5.60. Calibration curves obtained from the peak area in Figure 5.59 and concentrations of each component in the samples.

Generally, for discrete signals, FFT is used for calculation, and the window functions include the box window, the triangle window, the Hanning window, and the Hamming window. In practice, the width and position of the window must be determined by trial and error, which is always time-consuming.

Figure 5.61 compares the results obtained by two FFT deconvolution methods and the WT method of the simulated and experimental NMR spectra in curves (a) of Figures 5.52 and 5.56, respectively. Curves (a1) and (b1) in Figure 5.61 are obtained by the FFT method on the basis of the abovementioned procedures, where the Hanning window is used, those coefﬁcients out of the window are cut off. Curves (a2) and (b2) are obtained by an improved FFT method based on the abovementioned procedures, where all the coefﬁcients remain but those coefﬁcients in the Hanning window are ampliﬁed by a factor k . It should be noted that many trials must be done in order to obtain satisfactory results by FFT methods, because we must select a suitable window width and position. For the second method, we must determine a suitable value for the parameter k . The (a3) and (b3) curves are obtained by the WT method in exactly the same way as curves

(b) in Figures 5.52 and 5.56. In this method, only the value of the parameter k should be optimized by trial and error.

From the results of the simulated spectrum, it can be seen that we cannot obtain a satisfactory baseline by the ﬁrst FFT method, because the low-frequency part of the signal is cut off. For the second FFT method and the WT method, the baseline will not change because all the information remain, and the resolution enhancement is caused by the increase of the

224	application of wavelet transform in chemistry

Figure 5.61. Comparison of the results from the simulated (a) and experimental (b) NMR spectra by FFT deconvolution (a1, b1, a2 and b2) and WT method (a3 and b3).

high-frequency part of the signal. We can easily control the resolution of the results by changing the value of the parameter k . Comparing curves (a2) and (a3) in Figure 5.61a, the improved FFT method is even superior to the WT method because there are positive sidelobes in (a3). From the results of the experimental spectrum, it can be seen that there is no signiﬁcant difference among the results of the three methods, and the resolution of (b3) is slightly better than that of curves (b1) and (b2) in Figure 5.61. Furthermore, it should be noted that we cannot expect to further enhance the resolution by using a higher value of k in the second FFT method because the negative sidelobes in (b2) will increase with the value of the parameter k .

combined techniques

225

5.5.COMBINED TECHNIQUES

By the applications of WT discussed above, it has been shown that WT is a powerful tool for compression, denoising, and resolution of analytical signals. In the following text, we will discuss the extended applications based on the combined techniques of WT and other chemometrics methods.

5.5.1. Combined Method for Regression and Calibration

Regression and calibration are the most commonly used techniques for quantitative determination in analytical chemistry. There is no difﬁculty in regression and calibration calculations for traditional analytical methods because only one variable is used. In modern analytical chemistry, spectroscopic methods are increasingly employed for quantitative determination. New methods such as multiple linear regression (MLR) for modeling multivariable datasets are needed. However, the dimensionality of spectral datasets is basically limited by the number of the objects studied, whereas the number of variables can easily reach a very large number. Furthermore, the high-dimensional spectral data are closely correlated and usually noisy. Therefore, methods more suitable for modeling correlated variables are proposed, such as the principal-component regression (PCR) and partial least-squares (PLS) methods.

Generally, all the information contained in the spectra can be used for the modeling; these are called full-spectrum methods. However, in many cases, a preprocessing of the experimental spectra using WT compression can offer some advantages compared to the full-spectrum methods.

A combined procedures of WT compression and PLS is illustrated in Figure 5.62, including the following steps:

1.The measured signals, denoted by X, such as spectra, are transformed into wavelet domain represented by wavelet coefﬁcients, W.

2.The matrix W is sorted according to their contribution to the data variance and a matrix Wsorted can be obtained. Because many wavelet coefﬁcients in W or Wsorted are usually very small, only a limited number of columns of Wsorted are needed to represent the signal X. Therefore, the Wsorted can be divided into two submatrices, Ws and Wn , containing signiﬁcant (information component) and insigniﬁcant (noisy component) coefﬁcients, respectively. This step can be skipped in many cases because the sorting will change the relative position of the coefﬁcients and, subsequently, cause a variation of the original information.

226	application of wavelet transform in chemistry

m×n

DWT

m×n

sorting

m×n

selection

and

m×1

m×n b

n×1

PLS

y	=	Ws		b
y	=	Ws			n′×1
					n′×1
	m×1		m×n′

m×n mean(B) std(B)

stability(B)=mean(B)/std(B)

Figure 5.62. A diagram showing the procedures of the PLS coupled with WT compression.

3.The submatrix Ws can be determined by different criteria:

3a. We may simply use the criteria discussed in the WT compression, but this means that only the advantage of WT compression is utilized.

3b. Other methods can also be employed for this purpose, such as the relevant component extraction (RCE) PLS approach described in Walczak’s book, Wavelets in Chemistry [12]. In this method, as illustrated in Figure 5.62, the PLS is employed to calculate the b coefﬁcients. A matrix of the regression coefﬁcients can be obtained by using the ‘‘leave one out’’ cross-validation procedure. Then, the stability of the regression coefﬁcient i , deﬁned by

Stability(bi ) =	mean(bi )	(5.81)
	std(bi )

combined techniques

227

can be calculated. Using the maximal stability of the noisy variables as a threshold,

Threshold = max (abs(stabilitynoise))

(5.82)

we can cut off those coefﬁcients in b and the corresponding wavelet coefﬁcients in the W or Wsorted.

4.With the submatrix Ws , build the PLS model from the training set.

5.Finally, we can use the model for prediction. It should be noted that the experimental data must be processed in the same way as that of the training set used to build the model. When you use step 3a for compression, you must compress the experimental data with the same criteria. When you use step 3b for determination of the Ws , you should keep those coefﬁcients at the same position.

This method has been successfully used in the analysis of NIR spectra of gasoline samples. Examples can be found in the book Wavelets in Chemistry [12].

5.5.2. Combined Method for Classiﬁcation and Pattern Recognition

Generally, pattern recognition refers to the ability to assign an object to one of several possible categories according to the values of some measured parameters, and the classiﬁcation is one of the principal goals of pattern recognition. Many methods have been proposed for classiﬁcation and pattern recognition because of their importance in chemical studies. Combined methods of WT for classiﬁcation and pattern recognition include two main steps: (1) compression or feature selection is performed to the original dataset using WT as a preprocessing technique; then (2) classiﬁcation or pattern recognition is performed by classiﬁers such as the artiﬁcial neural network (ANN), the soft independent modeling of class analogy (SIMCA), and the k th nearest neighbors (KNN), in the wavelet domain.

There have been several successful examples based on the combined method of WT and conventional classiﬁers for classiﬁcation of analytical signals. One of them is reported by Bos and Vrielink in Chemometrics and Intelligent Laboratory Systems, [23:115--122, (1994)]. In their report, identiﬁcation of monoand disubstituted benzenes utilizing WT and several classiﬁers from IR spectra was studied. The aim of their work is to show whether the localization property of WT in both position and scale can be used to extract this information into

228	application of wavelet transform in chemistry

a concentrated form to obtain the salient features of an IR spectrum effectively. The coefﬁcients obtained from the WT treatment of the IR spectrum were employed as inputs for an identiﬁcation process that is based on the linear or nonlinear neural network classiﬁers. Using the concentrated form instead of the full spectra, the time to develop the classiﬁers is greatly reduced. Moreover, it is expected that the quality of the classiﬁers will improve if they are derived from smaller datasets that contain all the relevant information. From their study, it is concluded that WT coupled with Daubechies wavelet functions is a feature extracting method that can successfully reduce IR spectral data by more than 20-fold with a signiﬁcant improvement in the classiﬁcation process.

Another example is reported by Collantes et al. in Analytical Chemistry, [69:1392--1397 (1997)]. They employed WPT for preprocessing of HPLC data and several classiﬁers as potential tools for pharmaceutical ﬁngerprinting pattern recognition. The HPLC data for each l-tryptophan sample was preprocessed by the Haar wavelet function in the WPT treatment. Then, the coefﬁcients thus obtained in the wavelet domain were sorted in descending order. A small portion of sorted coefﬁcients were fed as the inputs of the ANN, KNN, and SIMCA classiﬁers to classify the samples according to manufacturers. With this study, they concluded that WPT preprocessing provides a fast and efﬁcient way of encoding the chromatographic data into a highly reduced set of numerical inputs for the classiﬁcation models.

For purposes classiﬁcation and regression, an adaptive wavelet algorithm (AWA) using higher-multiplicity wavelets was proposed. Detailed descriptions of the method can be found in Chapter 8 and Chapter 18 of the book Wavelets in Chemistry [12]. The wavelet neural network (WNN) can also be used for classiﬁcation and pattern recognition, discussed in the following section.

5.5.3. Combined Method of Wavelet Transform and

Chemical Factor Analysis

There are several ways to combine WT and chemical factor analysis (CFA) for different purposes. In Section 5.3.5, for example, WT is used for background removal in order to obtain the correct rankmap of a data matrix.

Principal-component analysis (PCA) is an important and basic method in CFA. However, because the principal components (PCs) are calculated as the eigenvectors of the variance--covariance matrix, computation of the

<<< < Предыдущая 9 10 11 12 13 14 15 16 17 18 19 20 21 22 2324 / 3324 25 26 27 28 29 30 31 32 33 > Следующая >>>

Соседние файлы в предмете Химия

#
15.08.201321.36 Mб50Carey F.A. - Organic Chemistry (2004)(en).djvu
#
15.08.201321.36 Mб43Carey F.A. Advanced organic chemistry 5ed., MGH, 2004.djvu
#
15.08.201311.62 Mб29Carey F.A. Advanced organic chemistry. Part A structure and mechanisms 1938.djvu
#
15.08.20138.77 Mб20Carey F.A. Advanced organic chemistry. Part B reaction and synthesis 1938.djvu
#
15.08.20139.85 Mб27Cazes J., Scott R.P.W. Chromatography theory New York 2002.djvu
#
15.08.20132.71 Mб119Chau Chemometrics From Basics to Wavelet Transform.pdf
#
15.08.20135.81 Mб282Chemiluminescence in Analytical Chemistry.pdf
#
15.08.20133.98 Mб19Chen The electron capture detector.pdf
#
15.08.20133 Mб87Chivers T. - A Guide to Chalcogen-Nitrogen Chemistry (2005)(en).pdf
#
15.08.201332.51 Mб67Clayden J. - Organic chemistry (Oxford, 2000).pdf
#
15.08.20136.12 Mб50Cohen M.F., Wallace J.R. - Radiosity and realistic image synthesis (1995)(en).pdf