Добавил:
Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:

Kluwer - Handbook of Biomedical Image Analysis Vol

.2.pdf
Скачиваний:
102
Добавлен:
10.08.2013
Размер:
25.84 Mб
Скачать

A Knowledge-Based Scheme for Digital Mammography

613

Table 11.4: Mean percentage improvement in segmentation performance by using the target-enhancement method compared to the segmentation of the unenhanced image for each mammogram grouped by breast type

Type

TPT

SUBTPT

Total

 

 

 

 

1

0.76

1.25

0.83

2

1.28

0.64

1.03

3

0.80

4.00

1.43

4

1.47

6.50

2.25

Mean

1.07

3.09

1.38

 

 

 

 

methods for fatty breasts but is noticeably less effective in the segmentation of dense breasts (types 3 and 4).

11.3.2.2.5Mean Percentage of Target Mass Detected as TPT or SUBTPT . Using the target contrast enhancement expert, Table 11.4 tabulates the mean percentage improvement in segmentation performance compared with the segmentation of the unenhanced image for lesion ground-truth detected with outcomes TPT and SUBTPT for all mammograms, grouped by breast type. The greatest improvement can be seen in the segmentation of the densest breast type using the target contrast enhancement method, compared to the segmentation obtained from the unenhanced original (e.g., on average, for breast type 1, using target enhancement method result in 83% improvement in segmentation compared to the unenhanced original).

11.3.2.2.6A Classical Solution in Choosing a Single Optimal Contrast Enhancement Method for CAD. In analyzing the 166 selected mammograms, a common approach to identify a single optimal contrast enhancement for use in CAD is to evaluate a selection of enhancement methods on a range of different training mammograms from different breast types. By determining the mean value of T P T + SUBTPT for all enhancement methods across all mammograms, the CAD researcher can choose to select the single method maximizing the value of TPT + SUBTPT from the training set. Table 11.5 lists the

614

Singh and Bovis

Table 11.5: Mean percentage improvement in segmentation performance for each contrast enhancement method compared with the segmentation of the enhanced image, grouped by breast type

Type

TPT

SUBTPT

Total

 

 

 

 

(a) ACE

−0.24

 

−0.21

1

0.00

2

0.17

−0.55

−0.10

3

0.10

−0.25

0.09

4

−0.07

0.00

0.00

Mean

0.01

0.20

0.06

(b) DWCE

 

 

 

1

0.12

0.00

0.10

2

−0.22

−0.27

−0.21

3

0.05

0.25

0.13

4

−0.13

1.00

0.06

Mean

0.05

0.24

0.02

(c) FUZZY

−0.12

 

 

1

2.50

0.24

2

0.17

1.00

0.48

3

0.05

5.50

1.04

4

0.33

8.00

1.38

Mean

0.11

4.25

0.79

(d) HISTEQ

−0.28

 

−0.24

1

0.00

2

0.50

−0.55

0.07

3

0.00

−0.50

−0.04

4

0.20

1.00

0.38

Mean

0.11

0.01

0.04

(e) ACELE

−0.24

 

−0.10

1

0.75

2

0.06

−0.45

−0.17

3

0.10

−0.25

0.09

4

−0.20

0.50

−0.06

Mean

0.07

0.14

0.06

(f) ACELFD

−0.24

 

−0.17

1

0.25

2

−0.11

−0.55

−0.28

3

0.05

0.25

0.09

4

−0.07

0.50

0.06

Mean

0.09

0.11

0.07

A Knowledge-Based Scheme for Digital Mammography

615

percentage improvement in segmentation compared with the original segmentation, in the detection of ground truth defined lesions detected with outcomes TPT and SUBTPT obtained using each individual enhancement method. From the data obtained, in identifying the single best enhancement method, the FUZZY method (Table 11.5, part c) is chosen for each breast type. Notice that the segmentation performance obtained using the FUZZY method shown in Table 11.5 (part c) for all mammograms, is suboptimal compared with that of the target contrast enhancement for each mammogram shown in Table 11.4.

11.3.3 Identifying Input Mapping Features

To implement a knowledge-based contrast enhancement component to learn the target enhancement for a given mammogram, a supervised learning paradigm is employed. By utilizing pattern recognition tools, a classifier can learn from a set of example mammograms the target contrast enhancement. For an unseen testing mammogram, the trained classifier will accurately predict the actual enhancement that maximizes segmentation performance.

During training the classifier learns a mapping between a characteristic of an example training mammogram and the target enhancement method. To facilitate this mapping, features are extracted to characterize the training and testing mammograms. Two different approaches to feature extraction are described:

(1) feature extraction from a ROI and (2) feature extraction from a breast profile.

11.3.3.1 Gray-Scale Features Extracted from a Suspicious ROI

This approach to feature extraction extracts a set of F features from pixels comprising a suspicious ROI target, T , thus FROI = { f1, f2, . . . , fF }. A surrounding region labelled background, B, of the same area is constructed encircling the ROI, but comprising normal pixels. From pixels that comprise the target (T ) and background (B) region, the following gray-level statistics are extracted: mean, standard deviation, entropy, skewness, and kurtosis. These are transformed into feature values by determining the ratio of the target value to background value (T /B) for each gray-scale statistic. The features reflect the mathematical composition of the quantitative measure of contrast enhancement previously proposed.

616

Singh and Bovis

11.3.3.2 Gray-Scale Features Extracted from the Breast Profile

An alternative approach to feature extraction is based on the gray scales that comprise a breast profile. A method for the construction of a segmented breast profile is described in [19]. A number of gray-scale features are extracted from the mammograms, including co-occurrence matrix based features, Fourier Transform-spectral energy based features, Law’s mask features, discrete wavelet transform features, statistical features, circularity (shape) feature and fractal dimension feature. These are described in detail in [26]. Altogether this gives a set of 316 features FBP316 = ( f1, f2, . . . , f316) that are extracted from the breast profile to characterize a mammogram. In addition, the application of principal component analysis (PCA) (a mechanism for dimensionality reduction) results in a 26-dimensional feature space FBP26 = ( f1, f2, . . . , f26). Both features sets

{FBP316 , FBP26 ) are evaluated as mapping features in the learning of the expert contrast enhancement.

11.3.4Strategies for Learning the Contrast Enhancement Experts

To train a knowledge-based contrast enhancement component, the mapping between the input gray-scale feature vectors discussed in the previous section and the target method indicated by the quantitative measure of segmentation from Table 11.2 is learnt. This section gives an overview of two strategies to learn the target enhancement mapping:

Double network mapping (DNM): This strategy adopts a divide-and- conqueror paradigm. It attempts to decompose a single mapping into two simpler mappings. The first mapping to be learnt between the features from ROI and the three quantitative measure of enhancement performance proposed in section 11.3.1. A second process learns the mapping of the quantitative measure of enhancement with quantitative measure of segmentation. On testing this strategy will predict a measure of segmentation for each contrast enhancement method, and the actual contrast enhancement method is identified as the one maximizing the segmentation performance.

Breast profile mapping (BPM): This strategy differs in that the solution adopts a classification-based approach and aims to learn the mapping of feature set FBP extracted from the complete breast image with a label of the target

A Knowledge-Based Scheme for Digital Mammography

617

enhancement. On test, a single contrast enhancement method is predicted. Each strategy is described in detail in the following sections.

11.3.4.1 Double Network Mapping Overview

The double network mapping (DNM) method is used to predict the target contrast enhancement using two ANNs for each enhancement method. The aim is to learn a mapping based on a set of gray-scale features FROI from a given mammogram, with a quantitative measure of segmentation performance, S. The segmentation performance is quantified following contrast enhancement, for each enhancement method m, where 1 ≤ m M from a set of M enhancement methods. The two submappings are detailed below:

1.ANNmDNMenh: For a mammogram I, enhanced using enhancement method m, this ANN learns the mapping between the set of F gray-scale input features

FROI = ( f1, f2, . . . , fF ) extracted from a suspicious ROI, and a set of P quantitative measures Q = (q1, q2, . . . , qP ) of enhancement performance as described previously in section 11.3.1.

2.ANNmDNMseg: For a mammogram I, enhanced using enhancement method m, the ANN learns the mapping between the set of quantitative measure

Q = (q1, q2, . . . , qP ) of enhancement performance and a set of R measures quantifying the performance of lesion segmentation S = (S1, S2, . . . , SP ) identified in Table 11.2.

A diagrammatic overview of the mappings learnt is given in Fig. 11.6 and the training and testing phases are described in more detail below. To evaluate the strategy, a firefold cross-validation approach is used to reduce bias and ensure that a test result is produced for each mammogram image.

11.3.4.1.1 Training the DNM Approach. Using this strategy, ANNmDNMenh and ANNmDNMseg are trained independently for each enhancement method, Em where m {1, . . . , M}. For a training image, a border comprising normal pixels of the same areas as the target ROI is constructed around it. The set of grayscale input features FROI are extracted from the target ROI and background regions as described in section 11.3.3.1. Each training mammogram is contrast enhanced with each method and a set of quantitative measures of enhancement

618

Singh and Bovis

Figure 11.6: Diagrammatic overview of the DNM strategy.

Q are calculated from the target ROI and border. Thus ANNmDNMenh learns the mappings:

ANNDNMenhm

(11.11)

FRO I −→

Q m = {1, . . . , M}

Similarly for enhancement method Em to train ANNmDNMseg, the set of quantitative measures of enhancement Q are used as input features in learning the mapping with the set of quantitative measures of segmentation, S. Thus ANNmDNMseg learns the mappings:

ANNDNMsegm

(11.12)

Q −→

S m = {1, . . . , M}

11.3.4.1.2 Testing the DNM Approach. The first step in determining the optimal contrast enhancement method for a test mammogram I is to locate a suspicious ROI. To do this, the HMRFU segmentation algorithm is used to segment the test image and it results in several candidate regions. Regions with a

Euler number > 1 (i.e, enclose a smaller region totally) are removed and from the remaining regions, the most likely suspicious regions are selected on the basis of area and morphological tests using a previously trained ANN. For the single suspicious ROI identified, a surrounding border is constructed of equal area, and the set of input gray-scale features FROI are extracted. These are used as inputs to ANNmDNMenh for each enhancement method Em where m {1, . . . , M}. The output of these networks is then supplied as input to ANNmDNMseg for each enhancement

A Knowledge-Based Scheme for Digital Mammography

619

Table 11.6: Results from using optimized strategy DNM showing the mean percentage improvement in segmentation performance compared with the segmentation of the unenhanced original mammogram

Type

TPA

SUBTPA

Total

 

µ

µ

 

 

 

 

 

1

−0.16

1.75

0.10

2

0.28

−0.09

0.14

3

0.20

1.00

0.39

4

0.00

2.00

0.31

Mean

0.08

1.16

0.24

 

 

 

 

method m {1, . . . , M}. The mth AN NDNMsegm maximising the value for the sum of T P and SU BT P outcomes is predicted as the optimal enhancement method for a test mammogram I, thus

assign Em m if AN NDNMsegm

= argmaxmM=1 AN NDNMsegm for m = {1, . . . , M}

 

(11.13)

11.3.4.1.3 Model Order Selection. Optimization of the model order for

AN NDNMsegm and AN NDNMsegm for each enhancement method Em, where m {1, . . . , M}, is performed independently by varying the number of hidden nodes between 2 and 30. The mean squared error (MSE) resulting for each model on test using fivefold cross validation is minimized. The optimal number of hidden nodes selected is the one which minimizes the MSE over all configurations of hidden nodes.

11.3.4.1.4 DNM Framework Results. Table 11.6 shows the mean percentage improvement in segmentation performance compared with the segmentation of the unenhanced original image, using the predicted expert enhancement for each breast type. The DNM strategy results are significantly poorer than those obtained using the target expert contrast enhancement methods reported in Table 11.4. They are also inferior to the use of the FUZZY method on all breasts as shown in Table 11.5 (part c).

620

Singh and Bovis

11.3.4.2 Breast Profile Mapping—Overview

The second strategy used for learning the expert contrast enhancement for a mammogram is the breast profile mapping (BPM) strategy. For a mammogram

I, enhanced using enhancement method Em, where m {1, . . . , M}, the BPM strategy learns the mapping between the set of N gray-scale input features FBPN detailed in section 11.3.3.1, and a l = {1., . . . , L} indicates the target contrast enhancement for a training mammogram. Both feature sets {FBP316 , {FBP26 ) are evaluated separately in their utility for learning the expert contrast enhancement. The expert l is based on a set of R measures quantifying the performance of lesion segmentation S = {s1, s2, . . . , sR} described in Table 11.2. The expert l is identified as the one maximising the sum of TPT and SUBTPT outcomes for each enhancement method Em where m {1, . . . , M} as defined previously in Eq. (11.10).

Unlike the DNM strategy, this method utilizes a single classifier to predict the target contrast enhancement method. The k-nearest neighbor (k- NN) classifier has been show to be effective at learning nonparametric mappings with a small sample size [27] and for this reason it is employed in the knowledge-based contrast enhancement expert. To evaluate the strategy a fivefold cross validation is used to reduce bias and provide a test decision for each mammogram.

11.3.4.2.1 Training the BPM Approach. To train the BPM strategy, the set of gray-scale input features FBPN = ( f1, f2, . . . , fN ), where N identifies the original and PCA feature sets (N = {316, 26}), are extracted from the segmented breast profile. Each training mammogram is contrast enhanced with each enhancement method. The quantitative measures of segmentation are calculated for the target ROI. For each enhancement method, the winning predicted enhancement method identified by the label l is used to learn the mapping between

F and l with the k-NN classifier.

11.3.4.2.2 Testing the BPM Approach. To determine the predicted target enhancement method El for a test mammogram I, the set of gray-scale input features FBPN are extracted from the segmented breast profile. Using the trained k-NN classifier, the predicted actual expert contrast enhancement is determined.

A Knowledge-Based Scheme for Digital Mammography

621

Table 11.7: Percentage improvement in segmenting an unenhanced mammogram compared to that obtained when segmenting the image enhanced using the predicted enhancement method from the optimized BPM strategy based on feature set FBP316

Type

TPA

SUBTPA

Total

 

µ

µ

 

 

 

 

 

1

−0.16

1.50

0.07

2

0.11

0.91

0.41

3

0.10

4.75

0.96

4

0.47

7.50

1.44

Mean

0.13

3.66

0.72

 

 

 

 

11.3.4.2.3Model Order Selection. In order that the BPM strategy is to perform optimally, the number of nearest neighbors k must be correctly set. For each input feature set FBP316 and FBP26 for different values of k the validation set error is plotted and the value of k corresponding to the least error is chosen.

11.3.4.2.4BPM Framework Results

1.Feature set FBP316 : Using an optimized value of k = 23, Table 11.7 shows the percentage improvement in segmentation performance when using the predicated actual enhancement method, compared with that obtained with the unenhanced original from the FBP316 set. These results show that the segmentation improvement obtained over the unenhanced image, when segmenting an image enhanced using a enhancement method predicted by the BPM strategy, is greater than that obtained using the DNM strategy predicted enhancement method. However, segmenting the BPM strategy’s predicted enhanced image results in inferior performance to that using the target enhancement method identified in Table 11.4. The result for breast type 4, the densest breast type, shows a small improvement over using the FUZZY method, shown in Table 5 (part c), for all mammograms of that type.

2.Feature set FBP26 : Using an optimized value of k = 19, Table 11.8 shows the percentage improvement in segmenting the unenhanced image compared

622

Singh and Bovis

Table 11.8: Percentage improvement in segmenting an unenhanced mammogram compared to that obtained when segmenting the image enhanced using the predicted enhancement method from the optimized BPM strategy based on feature set FBP26

Type

TPA

SUBTPA

Total

 

µ

µ

 

 

 

 

 

1

0.04

2.00

0.31

2

0.33

1.00

0.59

3

0.30

5.00

1.17

4

0.13

7.00

1.06

Mean

0.20

3.75

0.78

 

 

 

 

to that when segmenting the image enhanced using the predicted enhancement method by the optimized BPM strategy with the FBP26 feature set. These results indicate better performance than the DNM strategy but are still inferior to the segmentation using the target expert enhancement method shown in Table 11.4. The result for breast type 1–3 show an improvement over using the FUZZY method in Table 5 (part c) for all mammograms of that type. Comparing the results from the evaluation of the two feature sets, FBP26 and FBP316 from the BPM strategy, the results indicate that the feature set FBP26 is better suited to processing mammograms with breast types 1–3, whereas the feature set FBP316 gives better performance on the densest breast type, i.e., type 4. Interestingly, for both feature sets, the performance improvement is worse over the fattiest breast types, type 1, compared with the densest, type 4. This is because of the variability of optimal enhancement method for the fatty breast types, whereas the denser breasts tend to be optimal enhanced by the FUZZY method more often.

11.3.5 Key Observations

Table 11.9 shows the shows the percentage improvement in segmenting the unenhanced image compared to that when segmenting the image enhanced using the predicted enhancement method for each of the two strategies compared that