
Each of the 20 bee brains was taken as the raw image and automatically segmented using every one of the remaining 19 brains as an atlas. This resulted in 19 segmentations per brain, which were then combined into a final segmentation.

The most straightforward method for combining multiple classifications into one is so-called “Vote Rule” decision fusion [26]. For each voxel in the raw image, the outputs of the individual atlas-based classifiers are determined. Their “votes” are then counted, and the label that received the highest number of votes is assigned to the voxel. It is worthwhile, however, to take a closer look at the way an atlas-based classifier works: by looking up a label according to a transformed image coordinate. The label map is discrete, arranged on a 3D grid of labeled voxels, yet the transformed coordinates of the raw image voxels that we are trying to label hardly ever fall directly on grid points in the atlas. Looking up the correct label therefore requires some form of interpolation. The simplest label interpolation method is nearest neighbor (NN) interpolation, which yields a single unique label per atlas-based classifier. These labels can easily be combined using vote fusion as described above.
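For concreteness, here is a minimal sketch of Vote Rule fusion in Python/NumPy. It assumes the K atlas label maps have already been resampled onto the raw image grid with NN interpolation; the function and variable names are ours, not from the chapter:

```python
import numpy as np

def vote_fusion(label_maps):
    """Majority ('Vote Rule') fusion of K resampled atlas label maps.

    label_maps: list of K integer arrays of identical shape, one per
    atlas-based classifier, already transformed into the raw image grid.
    """
    stack = np.stack(label_maps)                 # shape (K, z, y, x)
    n_labels = int(stack.max()) + 1
    votes = np.zeros((n_labels,) + stack.shape[1:], dtype=np.int32)
    for label in range(n_labels):
        # Count how many classifiers voted for this label at each voxel.
        votes[label] = (stack == label).sum(axis=0)
    return votes.argmax(axis=0)                  # highest vote count wins
```

A call such as `fused = vote_fusion([seg_1, seg_2, seg_3])` then yields one combined label map.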

A slightly more complex interpolation technique that can be applied to labels is partial volume interpolation (PVI) as introduced by Maes et al. [36]. Here, the labels of all eight neighbors of the interpolated coordinate are determined and weighted with the trilinear interpolation coefficients of their respective grid nodes. Therefore, the output of an atlas-based classifier using PVI is a vector of weights between zero and one, which are assigned to each of the possible labels. One can interpret the weights as the confidence of the classifier in the respective label being the correct answer. These weighted decisions from all classifiers can be combined by so-called “Sum Rule” fusion [26]. The weights for each label are added over all classifiers, and the label with the highest sum is taken as the combined decision.
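A corresponding sketch of PVI followed by Sum Rule fusion might look as follows. It assumes the continuous atlas coordinate has already been produced by the registration transformation and lies strictly inside the atlas grid; again, all names are illustrative:

```python
import numpy as np

def pvi_weights(atlas_labels, coord, n_labels):
    """Partial volume interpolation (PVI): distribute the trilinear weights
    of the eight grid neighbors of a continuous coordinate onto their labels.

    atlas_labels: 3D integer label map; coord: (z, y, x) float coordinate,
    assumed to lie strictly inside the atlas grid.
    """
    base = np.floor(coord).astype(int)           # lower grid corner
    frac = np.asarray(coord) - base              # fractional offsets in [0, 1)
    weights = np.zeros(n_labels)
    for dz in (0, 1):
        for dy in (0, 1):
            for dx in (0, 1):
                w = ((frac[0] if dz else 1.0 - frac[0]) *
                     (frac[1] if dy else 1.0 - frac[1]) *
                     (frac[2] if dx else 1.0 - frac[2]))
                weights[atlas_labels[base[0] + dz, base[1] + dy, base[2] + dx]] += w
    return weights                               # nonnegative, sums to one

def sum_rule(per_classifier_weights):
    """'Sum Rule' fusion: add each label's weights over all classifiers
    and return the label with the highest total."""
    return int(np.sum(per_classifier_weights, axis=0).argmax())
```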

11.5 Quantifying Segmentation Accuracy

In addition to presenting selected algorithms for atlas-based segmentation, this chapter provides a quantitative comparison among different methods. For each segmentation that we perform, its accuracy is computed.


The accuracies achieved for each image are then compared among different methods in order to illustrate quality differences and to identify superior algorithms.

Computing the accuracy of a segmentation requires a gold standard, or ground truth. That is, the correct segmentation of an image needs to be known in order to compute the accuracy of an automatically generated segmentation of that image. While a manual segmentation is by no means guaranteed to be correct, it is commonly accepted today to use one performed by a human expert, supported by advanced semi-automatic labeling techniques such as intelligent scissors [40], as the gold standard against which automatic segmentation methods are measured.

11.5.1 Similarity Index

Figure 11.11 provides a visual impression of the segmentation result for two representative slices from one segmented bee brain image.

Figure 11.11: Example of segmentation using non-rigid image registration (MUL atlas selection paradigm). The two columns show axial images at two different slice locations. Top row: Overlays of segmentation contours (shown in white) after non-rigid image registration. Bottom row: Difference images between manual and automatic segmentation. Voxels with different labels assigned by manual and automatic segmentation are shown in black.


However, in order to effectively compare different segmentation methods, we need to quantify the segmentation accuracy. One possible measure of segmentation quality is the similarity index (SI) [87]. For a structure s, the SI is computed from the set V_auto(s) of voxels in s according to the automatic segmentation and the set V_manual(s) of voxels in s according to the (gold standard) manual segmentation:

$$ \mathrm{SI}(s) = \frac{2\,\bigl|V_{\mathrm{manual}}(s) \cap V_{\mathrm{auto}}(s)\bigr|}{\bigl|V_{\mathrm{manual}}(s)\bigr| + \bigl|V_{\mathrm{auto}}(s)\bigr|}. \qquad (11.8) $$

For perfect mutual overlap of both segmentations, manual and automatic, the SI has a value of 1. Lesser overlap results in smaller values of SI. No overlap between the segmentations results in an SI value of 0. A major advantage of the SI measure is that it is sensitive to both over-segmentation and under-segmentation, that is, it recognizes both false positives and false negatives among the voxels of a given structure.
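Equation (11.8) translates directly into a few lines of array code. The following sketch (our own illustration, not code from the chapter) computes the SI of one structure from a manual and an automatic label map:

```python
import numpy as np

def similarity_index(manual, auto, label):
    """Similarity index (Eq. 11.8) of one structure, given two label maps."""
    m = (manual == label)                        # voxels of s, manual
    a = (auto == label)                          # voxels of s, automatic
    overlap = np.logical_and(m, a).sum()         # |V_manual ∩ V_auto|
    return 2.0 * overlap / (m.sum() + a.sum())
```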

11.5.2 Bias from Structure Volume

In order to understand the SI values computed later in this chapter and to compare them with other published values, we investigated the dependence of SI values on object size. We performed a numerical simulation in which discretely sampled spheres of various radii were dilated by one or two voxels and the SI values between the original and dilated spheres were computed. The resulting SI values are plotted versus object radius in Fig. 11.12. It is also easy to derive a closed-form expression for the continuous case. The SI between two concentric spheres, one with radius R and the other dilated by d, i.e., with a radius of R + d, is

$$ \mathrm{SI} = \frac{2\,(R/d)^3}{2\,(R/d)^3 + 3\,(R/d)^2 + 3\,(R/d) + 1}. \qquad (11.9) $$

The SI values for the discrete and continuous cases are almost identical (Fig. 11.12). The SI value between a sphere and a concentric dilated sphere approximates the SI value for a segmentation error consisting of a misclassification of uniform thickness on the perimeter of a spherical object. Inspection of Fig. 11.12 and Eq. (11.9) shows that SI depends strongly on object size and is smaller for smaller objects. A one-voxel-thick misclassification on the perimeter of a spherical object with a radius of 50 voxels has an SI value of 0.97, but for a radius of 10 voxels the SI value is only 0.86.
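Equation (11.9) is straightforward to evaluate numerically; the following snippet (ours) reproduces the two SI values just quoted:

```python
def si_dilated_sphere(R, d):
    """SI between a sphere of radius R and its dilation by d (Eq. 11.9)."""
    q = R / d
    return 2 * q**3 / (2 * q**3 + 3 * q**2 + 3 * q + 1)

print(si_dilated_sphere(50, 1))   # ≈ 0.97: large object, high SI
print(si_dilated_sphere(10, 1))   # ≈ 0.86: small object, noticeably lower
```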



Figure 11.12: Dependence of SI values on size for spherical objects. The squares show SI values computed from discrete numerical simulation of dilation by one voxel. The solid line shows SI values for the continuous case (Eq. 11.9). Note that while the units on the horizontal axis are voxels for the discrete case, they are arbitrary units for the continuous case.

Thus it is not surprising that Dawant et al. [16] reported mean SI values of 0.96 for segmentation of the human brain from MR images and mean SI values of only 0.85 for segmentation of smaller brain structures such as the caudate.

In Fig. 11.13, the average volumes of the anatomical structures in the bee brain images under consideration are shown with the actual segmentation accuracies achieved for them using one of the segmentation methods discussed later (MUL). It is easy to see that the larger a structure, the more accurately it was typically segmented by the atlas-based segmentation. This confirms the theoretical treatment above and illustrates the varying bias of the SI metric when segmenting structures of different sizes.

11.5.3 Bias from Structure Shape

A simple numerical measure that characterizes the shape of a geometrical object is its surface-to-volume ratio (SVR). For a discrete set of labeled voxels in a segmented structure, we can approximate the SVR ρ as the ratio of the number of surface voxels N_s to the total number of voxels N_t, that is,

$$ \rho = \frac{N_s}{N_t}. \qquad (11.10) $$


 

 


Figure 11.13: Volumes of anatomical structures and corresponding segmentation accuracies. The gray bars show the volumes (in numbers of voxels) of the 22 anatomical structures, averaged over the 20 bee brains. The black vertical lines show the range of SI values achieved by the automatic segmentation (MUL paradigm) over all segmented raw images. The diamond shows the median over all segmented raw images.

A surface voxel can simply be defined as one that has at least one neighbor with a label different from its own. When the entire surface of a structure is misclassified, this can be seen as an erosion of the structure by one voxel. The SI value computed between the original structure and the eroded structure therefore represents the SI resulting from a segmentation that misclassifies exactly all surface voxels. From the structure’s SVR ρ and its total volume V, this SI can be computed as

$$ \mathrm{SI} = \frac{2\,V(1-\rho)}{V + (1-\rho)\,V} = \frac{1-\rho}{1-\rho/2}. \qquad (11.11) $$
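Equations (11.10) and (11.11) can be checked with a short script. The sketch below is our own; it uses SciPy's binary_erosion and a 6-neighborhood as one possible reading of "neighbor" (the chapter does not fix the neighborhood):

```python
import numpy as np
from scipy.ndimage import binary_erosion

def surface_to_volume_ratio(mask):
    """Approximate SVR ρ (Eq. 11.10) of a binary structure mask: the
    fraction of its voxels that touch a voxel outside the structure."""
    interior = binary_erosion(mask)    # voxels whose 6-neighbors are all inside
    n_total = int(mask.sum())
    return (n_total - int(interior.sum())) / n_total

def si_all_surface_misclassified(rho):
    """Predicted SI when exactly all surface voxels are misclassified (Eq. 11.11)."""
    return (1.0 - rho) / (1.0 - rho / 2.0)
```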

Similarly, we can estimate the SI resulting from a misclassification of half of all surface voxels. Figure 11.14 shows the SVR values computed for all structures in all 20 bee brains, plotted versus the SI values of the automatic segmentations. The figure also shows two curves that represent the theoretical misclassification of all and of half of all surface voxels, respectively.

For a typical segmentation result of a single structure, a detailed comparison of manual and automatic segmentation is shown in Fig. 11.15.


 


Figure 11.14: Similarity index vs. surface-to-volume ratio. Each dot represents one structure in one brain (418 structures in total). The average over all individuals for one structure is marked by a ×. The solid and dashed lines show the theoretical relationship between SVR and SI for misclassification of all and half of all surface voxels, respectively.

The structure shown here, a right ventral mushroom body, is typical in that its volume and its surface-to-volume ratio are close to the respective means over all structures (volume 141k voxels vs. mean 142k voxels; SVR 0.24 vs. mean 0.36). The segmentation accuracy for the segmentation shown was SI = 0.86, which is the median SI value over all structures and all brains.

11.5.4 Comparison of Atlas Selection Strategies

The results achieved using the different atlas selection strategies outlined above are visualized in Figs. 11.16–11.19. Each graph plots the distribution of the SI segmentation accuracies over 19 segmentations, separated by anatomical structure. There were 19 segmentations per strategy because one of the 20 available bee brain images served as the fixed individual atlas for the IND strategy; to avoid biasing the evaluation, this brain was therefore not used as a raw image for the remaining strategies either.

A comparison of all four strategies is shown in Fig. 11.20.


Figure 11.15: A typical segmented structure: right ventral mushroom body (SI = 0.86). Columns from left to right: microscopy image, contour from manual segmentation, contour from automatic segmentation (MUL paradigm), and difference image between manual and automatic segmentation. The white pixels in the difference image show where manual and automatic segmentation disagree. Rows from top to bottom: axial, sagittal, and coronal slices through the right ventral mushroom body.

It is easy to see from the latter figure that the widely used IND strategy produced the least accurate results of all strategies. Only slightly better results were achieved by selecting a different individual atlas for each raw image (the SIM strategy), based on the criterion of NMI after non-rigid registration discussed in Section 11.4.2. The AVG strategy, segmentation using an average shape atlas, outperformed both the IND and SIM strategies, but was itself clearly outperformed by the MUL strategy. Our results therefore show that the multiclassifier approach to atlas-based segmentation produced substantially more accurate segmentations than the other three strategies. This finding is statistically significant under a t-test on the SI values for all structures over all segmentations, and it confirms the experience of the pattern recognition community that multiple classifier systems are generally superior to single classifiers [27, 83].



Figure 11.16: SI by label for segmentation using a single individual atlas (IND atlas selection strategy) [reproduced from [53]].


Another interesting finding is that both the AVG and the MUL strategies performed better than the theoretical upper bound of any strategy working with only a single individual atlas (series labeled “Best SI” in Fig. 11.20). We note that “Best SI” is an upper bound not only for any method that selects the best atlas for each raw image, but also for any possible selection of one atlas for all raw images. It is therefore also an upper bound for the IND and SIM strategies, which in our study can consequently never outperform the AVG or MUL strategies.


Figure 11.17: SI by label for segmentation using the most similar single individual atlas (SIM atlas selection strategy) [reproduced from [53]].


 


Figure 11.18: SI by label for segmentation using a single average shape atlas (AVG atlas selection strategy) [reproduced from [53]].

11.6 More on Segmentation with Multiple Atlases

We saw in the previous section that a multiclassifier approach to atlas-based segmentation outperforms atlas-based segmentation with a single atlas, be it an individual atlas, an average atlas, or even the best out of a database of atlases.

 


Figure 11.19: SI by label for segmentation by combining multiple independent atlas-based segmentations (MUL atlas selection strategy) [reproduced from [53]].



Figure 11.20: Percentage of registration-based segmentations with similarity index SI better than the given threshold plotted by atlas selection strategy. The series labeled “Best SI” is the upper bound of all strategies working with a single individual atlas (see text for details).

Compared to that, the insight underlying the SIM (“most similar”) atlas selection strategy was that different atlases lead to segmentations of different accuracies. Combined, both observations lead to an even more interesting concept: the combination of multiple atlas-based segmentations, weighted by estimates of their individual segmentation accuracy.

In other words, if we had estimates of how well each atlas-based classifier is performing, then we could be more confident in decisions of those classifiers that perform well, compared to the decisions of those that do not. One would hope that by concentrating on more accurate classifiers in the ensemble, the classification accuracy would be further improved.
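One way to make this concrete, purely as an illustration and sidestepping the question of where the performance estimates come from (which the EM methods below address), is to weight each classifier's Sum Rule contribution by an estimate of its accuracy. The names and the simple linear weighting scheme here are our assumptions, not the chapter's method:

```python
import numpy as np

def weighted_sum_fusion(confidences, performance):
    """Performance-weighted 'Sum Rule' fusion for one voxel.

    confidences: (K, n_labels) per-label weights from K classifiers (e.g. PVI)
    performance: (K,) estimated accuracy of each classifier (assumed given)
    """
    confidences = np.asarray(confidences, dtype=float)
    weights = np.asarray(performance, dtype=float)[:, None]
    combined = (weights * confidences).sum(axis=0)   # weighted per-label sums
    return int(combined.argmax())                    # best weighted total wins
```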

The performance of each atlas-based classifier is obviously not known in general, due to the lack of a ground truth. However, several methods have been proposed that can estimate the performance parameters, for example, using expectation maximization (EM) methods. Two of these are outlined below, one based on a per-label binary performance model [79], and another based on a simultaneous multilabel performance model [60, 61].

For the description of both methods, we assume that an image with N voxels is segmented by K different (atlas-based) classifiers. For each voxel x, we denote by e_k(x) the decision of classifier k, which is one of the labels assigned in