
9.3 Binning-free Methods


 

 

 

 

Fig. 9.7. Deconvolution of a blurred picture with the satellite method.

$$= \sigma_f \sqrt{\frac{1}{60} + \frac{1}{25}} = 0.24\,\sigma_f \;.$$

Measurement resolution and acceptance should stay approximately constant in the region into which the events migrate. When we start with a reasonably good approximation of the true distribution, this condition is usually satisfied. In exceptional cases it would be necessary to update the distribution of the satellites yik after each move, i.e. to simulate or correct them once again. It is more efficient, however, to perform the adaptation for all elements periodically, after a certain number of migration steps.

The number K determines the maximal resolution after the deconvolution; it therefore has a regularization effect. For example, for a measurement resolution σf and K = 16 the minimal sampling interval is σT = σf /√K = σf /4.

If the true p.d.f. has several maxima, we may find several relative minima of the energy. Then a new stochastic element has to be introduced into the minimization (see Sect. 5.2.7): a move towards a position with smaller energy is not performed automatically, but only preferred statistically.
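The following is a minimal sketch of such a stochastic acceptance rule in the spirit of simulated annealing; the energy function, the proposal step and the temperature parameter are generic placeholders, not prescriptions from the text.

```python
import math
import random

def stochastic_move(energy, state, propose, temperature=1.0):
    """One stochastic minimization step: a move that lowers the energy is
    always accepted, a move that raises it is accepted only with a
    probability that decreases with the energy increase."""
    candidate = propose(state)
    delta = energy(candidate) - energy(state)
    if delta <= 0 or random.random() < math.exp(-delta / temperature):
        return candidate   # move accepted
    return state           # move rejected, keep the old configuration
```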

We have not yet explained how acceptance losses are taken into account. The simplest possibility is the following: if there are acceptance losses, we need Ki′ > K trials to generate the K satellites of the event yi. Consequently, we assign the weight wi = Ki′/K to the element yi. After the deconvolution we then obtain a weighted sample.
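A minimal sketch of this weighting rule, assuming that the smearing and the acceptance of the apparatus are available as functions (both are placeholders for the concrete experiment):

```python
def make_satellites(y_i, smear, accept, K):
    """Generate K accepted satellites for the element y_i and return them
    together with the weight w_i = K_i' / K, where K_i' is the number of
    trials needed to obtain the K accepted satellites."""
    satellites, trials = [], 0
    while len(satellites) < K:
        trials += 1
        candidate = smear(y_i)      # simulate the measurement process
        if accept(candidate):       # acceptance of the apparatus
            satellites.append(candidate)
    return satellites, trials / K   # weight w_i = K_i' / K
```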

A more detailed description of the satellite method is found in [53].

9.3.3 The Maximum Likelihood Method

In the rare cases where the transfer function t(x, x′) is known analytically or is otherwise easily calculable, we can maximize a likelihood in which the parameters are the locations of the true points. Neglecting acceptance losses, the p.d.f. for an observation x′, with the true values x1, . . . , xN as parameters, is



Fig. 9.8. Deconvolution of point locations. The left-hand side shows, from top to bottom, the true distribution, the smeared distribution and the deconvoluted distribution. The right-hand side shows the corresponding projections onto the x axis in the form of histograms.


$$f_N(x'\,|\,x_1, \ldots, x_N) = \frac{1}{N}\sum_{i=1}^{N} t(x_i, x')\,,$$

where t is assumed to be normalized with respect to x′. The log-likelihood is then given, up to a constant, by

 

 

$$\ln L(x_1,\ldots,x_N \,|\, x'_1,\ldots,x'_N) = \sum_{k=1}^{N} \ln\!\left[\sum_{i=1}^{N} t(x_i,\,x'_k)\right].$$

 

 

The maximum can be found either with well-known minimum searching procedures or with the migration method described above, which is not restricted to low event numbers. Of course, maximizing the likelihood leads to the same artifacts as observed in the histogram-based methods: the true points form clusters which eventually degenerate into discrete distributions. A smooth result is obtained by stopping the maximizing process before the maximum has been reached. For definiteness, similarly to the case of histogram deconvolution, a fixed difference of the likelihood from its maximum value should be chosen to stop the maximization

process. Similarly to the histogram case, this difference should be of the order of ΔL ≈ √(NDF/2), where the number of degrees of freedom NDF is equal to the number of points times the dimension of the space.
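As an illustration, a minimal sketch of the quantities involved, assuming the transfer function t(x, x′) is available as an ordinary Python callable (names and interfaces are illustrative):

```python
import math

def log_likelihood(true_points, observed, transfer):
    """ln L = sum_k ln( sum_i t(x_i, x'_k) ); the constant coming from the
    1/N normalization is dropped, as in the text."""
    return sum(
        math.log(sum(transfer(x_true, x_obs) for x_true in true_points))
        for x_obs in observed
    )

def stopping_offset(n_points, dimension):
    """Regularizing stop: end the maximization while the likelihood is still
    about sqrt(NDF/2) below its maximum, with NDF = points * dimension."""
    return math.sqrt(n_points * dimension / 2.0)
```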

Example 124: Deconvolution by fitting the true event locations

Fig. 9.8, top, shows 2000 points randomly generated according to the superposition of two normal distributions, denoted as N(x, y|µx, µy, σx, σy):

f(x, y) = 0.6 N(x, y| − 2, 0, 1, 1) + 0.4 N(x, y| + 2, 0, 1, 1) .

The transfer function is again a normal distribution centered at the true points with symmetric standard deviations of one unit. The original distribution is convolved with it; the result is shown in Fig. 9.8, middle. The starting values of the parameters x̂i, ŷi are set equal to the observed locations xi, yi. Randomly selected points are then moved within squares of size 4 × 4 units, and moves that improve the likelihood are kept. After 5000 successful moves the procedure is stopped to avoid clustering of the true points. The result is shown in the lower plot of Fig. 9.8. On the right-hand side of the same figure the projections of the distributions onto the x axis are presented in the form of histograms.
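A compact sketch of the procedure used in this example; the Gaussian transfer function, the 4 × 4 step size and the stopping rule follow the text, while the brute-force recomputation of the full likelihood at every step is a simplification (caching the inner sums and updating only the contribution of the moved point would reduce the cost per step from O(N²) to O(N)):

```python
import math
import random

def gauss2d(dx, dy, sigma=1.0):
    """Symmetric normal transfer function centered at the true point."""
    return math.exp(-(dx * dx + dy * dy) / (2 * sigma ** 2)) / (2 * math.pi * sigma ** 2)

def deconvolve_by_migration(observed, n_success=5000, step=4.0, sigma=1.0):
    """Start from the observed locations, move randomly chosen points within
    step x step squares and keep only moves that increase the likelihood;
    stop after n_success accepted moves (this acts as the regularization)."""
    true_pts = [list(p) for p in observed]        # starting values = observations

    def log_like():
        return sum(
            math.log(sum(gauss2d(xt - xo, yt - yo, sigma) for xt, yt in true_pts))
            for xo, yo in observed
        )

    current, accepted = log_like(), 0
    while accepted < n_success:
        i = random.randrange(len(true_pts))
        old = true_pts[i][:]
        true_pts[i][0] += random.uniform(-step / 2, step / 2)
        true_pts[i][1] += random.uniform(-step / 2, step / 2)
        new = log_like()
        if new > current:                          # keep only improving moves
            current, accepted = new, accepted + 1
        else:
            true_pts[i] = old                      # undo the rejected move
    return true_pts
```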

Generally, as with other deconvolution methods, we have to find a sensible compromise between smoothness and resolution and to choose the corresponding regularization strength. In astronomy and optics the signals often originate from point sources. In this case it is reasonable to omit the regularization completely and to determine the locations and strengths of the sources by maximizing the likelihood. Eventually, after inspection of the result, the number of sources might be fixed and the parameters determined in a standard likelihood fit.

9.4 Comparison of the Methods

We have discussed three different methods for the deconvolution of histograms, and three binning-free methods:


1. Likelihood fit of the true histogram with curvature-sensitive or entropy regularization

2. Multiplication of the observed histogram vector by the inverted, regularized transfer matrix

3. Iterative deconvolution

4. Iterative binning-free deconvolution

5. The satellite method

6. The binning-free likelihood method

The first method is more transparent than the others. The user has the possibility to adapt the regularization function to his specific needs. With curvature regularization he may, for instance, choose a different regularization for different regions of the histogram, or for the different dimensions of a higher-dimensional histogram. He may also regularize with respect to an assumed shape of the resulting histogram. The statistical accuracy in different parts of the histogram can be taken into account. Regularization with the entropy approach is technically simpler, but it is not suited for applications in particle physics, because it favors a globally uniform distribution, while the local smearing calls for a local smoothing. It has, however, been successfully applied in astronomy and been further adjusted to specific problems there. An overview with critical remarks is given in [56].

The second method is independent of the shape of the distribution to be deconvoluted; it depends on the transfer matrix only. This has the advantage of being independent of subjective influences of the user. A disadvantage is that regions of the true histogram with high statistics are not treated differently from those with only a few entries. A refined version which has been applied successfully in several experiments is presented in [8].

The third procedure is technically the simplest. It can be shown that it is very similar to the second method. It also suppresses small eigenvalues of the transfer matrix.

The binning-free iterative method has the disadvantage that the user has to choose some parameters. It requires sufficiently high statistics in all regions of the observation space. An advantage is that there are no approximations related to the binning. The deconvolution again produces single points in the observation space, which can be subjected to selection criteria and collected into arbitrary histograms, whereas methods working with histograms have to decide on the corresponding parameters before the deconvolution is performed.

The satellite method has the same advantages. Moreover, no important parameters have to be chosen. It is especially well suited for small samples and multidimensional distributions, where other methods have difficulties. For large samples it is rather slow, even on large computers.

The binning-free likelihood method requires an analytic transfer function. It is much faster than the satellite method, and is especially well suited for the deconvolution of narrow structures like point sources.

A qualitative comparison of the different methods does not show big differences in the results. For the majority of problems, the deconvolution of histograms with the fitting method and curvature regularization is the preferred solution.


Fig. 9.9. Result of a deconvolution with strong (top) and weak (bottom) regularization. The errors depend on the regularization strength.

As stated above, whenever the possibility exists to parametrize the true distribution, the deconvolution process should be avoided and replaced by a standard fit.

9.5 Error Estimation for the Deconvoluted Distribution

The fitting methods produce error estimates automatically; for the other methods the uncertainties can be obtained by the usual error propagation. But we run into another unavoidable difficulty connected with the regularization: the size of the errors depends on the strength of the regularization, which, on the other hand, is unrelated to the statistical accuracy of the data. This is illustrated in Fig. 9.9, where the deconvolution of a structure function with different regularization strengths is shown. A strongly regularized distribution may exhibit smaller errors than the distribution before the convolution. This is unsatisfactory, as we lose information by the smearing. We should present errors which do not depend on data manipulations.

As described above, the errors of neighboring bins of a histogram are negatively correlated. The goodness-of-fit changes only slightly if we, for instance, enhance a bin content and accordingly reduce the contents of the two neighboring bins, or vice versa. The regularization has the effect of minimizing this difference while keeping the sum of entries nearly unchanged, as can be seen in the example of Fig. 9.9. The effect of the regularization is sketched in Fig. 9.10. Even a soft regularization will reduce the area of the error ellipse considerably.


Fig. 9.10. Effect of regularization. The correlation of neighboring bins is reduced. (Panels: weak regularization, strong regularization.)

Fig. 9.11. Deconvoluted distribution with strong and weak regularization. The horizontal error bars represent the experimental resolution.

A sensible presentation of the result of a deconvolution, in which the values but not the errors depend on the regularization, is the following: for each deconvoluted point we calculate its statistical error δθj, neglecting uncertainties due to a possibly wrong association of observations to the corresponding bin. This means, essentially, that the relative error is equal to one over the square root of the number of observed events associated with the true bin.


The number of observed events belonging to bin j is Σi Tij θj. Thus we get for the relative bin error

$$\frac{\delta\theta_j}{\theta_j} = \frac{1}{\sqrt{\sum_i T_{ij}\,\theta_j}}\;.$$
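A small sketch of this error assignment, assuming the transfer matrix T and the deconvoluted bin contents θ are available as plain Python lists (all names are illustrative):

```python
import math

def relative_bin_errors(T, theta):
    """Relative errors delta_theta_j / theta_j = 1 / sqrt(sum_i T[i][j] * theta[j]),
    i.e. one over the square root of the expected number of observed events
    associated with true bin j."""
    errors = []
    for j, theta_j in enumerate(theta):
        n_observed = sum(row[j] for row in T) * theta_j
        errors.append(1.0 / math.sqrt(n_observed) if n_observed > 0 else float("inf"))
    return errors
```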

 

 

Besides the pure Poisson fluctuations, the graphical presentation should also show the measurement resolution. We represent it by a horizontal bar for each point. Fig. 9.11 shows a deconvolution result for two regularizations of different strength. The curve represents the true distribution. Contrary to the representation of Fig. 9.9, the vertical error bars are now independent of the regularization strength. The horizontal bars indicate the resolution. With the strong regularization the valley is somewhat filled up due to the suppression of curvature, and the points follow the curve more closely than expected from the error bars. The experienced scientist is able to judge, also for the weak regularization, that the curve is compatible with the data.

In multi-dimensional and in binning-free applications a graphical representation of the resolution is difficult, but the transfer function has to be documented in some way.

10 Hypothesis Tests

10.1 Introduction

So far we have treated problems where a data sample was used to discriminate between completely fixed competing hypotheses or to estimate free parameters of a given distribution. Now we turn to the case where we would like to find out whether a single hypothesis, without a completely defined alternative, is true or not. Some of the questions which we would like to address are the following:

1. Track parameters are fitted to some measured points. Are the deviations of the coordinates from the adjusted track compatible with statistical fluctuations, or should we reject the hypothesis that they are related to a particle trajectory?

2. Can we describe a sample of e+e− reactions by quantum electrodynamics?

3. Do two samples obtained at different times in an experiment differ significantly from each other?

4. Is a signal in a spectral distribution significant?

5. Can we describe a background distribution significantly better by a linear or by a higher-order polynomial?

To answer questions of this kind, we will have to set up a test procedure which quantifies the compatibility of a given data sample with a hypothesis. The test has to provide a quantitative result which is used to judge how plausible or unlikely a hypothesis is; definite judgements – right or wrong – are outside the possibilities of statistics. A test can never prove the validity of a hypothesis; it can only indicate problems with it.

A scientist who chooses a certain test procedure has to fix all parameters of the test before looking at the data1. Under no circumstances is it allowed to base the selection of a test on the data which one wants to analyze, to optimize a test on the basis of the data, or to terminate the running time of an experiment as a function of the output of a test. This would bias the result. Obviously, it is allowed to optimize a test with a part of the data which is then excluded from the final analysis.

1 Scientists often call this a blind analysis.


Usually a test is associated with a decision: accept or reject. We will not always attempt a decision but confine ourselves to fixing the parameters which form the basis for a possible decision.

As mentioned, we will primarily deal with that part of test theory which is especially important in the natural sciences and in many other empirical research areas, namely the case in which only one hypothesis, which we call the null hypothesis H0, is tested, while the admitted alternative is so vague or general that it cannot be parameterized. The alternative hypothesis H1 is simply “H0 is false”. The question is whether the sample at hand is in agreement with H0 or whether it deviates significantly from it. The corresponding tests are called goodness-of-fit (GOF) tests.

Strongly related to GOF tests are two-sample tests which check whether two samples belong to the same population.

At the end of this chapter we will treat another case in which we have a partially specified alternative and which plays an important role in physics. There the goal is to investigate whether a small signal is significant or explainable by a fluctuation of a background distribution corresponding to H0. We call this procedure signal test.

10.2 Some Definitions

Before addressing GOF tests, we introduce some notations.

10.2.1 Simple and Composite Hypotheses

We distinguish between simple and composite hypotheses. The former fix the population uniquely. Thus H0: “The sample is drawn from a normal distribution with mean zero and variance one, i.e. N(0, 1)” is a simple hypothesis. If the alternative is also simple, e.g. H1: “N(5, 1)”, then we have the task of deciding between two simple hypotheses, which we have already treated in Chap. 6, Sect. 6.3. In this simple case there exists an optimum test, the likelihood ratio test.
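For illustration, a minimal sketch of the likelihood ratio statistic for the two simple hypotheses quoted here, N(0, 1) versus N(5, 1) (function names are illustrative):

```python
import math

def normal_pdf(x, mu, sigma=1.0):
    """Density of N(mu, sigma^2) at x."""
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * math.sqrt(2 * math.pi))

def log_likelihood_ratio(sample, mu0=0.0, mu1=5.0):
    """ln[ L(H0) / L(H1) ]; large positive values favor H0 = N(mu0, 1),
    large negative values favor H1 = N(mu1, 1)."""
    return sum(
        math.log(normal_pdf(x, mu0)) - math.log(normal_pdf(x, mu1))
        for x in sample
    )
```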

Composite hypotheses are characterized by free parameters, like H0: “The sample is drawn from a normal distribution.” The user will adjust mean and variance of the normal distribution and test whether the adjusted Gaussian is compatible with the data.

The hypothesis that we want to test is always H0, the null hypothesis, and the alternative H1 is in most cases the hypothesis that H0 does not apply. H1 then represents an infinite number of specified hypotheses.

10.2.2 Test Statistic, Critical Region and Significance Level

After we have fixed the null hypothesis and the admitted alternative H1, we must choose a test statistic t(x), which is a function of the sample values x ≡ {x1, . . . , xN}, possibly chosen in such a way that the difference between the distribution f(t|H0) and the distributions belonging to H1 is as large as possible. To simplify the notation, we consider one-dimensional distributions. The generalization to multi-dimensional observations is trivial.
