- Distribution Overview
- Discrete Distributions
- Continuous Distributions
- Probability Theory
- Random Variables
- Transformations
- Expectation
- Variance
- Inequalities
- Distribution Relationships
- Probability and Moment Generating Functions
- Multivariate Distributions
- Standard Bivariate Normal
- Bivariate Normal
- Multivariate Normal
- Convergence
- Statistical Inference
- Point Estimation
- Empirical Distribution
- Statistical Functionals
- Parametric Inference
- Method of Moments
- Maximum Likelihood
- Delta Method
- Multiparameter Models
- Multiparameter Delta Method
- Parametric Bootstrap
- Hypothesis Testing
- Bayesian Inference
- Credible Intervals
- Function of Parameters
- Priors
- Conjugate Priors
- Bayesian Testing
- Exponential Family
- Sampling Methods
- The Bootstrap
- Rejection Sampling
- Importance Sampling
- Decision Theory
- Risk
- Admissibility
- Bayes Rule
- Minimax Rules
- Linear Regression
- Simple Linear Regression
- Prediction
- Multiple Regression
- Model Selection
- Non-parametric Function Estimation
- Density Estimation
- Histograms
- Kernel Density Estimator (KDE)
- Smoothing Using Orthogonal Functions
- Stochastic Processes
- Markov Chains
- Poisson Processes
- Time Series
- Stationary Time Series
- Estimation of Correlation
- Detrending
- ARIMA models
- Causality and Invertibility
- Spectral Analysis
- Math
- Gamma Function
- Beta Function
- Series
- Combinatorics
Training error

$$\hat{R}_{\mathrm{tr}}(S) = \sum_{i=1}^{n} \bigl(\hat{Y}_i(S) - Y_i\bigr)^2$$

$$R^2(S) = 1 - \frac{\mathrm{rss}(S)}{\mathrm{tss}} = 1 - \frac{\hat{R}_{\mathrm{tr}}(S)}{\mathrm{tss}} = 1 - \frac{\sum_{i=1}^{n} \bigl(\hat{Y}_i(S) - Y_i\bigr)^2}{\sum_{i=1}^{n} \bigl(Y_i - \bar{Y}\bigr)^2}$$

The training error is a downward-biased estimate of the prediction risk:

$$\mathbb{E}\bigl[\hat{R}_{\mathrm{tr}}(S)\bigr] < R(S)$$

$$\operatorname{bias}\bigl(\hat{R}_{\mathrm{tr}}(S)\bigr) = \mathbb{E}\bigl[\hat{R}_{\mathrm{tr}}(S)\bigr] - R(S) = -2 \sum_{i=1}^{n} \operatorname{Cov}\bigl[\hat{Y}_i, Y_i\bigr]$$
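As a numerical check on the downward bias, a small simulation (a sketch; the linear-model setup, seed, and dimensions are assumptions for illustration) compares the average training error with the risk evaluated on fresh responses:

```python
# Sketch (assumed setup): simulate a linear model and compare the average
# training error E[R_tr] with the prediction risk R on fresh responses.
import numpy as np

rng = np.random.default_rng(0)
n, k, sigma = 50, 3, 1.0
X = rng.normal(size=(n, k))
beta = np.array([1.0, -2.0, 0.5])

def fit_and_errors():
    Y = X @ beta + sigma * rng.normal(size=n)
    H = X @ np.linalg.solve(X.T @ X, X.T)      # hat matrix
    Y_hat = H @ Y                              # fitted values
    train = np.mean((Y_hat - Y) ** 2)          # training error (per obs.)
    Y_new = X @ beta + sigma * rng.normal(size=n)
    risk = np.mean((Y_hat - Y_new) ** 2)       # error on fresh responses
    return train, risk

errs = np.array([fit_and_errors() for _ in range(2000)])
print(errs[:, 0].mean(), "<", errs[:, 1].mean())   # E[R_tr] < R
```

The gap between the two averages is close to $2k\sigma^2/n$ per observation, matching the covariance form of the bias above.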
Adjusted R2
$$\bar{R}^2(S) = 1 - \frac{n-1}{n-k}\,\frac{\mathrm{rss}}{\mathrm{tss}}$$
Mallow's Cp statistic
$$\hat{R}(S) = \hat{R}_{\mathrm{tr}}(S) + 2k\hat{\sigma}^2 = \text{lack of fit} + \text{complexity penalty}$$
Akaike Information Criterion (AIC)
$$\mathrm{AIC}(S) = \ell_n(\hat{\beta}_S, \hat{\sigma}^2_S) - k$$
Bayesian Information Criterion (BIC)
$$\mathrm{BIC}(S) = \ell_n(\hat{\beta}_S, \hat{\sigma}^2_S) - \frac{k}{2} \log n$$
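A sketch comparing nested linear models with these scores. The Gaussian profile log-likelihood $\ell_n = -\frac{n}{2}\log(\mathrm{rss}/n)$ (up to constants) and taking $\hat{\sigma}^2$ from the largest model are conventions assumed here, not prescribed by the formulas above:

```python
# Sketch: score nested linear models by Mallow's Cp and BIC.
# sigma2 is estimated from the full model (an assumed convention).
import numpy as np

rng = np.random.default_rng(5)
n = 100
X = rng.normal(size=(n, 5))
Y = X[:, 0] - 2 * X[:, 1] + rng.normal(size=n)   # only 2 of 5 columns matter

def rss(k):
    b = np.linalg.lstsq(X[:, :k], Y, rcond=None)[0]
    return np.sum((Y - X[:, :k] @ b) ** 2)

sigma2 = rss(5) / (n - 5)                        # from the full model
cps, bics = [], []
for k in range(1, 6):
    cps.append(rss(k) / n + 2 * k * sigma2 / n)  # R_tr + 2k sigma^2, per obs.
    bics.append(-n / 2 * np.log(rss(k) / n) - k / 2 * np.log(n))

print([round(c, 3) for c in cps])   # Cp drops sharply once k >= 2
```

Lower Cp is better; higher BIC (in this log-likelihood form) is better. Both scores strongly prefer the two-variable model over the one-variable model here.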
Validation and training
$$\hat{R}_V(S) = \sum_{i=1}^{m} \bigl(\hat{Y}_i^*(S) - Y_i^*\bigr)^2 \qquad m = |\{\text{validation data}\}|, \text{ often } \tfrac{n}{4} \text{ or } \tfrac{n}{2}$$
Leave-one-out cross-validation
$$\hat{R}_{CV}(S) = \sum_{i=1}^{n} \bigl(Y_i - \hat{Y}_{(i)}\bigr)^2 = \sum_{i=1}^{n} \left( \frac{Y_i - \hat{Y}_i(S)}{1 - U_{ii}(S)} \right)^2$$

where $U(S) = X_S(X_S^T X_S)^{-1} X_S^T$ ("hat matrix") and $\hat{Y}_{(i)}$ is the prediction for $Y_i$ from the model fitted without the $i$-th observation.
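The second equality is an exact identity for linear least squares; a minimal sketch (setup and dimensions assumed) verifies the hat-matrix shortcut against $n$ explicit refits:

```python
# Sketch: leave-one-out CV via the hat-matrix shortcut, checked against
# explicit refits. U = X (X^T X)^{-1} X^T is the "hat matrix".
import numpy as np

rng = np.random.default_rng(1)
n, k = 30, 2
X = rng.normal(size=(n, k))
Y = X @ np.array([1.0, -1.0]) + rng.normal(size=n)

U = X @ np.linalg.solve(X.T @ X, X.T)          # hat matrix
Y_hat = U @ Y
r_cv_shortcut = np.sum(((Y - Y_hat) / (1 - np.diag(U))) ** 2)

# Explicit refits: drop observation i, predict it, accumulate squared error.
r_cv_naive = 0.0
for i in range(n):
    mask = np.arange(n) != i
    b = np.linalg.lstsq(X[mask], Y[mask], rcond=None)[0]
    r_cv_naive += (Y[i] - X[i] @ b) ** 2

print(np.isclose(r_cv_shortcut, r_cv_naive))   # the identity holds exactly
```

The shortcut costs one fit instead of $n$, which is why leave-one-out CV is cheap for linear smoothers.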
19 Non-parametric Function Estimation
19.1 Density Estimation
Estimate $f(x)$, where $\mathbb{P}[X \in A] = \int_A f(x)\,dx$.
Integrated square error (ise)

$$L(f, \hat{f}_n) = \int \bigl(f(x) - \hat{f}_n(x)\bigr)^2\,dx = J(h) + \int f^2(x)\,dx$$
Frequentist risk

$$R(f, \hat{f}_n) = \mathbb{E}\bigl[L(f, \hat{f}_n)\bigr] = \int b^2(x)\,dx + \int v(x)\,dx$$

$$b(x) = \mathbb{E}\bigl[\hat{f}_n(x)\bigr] - f(x) \qquad v(x) = \mathbb{V}\bigl[\hat{f}_n(x)\bigr]$$
19.1.1 Histograms

Definitions:

- Number of bins $m$
- Binwidth $h = \frac{1}{m}$
- Bin $B_j$ has $\nu_j$ observations
- Define $\hat{p}_j = \nu_j / n$ and $p_j = \int_{B_j} f(u)\,du$
Histogram estimator

$$\hat{f}_n(x) = \sum_{j=1}^{m} \frac{\hat{p}_j}{h}\, I(x \in B_j)$$

$$\mathbb{E}\bigl[\hat{f}_n(x)\bigr] = \frac{p_j}{h} \qquad \mathbb{V}\bigl[\hat{f}_n(x)\bigr] = \frac{p_j(1 - p_j)}{nh^2} \qquad (x \in B_j)$$

$$R(\hat{f}_n, f) \approx \frac{h^2}{12} \int \bigl(f'(u)\bigr)^2\,du + \frac{1}{nh}$$

$$h^* = \frac{1}{n^{1/3}} \left( \frac{6}{\int \bigl(f'(u)\bigr)^2\,du} \right)^{1/3}$$

$$R^*(\hat{f}_n, f) \approx \frac{C}{n^{2/3}} \qquad C = \left( \frac{3}{4} \right)^{2/3} \left( \int \bigl(f'(u)\bigr)^2\,du \right)^{1/3}$$

Cross-validation estimate of $\mathbb{E}[J(h)]$

$$\hat{J}_{CV}(h) = \int \hat{f}_n^2(x)\,dx - \frac{2}{n} \sum_{i=1}^{n} \hat{f}_{(-i)}(X_i) = \frac{2}{(n-1)h} - \frac{n+1}{(n-1)h} \sum_{j=1}^{m} \hat{p}_j^2$$
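The closed-form CV score needs only the bin proportions $\hat{p}_j$, so scanning over $m$ is cheap. A sketch of bin selection by this score (the Beta-distributed sample on $[0,1)$ and the search range are assumptions for illustration):

```python
# Sketch: histogram estimator on [0, 1) and the closed-form CV score
# J_CV(h) = 2/((n-1)h) - (n+1)/((n-1)h) * sum_j p_hat_j^2, scanned over m.
import numpy as np

rng = np.random.default_rng(2)
n = 500
X = rng.beta(2, 5, size=n)                     # data supported on [0, 1)

def j_cv(m):
    h = 1.0 / m
    counts = np.bincount(np.minimum((X * m).astype(int), m - 1), minlength=m)
    p_hat = counts / n
    return 2 / ((n - 1) * h) - (n + 1) / ((n - 1) * h) * np.sum(p_hat ** 2)

ms = np.arange(1, 101)
best_m = ms[np.argmin([j_cv(m) for m in ms])]
print(best_m)   # CV-selected number of bins
```

Note that $m = 1$ always gives $\hat{J}_{CV} = -1$ exactly (one bin with $\hat{p}_1 = 1$), so any informative density pushes the minimizer to more bins.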
19.1.2 Kernel Density Estimator (KDE)

Kernel $K$:

- $K(x) \ge 0$
- $\int K(x)\,dx = 1$
- $\int x K(x)\,dx = 0$
- $\int x^2 K(x)\,dx \equiv \sigma_K^2 > 0$
KDE

$$\hat{f}_n(x) = \frac{1}{n} \sum_{i=1}^{n} \frac{1}{h} K\!\left( \frac{x - X_i}{h} \right)$$

$$R(f, \hat{f}_n) \approx \frac{1}{4} (h \sigma_K)^4 \int \bigl(f''(x)\bigr)^2\,dx + \frac{1}{nh} \int K^2(x)\,dx$$

$$h^* = \frac{c_1^{-2/5} c_2^{1/5} c_3^{-1/5}}{n^{1/5}} \qquad c_1 = \sigma_K^2, \quad c_2 = \int K^2(x)\,dx, \quad c_3 = \int \bigl(f''(x)\bigr)^2\,dx$$

$$R^*(f, \hat{f}_n) = \frac{c_4}{n^{4/5}} \qquad c_4 = \frac{5}{4} \underbrace{\bigl(\sigma_K^2\bigr)^{2/5} \left( \int K^2(x)\,dx \right)^{4/5}}_{C(K)} \left( \int (f'')^2\,dx \right)^{1/5}$$
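A minimal KDE sketch with a Gaussian kernel. Since $h^*$ depends on the unknown $\int (f'')^2$, this uses the normal-reference rule $h = 1.06\,\hat{\sigma}\,n^{-1/5}$ instead (an assumption, not the optimal-bandwidth formula above):

```python
# Sketch: Gaussian-kernel KDE f_n(x) = (1/n) sum_i K((x - X_i)/h)/h,
# with the rule-of-thumb bandwidth h = 1.06 * sigma_hat * n^(-1/5).
import numpy as np

rng = np.random.default_rng(3)
n = 400
X = rng.normal(size=n)

h = 1.06 * X.std() * n ** (-1 / 5)             # normal-reference bandwidth

def f_hat(x):
    u = (np.asarray(x)[..., None] - X) / h     # standardised gaps to each X_i
    K = np.exp(-u ** 2 / 2) / np.sqrt(2 * np.pi)
    return K.mean(axis=-1) / h

grid = np.linspace(-4, 4, 201)
dens = f_hat(grid)
total = dens.sum() * (grid[1] - grid[0])       # Riemann sum, should be ~1
print(total)
```

The estimate integrates to (almost) one by construction, since each kernel bump contributes mass $1/n$.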
Epanechnikov Kernel

$$K(x) = \begin{cases} \dfrac{3}{4\sqrt{5}} \bigl(1 - x^2/5\bigr) & |x| < \sqrt{5} \\ 0 & \text{otherwise} \end{cases}$$

Cross-validation estimate of $\mathbb{E}[J(h)]$

$$\hat{J}_{CV}(h) = \int \hat{f}_n^2(x)\,dx - \frac{2}{n} \sum_{i=1}^{n} \hat{f}_{(-i)}(X_i) \approx \frac{1}{hn^2} \sum_{i=1}^{n} \sum_{j=1}^{n} K^*\!\left( \frac{X_i - X_j}{h} \right) + \frac{2}{nh} K(0)$$

$$K^*(x) = K^{(2)}(x) - 2K(x) \qquad K^{(2)}(x) = \int K(x - y) K(y)\,dy$$

19.2 Non-parametric Regression

Estimate $r(x)$ where $r(x) = \mathbb{E}[Y \mid X = x]$. Consider pairs of points $(x_1, Y_1), \dots, (x_n, Y_n)$ related by

$$Y_i = r(x_i) + \epsilon_i \qquad \mathbb{E}[\epsilon_i] = 0 \qquad \mathbb{V}[\epsilon_i] = \sigma^2$$

k-nearest Neighbor Estimator

$$\hat{r}(x) = \frac{1}{k} \sum_{i : x_i \in N_k(x)} Y_i \qquad \text{where } N_k(x) = \{ k \text{ values of } x_1, \dots, x_n \text{ closest to } x \}$$

Nadaraya-Watson Kernel Estimator

$$\hat{r}(x) = \sum_{i=1}^{n} w_i(x) Y_i \qquad w_i(x) = \frac{K\!\left( \frac{x - x_i}{h} \right)}{\sum_{j=1}^{n} K\!\left( \frac{x - x_j}{h} \right)} \in [0, 1]$$

$$R(\hat{r}_n, r) \approx \frac{h^4}{4} \left( \int x^2 K(x)\,dx \right)^2 \int \left( r''(x) + 2 r'(x) \frac{f'(x)}{f(x)} \right)^2 dx + \frac{\sigma^2 \int K^2(x)\,dx}{nh} \int \frac{dx}{f(x)}$$

The optimal bandwidth $h^* \approx c_1 n^{-1/5}$ gives

$$R^*(\hat{r}_n, r) \approx \frac{c_2}{n^{4/5}}$$

Cross-validation estimate of $\mathbb{E}[J(h)]$

$$\hat{J}_{CV}(h) = \sum_{i=1}^{n} \bigl(Y_i - \hat{r}_{(-i)}(x_i)\bigr)^2 = \sum_{i=1}^{n} \left( \frac{Y_i - \hat{r}(x_i)}{1 - \dfrac{K(0)}{\sum_{j=1}^{n} K\!\left( \frac{x_i - x_j}{h} \right)}} \right)^2$$

19.3 Smoothing Using Orthogonal Functions

Approximation

$$r(x) = \sum_{j=1}^{\infty} \beta_j \phi_j(x) \approx \sum_{j=1}^{J} \beta_j \phi_j(x)$$

Multivariate regression

$$Y = \Phi \beta + \eta \qquad \text{where } \eta_i = \epsilon_i \text{ and } \Phi = \begin{pmatrix} \phi_0(x_1) & \cdots & \phi_J(x_1) \\ \vdots & \ddots & \vdots \\ \phi_0(x_n) & \cdots & \phi_J(x_n) \end{pmatrix}$$

Least squares estimator

$$\hat{\beta} = (\Phi^T \Phi)^{-1} \Phi^T Y \approx \frac{1}{n} \Phi^T Y \quad \text{(for equally spaced observations only)}$$
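A sketch of series smoothing with a cosine basis (the basis choice, $J$, and data are assumptions for illustration). On an equally spaced design the cosine columns are exactly orthogonal, $\Phi^T\Phi = nI$, which is what justifies the shortcut $\hat{\beta} \approx \frac{1}{n}\Phi^T Y$:

```python
# Sketch: least-squares smoothing on the cosine basis phi_0 = 1,
# phi_j(x) = sqrt(2) cos(pi j x), with x equally spaced on [0, 1].
import numpy as np

rng = np.random.default_rng(6)
n, J = 200, 8
x = (np.arange(n) + 0.5) / n                   # equally spaced design
Y = np.sin(2 * np.pi * x) + 0.3 * rng.normal(size=n)

# Design matrix Phi with columns phi_0, ..., phi_J evaluated at x
Phi = np.hstack([np.ones((n, 1)),
                 np.sqrt(2) * np.cos(np.pi * np.outer(x, np.arange(1, J + 1)))])

beta_ls = np.linalg.solve(Phi.T @ Phi, Phi.T @ Y)   # (Phi^T Phi)^{-1} Phi^T Y
beta_fast = Phi.T @ Y / n                           # shortcut for equispaced x
r_hat = Phi @ beta_ls                               # smoothed fit

print(np.max(np.abs(beta_ls - beta_fast)))          # near zero: Phi^T Phi = n I
```

For this midpoint design the two estimators agree up to floating-point error, so the smoother reduces to $J+1$ inner products with the data.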