Lagrange dual of maximum margin separation problem (1)
maximize    1T λ + 1T µ
subject to  2 ‖ Σ_{i=1}^{N} λi xi − Σ_{i=1}^{M} µi yi ‖2 ≤ 1          (2)
            1T λ = 1T µ,  λ ⪰ 0,  µ ⪰ 0
from duality, optimal value is inverse of maximum margin of separation
interpretation
• change variables to θi = λi/1T λ, γi = µi/1T µ, t = 1/(1T λ + 1T µ)

• invert objective to minimize 1/(1T λ + 1T µ) = t

minimize    t
subject to  ‖ Σ_{i=1}^{N} θi xi − Σ_{i=1}^{M} γi yi ‖2 ≤ t
            θ ⪰ 0, 1T θ = 1,  γ ⪰ 0, 1T γ = 1
optimal value is distance between convex hulls
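The change of variables can be sanity-checked numerically. A minimal sketch, with made-up multipliers chosen to satisfy 1T λ = 1T µ (the values below are illustrative, not from the slides):

```python
# Made-up feasible dual multipliers with 1^T lam == 1^T mu.
lam = [0.25, 0.25]   # N = 2 multipliers for the points x_i
mu = [0.125, 0.375]  # M = 2 multipliers for the points y_i
s_lam, s_mu = sum(lam), sum(mu)
assert s_lam == s_mu

# change of variables from the slide
theta = [li / s_lam for li in lam]   # theta_i = lam_i / (1^T lam)
gamma = [mi / s_mu for mi in mu]     # gamma_i = mu_i / (1^T mu)
t = 1.0 / (s_lam + s_mu)             # t = 1/(1^T lam + 1^T mu)

# theta and gamma are convex-combination weights, as the new constraints require
assert abs(sum(theta) - 1.0) < 1e-12
assert abs(sum(gamma) - 1.0) < 1e-12
```

So Σ θi xi and Σ γi yi are points in the convex hulls of the x's and y's, which is why the optimal t is the distance between the hulls.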
Geometric problems 8–10
Approximate linear separation of non-separable sets
minimize    1T u + 1T v
subject to  aT xi + b ≥ 1 − ui,  i = 1, . . . , N
            aT yi + b ≤ −1 + vi,  i = 1, . . . , M
            u ⪰ 0,  v ⪰ 0
• an LP in a, b, u, v
• at optimum, ui = max{0, 1 − aT xi − b}, vi = max{0, 1 + aT yi + b}
• can be interpreted as a heuristic for minimizing #misclassified points
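The closed-form expression for the optimal slacks can be evaluated directly. A small sketch with made-up 2-D data points and a hypothetical candidate separator (a, b); none of these values come from the slides:

```python
# Hypothetical data: the x_i should satisfy a^T x + b >= 1,
# the y_i should satisfy a^T y + b <= -1.
X = [(2.0, 1.0), (1.5, 2.0), (0.5, 0.5)]     # the N = 3 points x_i
Y = [(-1.0, -1.0), (-2.0, 0.5), (0.2, 0.0)]  # the M = 3 points y_i
a = (1.0, 1.0)
b = -0.5

def dot(p, q):
    return sum(pi * qi for pi, qi in zip(p, q))

# at the optimum of the LP the slacks take the closed form from the slide:
# u_i = max{0, 1 - a^T x_i - b},  v_i = max{0, 1 + a^T y_i + b}
u = [max(0.0, 1.0 - dot(a, x) - b) for x in X]
v = [max(0.0, 1.0 + dot(a, y) + b) for y in Y]

total_slack = sum(u) + sum(v)   # the LP objective 1^T u + 1^T v
```

Here only the last point of each set violates its margin inequality (u = [0, 0, 0.5], v = [0, 0, 0.7]), which is why the total slack is a rough proxy for the number of misclassified points.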
Geometric problems 8–11
Support vector classifier
minimize    ‖a‖2 + γ(1T u + 1T v)
subject to  aT xi + b ≥ 1 − ui,  i = 1, . . . , N
            aT yi + b ≤ −1 + vi,  i = 1, . . . , M
            u ⪰ 0,  v ⪰ 0
produces point on trade-off curve between inverse of margin 2/‖a‖2 and classification error, measured by total slack 1T u + 1T v
same example as previous page, with γ = 0.1:
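A sketch of evaluating the trade-off objective for one candidate separator; the data points, a, and b below are made up for illustration (only γ = 0.1 comes from the slide):

```python
import math

# Evaluate ||a||_2 + gamma*(1^T u + 1^T v) for a hypothetical candidate (a, b).
X = [(2.0, 1.0), (1.5, 2.0)]   # the points x_i
Y = [(-1.0, -1.0), (0.2, 0.0)] # the points y_i
a, b, gamma = (1.0, 1.0), 0.0, 0.1

def dot(p, q):
    return sum(pi * qi for pi, qi in zip(p, q))

u = [max(0.0, 1.0 - dot(a, x) - b) for x in X]  # slacks of the x_i
v = [max(0.0, 1.0 + dot(a, y) + b) for y in Y]  # slacks of the y_i

# small gamma favors a large margin (small ||a||_2);
# large gamma favors small total slack
obj = math.hypot(a[0], a[1]) + gamma * (sum(u) + sum(v))
```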
Geometric problems 8–12
Nonlinear discrimination
separate two sets of points by a nonlinear function:
f(xi) > 0, i = 1, . . . , N, f(yi) < 0, i = 1, . . . , M
• choose a linearly parametrized family of functions
f(z) = θT F (z)
F = (F1, . . . , Fk) : Rn → Rk are basis functions
• solve a set of linear inequalities in θ:
θT F (xi) ≥ 1, i = 1, . . . , N, θT F (yi) ≤ −1, i = 1, . . . , M
Geometric problems 8–13
quadratic discrimination: f(z) = zT P z + qT z + r
xiT P xi + qT xi + r ≥ 1,    yiT P yi + qT yi + r ≤ −1
can add additional constraints (e.g., P ⪯ −I to separate by an ellipsoid)

polynomial discrimination: F(z) are all monomials up to a given degree
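A minimal check of the sign conditions for a hand-picked quadratic discriminant; the data and the choice P = −4I (which satisfies P ⪯ −I), q = 0, r = 3 are assumptions for illustration, not a solved instance:

```python
# Quadratic discriminant f(z) = z^T P z + q^T z + r separating points
# inside a circle from points outside it (made-up data and parameters).
P = [[-4.0, 0.0], [0.0, -4.0]]   # P = -4*I, so P <= -I: an ellipsoidal separator
q = [0.0, 0.0]
r = 3.0

def f(z):
    quad = sum(z[i] * P[i][j] * z[j] for i in range(2) for j in range(2))
    lin = sum(q[i] * z[i] for i in range(2))
    return quad + lin + r

X = [(0.1, 0.2), (-0.3, 0.4), (0.5, 0.0)]   # inside the ellipsoid
Y = [(1.0, 0.5), (-1.2, 0.3), (0.0, -1.5)]  # outside the ellipsoid

# the discrimination inequalities f(x_i) >= 1, f(y_i) <= -1 hold
assert all(f(x) >= 1 for x in X)
assert all(f(y) <= -1 for y in Y)
```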
separation by ellipsoid / separation by 4th degree polynomial

Geometric problems 8–14
Placement and facility location
• N points with coordinates xi ∈ R2 (or R3)
• some positions xi are given; the other xi’s are variables
• for each pair of points, a cost function fij(xi, xj)
placement problem

minimize    Σ_{i≠j} fij(xi, xj)

variables are positions of free points

interpretations
• points represent plants or warehouses; fij is transportation cost between facilities i and j
• points represent cells on an IC; fij represents wirelength
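For the quadratic cost fij(xi, xj) = ‖xi − xj‖2², the simplest instance, one free point linked to a set of fixed points, has a closed-form optimum: setting the gradient Σj 2(x − pj) to zero places the free point at the centroid. A sketch with made-up coordinates:

```python
# One free point x linked to three fixed points (made-up coordinates),
# with quadratic link costs ||x - p_j||_2^2.
fixed = [(0.0, 0.0), (2.0, 0.0), (1.0, 3.0)]

k = len(fixed)
# stationarity of sum_j ||x - p_j||^2 gives x = (1/k) * sum_j p_j
x_opt = (sum(p[0] for p in fixed) / k, sum(p[1] for p in fixed) / k)

def cost(x):
    return sum((x[0] - p[0]) ** 2 + (x[1] - p[1]) ** 2 for p in fixed)

# perturbing the free point in any direction can only increase the cost
assert cost(x_opt) <= cost((x_opt[0] + 0.1, x_opt[1]))
assert cost(x_opt) <= cost((x_opt[0], x_opt[1] - 0.1))
```

With several free points the same stationarity conditions couple the points and give a sparse linear system rather than a per-point average.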
Geometric problems 8–15
example: minimize Σ_{(i,j)∈A} h(‖xi − xj‖2), with 6 free points, 27 links

optimal placement for h(z) = z, h(z) = z2, h(z) = z4
histograms of connection lengths ‖xi − xj‖2
Geometric problems 8–16
Convex Optimization — Boyd & Vandenberghe
9. Numerical linear algebra background
• matrix structure and algorithm complexity
• solving linear equations with factored matrices
• LU, Cholesky, LDLT factorization
• block elimination and the matrix inversion lemma
• solving underdetermined equations
9–1
Matrix structure and algorithm complexity
cost (execution time) of solving Ax = b with A ∈ Rn×n
• for general methods, grows as n3
• less if A is structured (banded, sparse, Toeplitz, . . . )
flop counts
• flop (floating-point operation): one addition, subtraction, multiplication, or division of two floating-point numbers
• to estimate complexity of an algorithm: express number of flops as a (polynomial) function of the problem dimensions, and simplify by keeping only the leading terms
• not an accurate predictor of computation time on modern computers
• useful as a rough estimate of complexity
Numerical linear algebra background 9–2
vector-vector operations (x, y ∈ Rn)
• inner product xT y: 2n − 1 flops (or 2n if n is large)
• sum x + y, scalar multiplication αx: n flops
matrix-vector product y = Ax with A ∈ Rm×n
• m(2n − 1) flops (or 2mn if n large)
• 2N if A is sparse with N nonzero elements
• 2p(n + m) if A is given as A = UV T , U ∈ Rm×p, V ∈ Rn×p
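The last count assumes y is computed as U(V T x), never forming A explicitly; a small sketch with made-up factors confirming both orderings give the same result:

```python
# A = U V^T given in factored form, U (m x p), V (n x p):
# computing y = U (V^T x) costs about 2p(n + m) flops, versus 2mn
# after forming A explicitly. Numbers below are made up.
m, n, p = 3, 2, 1
U = [[1.0], [2.0], [3.0]]   # m x p
V = [[4.0], [5.0]]          # n x p
x = [1.0, 1.0]

# w = V^T x (about 2pn flops), then y = U w (about 2pm flops)
w = [sum(V[i][k] * x[i] for i in range(n)) for k in range(p)]
y = [sum(U[i][k] * w[k] for k in range(p)) for i in range(m)]

# same result as forming A = U V^T and doing the 2mn-flop multiply
A = [[sum(U[i][k] * V[j][k] for k in range(p)) for j in range(n)]
     for i in range(m)]
y_dense = [sum(A[i][j] * x[j] for j in range(n)) for i in range(m)]
assert y == y_dense
```

For p ≪ min(m, n) the factored ordering is dramatically cheaper: 2p(n + m) grows linearly in m and n instead of as their product.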
matrix-matrix product C = AB with A ∈ Rm×n, B ∈ Rn×p
• mp(2n − 1) flops (or 2mnp if n large)
• less if A and/or B are sparse
• (1/2)m(m + 1)(2n − 1) ≈ m2n if m = p and C symmetric
Numerical linear algebra background 9–3