appendix D Matrix algebra in R

Many of the functions described in this book operate on matrices. The manipulation of matrices is built deeply into the R language. Table D.1 describes operators and functions that are particularly important for solving linear algebra problems. In the table, A and B are matrices, x and b are vectors, and k is a scalar.

Table D.1 R functions and operators for matrix algebra

Operator or function		Description

+ - * / ^	Element-wise addition, subtraction, multiplication, division, and exponentia-
	tion, respectively.
A %*% B	Matrix multiplication.
A %o% B	Outer product: AB'.
cbind(A, B, …)	Combines matrices or vectors horizontally. Returns a matrix.
chol(A)	Choleski factorization of A. If R <- chol(A), then chol(A) contains the
	upper triangular factor, such that R'R = A.
colMeans(A)	Returns a vector containing the column means of A.
crossprod(A)	Returns A'A.
crossprod(A,B)	Returns A'B.
colSums(A)	Returns a vector containing the column sums of A.
diag(A)	Returns a vector containing the elements of the principal diagonal.
diag(x)	Creates a diagonal matrix with elements of x in the principal diagonal.
diag(k)	If k is a scalar, this creates a k × k identity matrix. Go figure.
eigen(A)	Eigenvalues and eigenvectors of A. If y <- eigen(A) then
	■	y$val are the eigenvalues of A.
	■	y$vec are the eigenvectors of A.

542

		APPENDIX D Matrix algebra in R	543
Table D.1 R functions and operators for matrix algebra

Operator or function		Description

ginv(A)	Moore-Penrose Generalized Inverse of A. (Requires the MASS package.)
qr(A)	QR decomposition of A. If y <- qr(A), then
	■	y$qr has an upper triangle that contains the decomposition and a lower
		triangle that contains information on the decomposition.
	■	y$rank is the rank of A.
	■	y$qraux is a vector which contains additional information on Q.
	■	y$pivot contains information on the pivoting strategy used.
rbind(A, B, …)	Combines matrices or vectors vertically. Returns a matrix.
rowMeans(A)	Returns a vector containing the row means of A.
rowSums(A)	Returns a vector containing the row sums of A.
solve(A)	Inverse of A where A is a square matrix.
solve(A, b)	Solves for vector x in the equation b = Ax.
svd(A)	Single-value decomposition of A. If y <- svd(A), then
	■	y$d is a vector containing the singular values of A.
	■	y$u is a matrix with columns containing the left singular vectors of A.
	■	y$v is a matrix with columns containing the right singular vectors of A.
t(A)	Transpose of A.

Several user-contributed packages are particularly useful for matrix algebra. The matlab package contains wrapper functions and variables used to replicate MATLAB function calls as closely as possible. These functions can help you port MATLAB applications and code to R. There’s also a useful cheat sheet for converting MATLAB statements to R statements at http://mathesaurus.sourceforge.net/octave-r.html.

The Matrix package contains functions that extend R in order to support highly dense or sparse matrices. It provides efficient access to BLAS (Basic Linear Algebra Subroutines), Lapack (dense matrix), TAUCS (sparse matrix), and UMFPACK (sparse matrix) routines.

Finally, the matrixStats package provides methods for operating on the rows and columns of matrices, including functions that calculate counts, sums, products, central tendency, dispersion, and more. Each is optimized for speed and efficient memory use.

appendix E Packages used in this book

R derives much of its breadth and power from the contributions of selfless authors. Table E.1 lists the user-contributed packages described in this book, along with the chapter(s) in which they appear.

Table E.1 Contributed packages used in this book

Package	Authors	Description	Chapter(s)

AER	Christian Kleiber and Achim	Functions, data sets, examples,	13
	Zeileis	demos, and vignettes from the
		book Applied Econometrics with R
		by Christian Kleiber and Achim
		Zeileis (Springer, 2008)
Amelia	James Honaker, Gary King, and	Amelia II: a program for missing	18
	Matthew Blackwell	data via multiple imputation
arrayImpute	Eun-kyung Lee, Dankyu Yoon, and	Missing imputation for microarray	18
	Taesung Park	data
arrayMiss-	Eun-kyung Lee and Taesung	Exploratory analysis of missing pat-	18
Pattern	Park	terns for microarray data
boot	S original by Angelo Canty. R port	Bootstrap functions	12
	by Brian Ripley
ca	Michael Greenacre and Oleg	Simple, multiple, and joint corre-	7
	Nenadic	spondence analysis
car	John Fox and Sanford Weisberg	Companion to Applied	1, 8, 9,
		Regression	10, 11,
			19, 22
cat	Ported to R by Ted Harding and	Analysis of categorical-variable	15
	Fernando Tusell; original by	datasets with missing values
	Joseph L. Schafer

544

APPENDIX E Packages used in this book

545