PRINCIPAL COMPONENT ANALYSIS

Principal component analysis (PCA) is an algebraic method to obtain a representation of the data that emphasizes the differences among the samples (i.e., their scatter). PCA is good for reconstruction: it provides a compact representation of the most important features of the samples. It is not good for discrimination: if the samples belong to two classes, PCA does not necessarily emphasize the differences between the classes. The PCA representation is decorrelated: it is a projection onto orthogonal axes, and along these axes the components of the data are uncorrelated.

Let X1, ..., Xn be the n samples, each a vector of size m; i.e., the components of the sample vectors are Xi,a, where i = 1..n is the sample index and a = 1..m is the component index. The mean of the samples is

XM,a = (1/n) ∑i=1..n Xi,a

and the reduced samples are denoted Yi,a = Xi,a - XM,a. The covariance of the samples is the mxm matrix (up to a factor 1/n)

Ca,b = ∑i Yi,a Yi,b

The principal components are the eigenvectors of Ca,b,

∑b Ca,b Uk,b = Lk Uk,a

where Lk denote the eigenvalues, k = 1..m. The covariance matrix is positive semidefinite, therefore its eigenvalues are non-negative. However its rank is at most n, and therefore it is singular when n < m, i.e., when the number of samples is smaller than the dimension of the vector space. In this case we can find the (at most) n eigenvectors with non-zero eigenvalues by considering the nxn matrix

Di,j = ∑a Yi,a Yj,a

Given the eigenvectors and eigenvalues of Di,j,

∑j Di,j Vk,j = Lk Vk,i

where k=1..n, the vectors

Uk,a = ∑i Yi,a Vk,i

are the first n eigenvectors of Ca,b, with eigenvalues Lk.
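
The procedure is straightforward to put in code. Below is a minimal sketch in Python (assuming only NumPy; the function name pca_small_sample and the tolerance used to discard zero eigenvalues are illustrative choices, not part of the method) that builds the Gram matrix Di,j when n < m and recovers the eigenvectors Uk,a of the covariance as described above.

    import numpy as np

    def pca_small_sample(X):
        """X: (n, m) array of n samples of size m, with n < m.
        Returns the eigenvalues L (descending) and the eigenvectors U of the
        covariance C, one eigenvector per row of U."""
        n, m = X.shape
        Y = X - X.mean(axis=0)                 # reduced samples Y_{i,a} = X_{i,a} - XM_a
        D = Y @ Y.T                            # n x n matrix D_{i,j} = sum_a Y_{i,a} Y_{j,a}
        L, V = np.linalg.eigh(D)               # eigenpairs of D, in ascending order
        keep = L > 1e-10 * L.max()             # discard (numerically) zero eigenvalues
        L, V = L[keep][::-1], V[:, keep][:, ::-1]
        U = V.T @ Y                            # U_{k,a} = sum_i V_{k,i} Y_{i,a}
        U /= np.linalg.norm(U, axis=1, keepdims=True)   # normalize the eigenvectors
        return L, U

    # usage: 10 samples in a 50-dimensional space (n < m)
    rng = np.random.default_rng(0)
    X = rng.normal(size=(10, 50))
    L, U = pca_small_sample(X)
    Y = X - X.mean(axis=0)
    C = Y.T @ Y                                # full m x m covariance (up to the 1/n factor)
    print(np.allclose(C @ U[0], L[0] * U[0]))  # True: U[0] is an eigenvector of C

The final check verifies that the vector recovered through the nxn problem is indeed an eigenvector of the full mxm covariance.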


Fisher Linear Discriminant Analysis

Fisher Linear Discriminant Analysis (FDA or LDA) aims to overcome the poor discriminability of PCA, while retaining its capability of dimensionality reduction. The basic idea is to choose a basis that emphasizes the discrimination between the classes.

Given k classes, with nk samples Xk,i each (n samples in total), the intraclass covariance is the mxm matrix

CW = (1/n) ∑k,i Yk,i Yk,iT

where Yk,i = Xk,i - XkM is the i-th sample of class k reduced to the mean XkM of its class. The interclass covariance is (again a matrix of size mxm)

CB = ∑k (nk/n) (XkM - XM) (XkM - XM)T

where XkM is the mean vector of class k and XM is the global mean.
Notice that C = CB + CW, where C is the global covariance of all the samples (with the 1/n factor included).

If CW is non-singular, the optimal basis is the matrix Z with orthonormal columns that maximizes the ratio of the determinant of ZT CB Z to that of ZT CW Z. It can be shown that the columns Zi of the transformation matrix Z solve the generalized eigenvalue problem (i = 1, ..., m)

CB Zi = Li CW Zi

The eigenvectors Zi with the largest eigenvalues Li form the most discriminant basis (Fisher basis). For k classes at most k - 1 of the eigenvalues are non-zero.
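
As a concrete illustration, here is a minimal sketch of the FDA computation in Python (assuming NumPy plus scipy.linalg.eigh for the generalized symmetric eigenproblem; the function name fisher_basis and the label array y are illustrative, not part of the method). It builds CW and CB exactly as defined above, checks the decomposition C = CB + CW, and solves CB Zi = Li CW Zi.

    import numpy as np
    from scipy.linalg import eigh

    def fisher_basis(X, y):
        """X: (n, m) samples, y: (n,) integer class labels.
        Returns the generalized eigenvalues (descending) and the Fisher
        basis, one direction per column of Z."""
        n, m = X.shape
        XM = X.mean(axis=0)                    # global mean
        CW = np.zeros((m, m))                  # intraclass covariance
        CB = np.zeros((m, m))                  # interclass covariance
        for c in np.unique(y):
            Xc = X[y == c]
            nc, XcM = len(Xc), Xc.mean(axis=0)
            Yc = Xc - XcM                      # samples of class c reduced to the class mean
            CW += (Yc.T @ Yc) / n
            CB += (nc / n) * np.outer(XcM - XM, XcM - XM)
        C = (X - XM).T @ (X - XM) / n          # global covariance (1/n factor included)
        assert np.allclose(C, CB + CW)         # the decomposition noted above
        L, Z = eigh(CB, CW)                    # generalized problem CB z = L CW z (CW non-singular)
        order = np.argsort(L)[::-1]            # most discriminant directions first
        return L[order], Z[:, order]

Only the first k - 1 columns of Z carry discriminant information, since the remaining generalized eigenvalues vanish.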

The rank of CW is at most n - k. When the intraclass covariance is singular, for instance because the number of samples is smaller than their size, one can first perform a dimensionality reduction with PCA, reducing the m-dimensional vectors to vectors of size n - k, and then apply the FDA to them. The first reduction retains most of the reconstruction properties of PCA; the FDA then chooses a new, more discriminant basis in the reduced representation.
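
A sketch of this two-step reduction, reusing the pca_small_sample and fisher_basis functions from the sketches above (again with illustrative names), might look as follows: it projects onto the leading n - k principal components, computes the Fisher basis in the reduced space, and maps it back to a single m x (k - 1) projection.

    import numpy as np

    def pca_then_fda(X, y):
        """Two-step reduction for singular CW (e.g. n < m): PCA to n - k
        dimensions, then FDA in the reduced space.
        Returns an m x (k - 1) projection matrix."""
        n, m = X.shape
        k = len(np.unique(y))
        XM = X.mean(axis=0)
        _, U = pca_small_sample(X)             # PCA basis, eigenvectors as rows
        P = U[: n - k].T                       # keep the leading n - k components (m x (n - k))
        Xr = (X - XM) @ P                      # reduced representation, n x (n - k)
        _, Z = fisher_basis(Xr, y)             # FDA in the reduced space (CW non-singular there)
        return P @ Z[:, : k - 1]               # Fisher basis expressed in the original m-dim space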


Linear Feature Analysis

Linear Feature Analysis (LFA) is yet another technique for extracting a small set of significant features; it has both good reconstruction properties and small dimensionality. Like all reconstruction techniques it trades off reconstruction against generalization, and therefore it has limited discrimination capabilities.

...



Marco Corvi