[Figure 2.8  First principal component not best for separating categories. Eigenvalue 1 = 13.2, eigenvector = (1.00, 0.05); eigenvalue 2 = 2.4, eigenvector = (-0.05, 1.00).]

the clusters are not allowed to overlap significantly. In this data set, the numbers of patterns in the four clusters are 24, 35, 21, and 20, respectively.

Figure 2.9 provides a projection of DATA1 onto the eigenvectors corresponding to the two largest eigenvalues of the covariance matrix. The percent variance retained is 78.6%. Since the patterns were generated in a hypercube, normalizations by Eqs. (2.1) and (2.2) should have little effect on the spread of the data. The locations of the dots, or projected patterns, in Figure 2.9 suggest three clusters, with the large cluster at the top divided into two overlapping clusters. One task of cluster analysis is to establish, in the original four-dimensional space, whether a three-cluster model or a four-cluster model is more appropriate for this data set. Beware of the limitations of two-dimensional projections. The large cluster in Figure 2.9 might be well separated into two individual clusters in four dimensions; we simply might not be able to see the separation in two dimensions. Figure 2.10 is identical to Figure 2.9 except that the circles of Figure 2.9 have been replaced by category labels. The four clusters generated by the program are clearly visible from the labels. Clustering methods do not use category labels to organize the data. However, the interpretation of that organization often depends on the category labels.
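The eigenvector projection behind Figures 2.9 and 2.10 is straightforward to compute directly. The sketch below is illustrative only and is not taken from the text: it uses NumPy and a hypothetical 100 x 4 stand-in for DATA1 to form the sample covariance matrix, project the patterns onto the two eigenvectors with the largest eigenvalues, and report the percent variance retained.

```python
# Illustrative sketch (not from the text): eigenvector projection to two
# dimensions and the percent variance retained.  The data are a hypothetical
# stand-in for DATA1, not the four generated clusters described above.
import numpy as np

rng = np.random.default_rng(0)
X = rng.uniform(size=(100, 4))           # n = 100 patterns, d = 4 features

Xc = X - X.mean(axis=0)                  # center the patterns
R = np.cov(Xc, rowvar=False)             # d x d sample covariance matrix
evals, evecs = np.linalg.eigh(R)         # eigh lists eigenvalues in ascending order
order = np.argsort(evals)[::-1]
evals, evecs = evals[order], evecs[:, order]

m = 2
Y = Xc @ evecs[:, :m]                    # projected patterns: the dots in Figure 2.9
pct_retained = 100.0 * evals[:m].sum() / evals.sum()
print(f"percent variance retained by the first {m} eigenvectors: {pct_retained:.1f}%")
```

For the structured DATA1 the text reports 78.6% of the variance retained; data with little linear structure retain correspondingly less, because no direction carries much more variance than any other.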
[Figure 2.9  Two-dimensional projection of DATA1 onto the first two principal components.]

A second data set, DATA2, provides a contrast to the structured data in DATA1 and consists of 100 patterns generated uniformly and independently over a hypercube in six dimensions. Figure 2.11 shows the projection onto the two eigenvectors with the largest eigenvalues. The percent variance retained is 41.9%. The human brain tries to make sense out of dot patterns, so one might be convinced that the projection contains clusters or other structure. The fact of the matter is that the data were generated purely at random, so any accumulations of points or large empty spaces in Figure 2.11 are artifacts. Unfortunately, a clustering method, when applied to random data, will impose some type of clustered structure. The task of cluster validity is to recognize such bogus structures.

The projection in Eq. (2.4) is the most common means of representing patterns in two dimensions and of reducing the dimensionality of the set of patterns. Two reasons for the central role played by Eq. (2.4) are that it provides the best approximation to the data (in a mean-square-error sense) and that it maximizes scatter, as shown in Section 2.4.2.

2.4.2 Derivation

This section approaches linear projection from a minimum mean-square-error point of view, which means approximating the d-feature patterns in m dimensions by minimizing a square-error criterion function.
[Figure 2.10  Two-dimensional projection of DATA1 showing category labels.]

Assume that an $m \times d$ matrix is to be found which projects each d-dimensional pattern $\mathbf{x}_i$ into a point $\mathbf{y}_i$ approximating $\mathbf{x}_i$, in such a way that the square error, explained below, is minimized.

A projection is defined in terms of a set of basis vectors. A set of orthonormal basis vectors in d dimensions $\{\mathbf{e}_1, \mathbf{e}_2, \ldots, \mathbf{e}_d\}$ is defined by the property

$$\mathbf{e}_i^T \mathbf{e}_j = \begin{cases} 1 & \text{if } i = j \\ 0 & \text{if } i \neq j \end{cases}$$

Pattern $\mathbf{x}_i$ is expressed in terms of the basis vectors as follows:

$$\mathbf{x}_i = \sum_{j=1}^{d} a_{ij}\,\mathbf{e}_j, \qquad \text{where } a_{ij} = \mathbf{x}_i^T \mathbf{e}_j$$
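As a quick numerical check of these definitions, the sketch below (hypothetical data; NumPy assumed; not part of the text) builds an orthonormal basis, computes the coefficients $a_{ij} = \mathbf{x}_i^T \mathbf{e}_j$, and confirms that the full d-term expansion reproduces the pattern exactly.

```python
# Illustrative sketch: orthonormal basis, expansion coefficients, and exact
# reconstruction when all d terms are kept.
import numpy as np

rng = np.random.default_rng(1)
d = 4
x = rng.uniform(size=d)                        # one d-dimensional pattern

E, _ = np.linalg.qr(rng.normal(size=(d, d)))   # columns of E form an orthonormal basis
assert np.allclose(E.T @ E, np.eye(d))         # e_i^T e_j = 1 if i = j, 0 otherwise

a = E.T @ x                                    # coefficients a_j = x^T e_j
x_rebuilt = E @ a                              # sum over j of a_j e_j
assert np.allclose(x, x_rebuilt)               # the d-term expansion is exact
```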
[Figure 2.11  Two-dimensional projection of six-dimensional random data (DATA2).]

If $\{\mathbf{e}_1, \mathbf{e}_2, \ldots, \mathbf{e}_m\}$ is a subset of $\{\mathbf{e}_1, \mathbf{e}_2, \ldots, \mathbf{e}_d\}$, then the d-vector $\mathbf{y}_i$ defined below approximates $\mathbf{x}_i$:

$$\mathbf{y}_i = \sum_{k=1}^{m} a_{ik}\,\mathbf{e}_k, \qquad \text{where } a_{ik} = \mathbf{x}_i^T \mathbf{e}_k$$

The goodness of the approximation can be judged by the average square-error measure SE(m), which depends on m and on the set of basis vectors chosen:

$$SE(m) = \frac{1}{n} \sum_{i=1}^{n} (\mathbf{x}_i - \mathbf{y}_i)^T (\mathbf{x}_i - \mathbf{y}_i)$$

Since the m scalars $(a_{i1}, \ldots, a_{im})$ define the projection, $\mathbf{y}_i$ is really a point in the m-dimensional space obtained by projecting each pattern onto the space spanned by $\{\mathbf{e}_1, \ldots, \mathbf{e}_m\}$; that is, $\mathbf{y}_i$ can also be expressed as in Eq. (2.4). Writing $\mathbf{y}_i$ as a sum of d-vectors permits a meaningful expression of the square-error criterion.
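To make the dependence of SE(m) on the basis concrete, the following sketch (again hypothetical data and NumPy, not from the text) computes SE(m) for two orthonormal bases of the same space: the eigenvector basis of the sample covariance matrix and an arbitrary rotation. The eigenvector basis gives the smaller error, anticipating the result stated next.

```python
# Illustrative sketch: SE(m) depends on the choice of orthonormal basis.
import numpy as np

def se(X, E, m):
    """Average square error when each centered pattern x_i is replaced by its
    projection y_i onto the first m columns of the orthonormal basis E."""
    A = X @ E[:, :m]                       # coefficients a_ik = x_i^T e_k
    Y = A @ E[:, :m].T                     # approximations y_i as d-vectors
    return np.mean(np.sum((X - Y) ** 2, axis=1))

rng = np.random.default_rng(2)
X = rng.normal(size=(200, 5)) * np.array([3.0, 2.0, 1.0, 0.5, 0.2])  # unequal spreads
X -= X.mean(axis=0)                        # center the patterns

R = np.cov(X, rowvar=False)
evals, evecs = np.linalg.eigh(R)
evecs = evecs[:, np.argsort(evals)[::-1]]  # order by decreasing eigenvalue

Q, _ = np.linalg.qr(rng.normal(size=(5, 5)))   # some other orthonormal basis

m = 2
print("SE(m) with eigenvector basis:", se(X, evecs, m))
print("SE(m) with arbitrary basis:  ", se(X, Q, m))   # larger, except in degenerate cases
```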
A set of basis vectors that minimizes SE(m) has been shown to be a set of eigenvectors $(\mathbf{c}_1, \ldots, \mathbf{c}_m)$ of the covariance matrix $\mathcal{R}$ (Tou and Heydorn, 1967; Sebestyen, 1962; Wilks, 1963; Fukunaga, 1972). In addition, the basis vectors that minimize SE(m) correspond to the m largest eigenvalues of $\mathcal{R}$. Projecting $\mathbf{x}_i$ into this optimal subspace is precisely the operation in Eq. (2.4). The degree of approximation can be determined by expressing the minimum square error in terms of the eigenvalues of $\mathcal{R}$:

$$SE(m)_{\min} = \sum_{j=m+1}^{d} \lambda_j$$

This result justifies the rule suggested earlier for choosing m. Retaining only those new features that provide the largest spreads minimizes the square error. In other words, the importance, in a square-error sense, of each prospective new feature is measured by an eigenvalue of $\mathcal{R}$. The normalization of Eq. (2.2) scales the pattern space so that $\sum_{j=1}^{d} \lambda_j = d$, but the development given above is wholly applicable.

The second reason for the importance of Eq. (2.4) is that the eigenvector projection maximizes scatter. This interpretation of Eq. (2.4) has its roots in theoretical statistics (Wilks, 1963). Assuming normalization by Eq. (2.1) or (2.2), the scatter of the set of patterns is given by

$$S = n^d\,|\mathcal{R}|$$

where $|\mathcal{R}|$ is the determinant of the covariance matrix $\mathcal{R}$.

Wilks (1963) has provided a geometrical interpretation of scatter. Another interpretation follows from the relation between S and $\mathcal{R}$. From Eq. (C.2),

$$|\mathcal{R}| = \prod_{i=1}^{d} \lambda_i$$

This shows that the scatter is invariant under a rotation of the pattern space, such as Eq. (2.4) when m = d; a set of eigenvalues for $\mathcal{R}$ is also a set of eigenvalues for the covariance matrix of the rotated patterns. More important, it shows that the scatter is proportional to the product of the sample variances $(\lambda_1, \ldots, \lambda_d)$ along the rotated axes.

If the patterns are projected into an m-dimensional space by Eq. (2.4), their scatter is maximized in the m-dimensional space with respect to all other orthogonal m-dimensional projections, because Eq. (2.4) uses the eigenvectors corresponding to the m largest eigenvalues of $\mathcal{R}$. From an intuitive point of view, maximizing scatter might not appear to be as worthy a goal as minimizing square error, although both are achieved with the same transformation.

2.4.3 Discriminant Analysis

Classical discriminant analysis (Wilks, 1963; Friedman and Rubin, 1967; Fortier and Solomon, 1966; Lachenbruch and Goldstein, 1979) attempts to project patterns into a space having fewer dimensions than the original pattern space