7 Feature Selection and Extraction
  7.1 Selecting an Initial Set of Features
  7.2 Procedure for Feature Selection
  7.3 Feature Selection Using Support Vector Machines
    7.3.1 Backward or Forward Feature Selection
    7.3.2 Support Vector Machine-Based Feature Selection
    7.3.3 Feature Selection by Cross-Validation
  7.4 Feature Extraction
  References

8 Clustering
  8.1 Domain Description
  8.2 Extension to Clustering
  References

9 Maximum-Margin Multilayer Neural Networks
  9.1 Approach
  9.2 Three-Layer Neural Networks
  9.3 CARVE Algorithm
  9.4 Determination of Hidden-Layer Hyperplanes
    9.4.1 Rotation of Hyperplanes
    9.4.2 Training Algorithm
  9.5 Determination of Output-Layer Hyperplanes
  9.6 Determination of Parameter Values
  9.7 Performance Evaluation
  References

10 Maximum-Margin Fuzzy Classifiers
  10.1 Kernel Fuzzy Classifiers with Ellipsoidal Regions
    10.1.1 Conventional Fuzzy Classifiers with Ellipsoidal Regions
    10.1.2 Extension to a Feature Space
    10.1.3 Transductive Training
    10.1.4 Maximizing Margins
    10.1.5 Performance Evaluation
  10.2 Fuzzy Classifiers with Polyhedral Regions
    10.2.1 Training Methods
    10.2.2 Performance Evaluation
  References

11 Function Approximation
  11.1 Optimal Hyperplanes
  11.2 L1 Soft-Margin Support Vector Regressors
  11.3 L2 Soft-Margin Support Vector Regressors
  11.4 Model Selection
  11.5 Training Methods
    11.5.1 Overview
    11.5.2 Newton's Methods
    11.5.3 Active Set Training
  11.6 Variants of Support Vector Regressors
    11.6.1 Linear Programming Support Vector Regressors
    11.6.2 ν-Support Vector Regressors
    11.6.3 Least-Squares Support Vector Regressors
  11.7 Variable Selection
    11.7.1 Overview
    11.7.2 Variable Selection by Block Deletion
    11.7.3 Performance Evaluation
  References

A Conventional Classifiers
  A.1 Bayesian Classifiers
  A.2 Nearest-Neighbor Classifiers
  References

B Matrices
  B.1 Matrix Properties
  B.2 Least-Squares Methods and Singular Value Decomposition
  B.3 Covariance Matrices
  References

C Quadratic Programming
  C.1 Optimality Conditions
  C.2 Properties of Solutions

D Positive Semidefinite Kernels and Reproducing Kernel Hilbert Space
  D.1 Positive Semidefinite Kernels
  D.2 Reproducing Kernel Hilbert Space
  References

Index
Symbols

We use lowercase bold letters to denote vectors and uppercase italic letters to denote matrices. The following list shows the symbols used in the book:

αi        Lagrange multiplier for xi
ξi        slack variable associated with xi
A⁻¹       inverse of matrix A
Aᵀ        transpose of matrix A
B         set of bounded support vector indices
bi        bias term of the ith hyperplane
C         margin parameter
d         degree of a polynomial kernel
φ(x)      mapping function from x to the feature space
γ         parameter for a radial basis function kernel
K(x, x′)  kernel
l         dimension of the feature space
M         number of training data
m         number of input variables
n         number of classes
S         set of support vector indices
U         set of unbounded support vector indices
‖x‖       Euclidean norm of vector x
wi        coefficient vector of the ith hyperplane
Xi        set of class i training data
|Xi|      number of data in the set Xi
xi        ith m-dimensional training data
yi        class label 1 or −1 for input xi for pattern classification and a scalar output for function approximation
Chapter 1
Introduction

Support vector machines and their variants and extensions, often called kernel-based methods (or simply kernel methods), have been studied extensively and applied to various pattern classification and function approximation problems. Pattern classification is the task of classifying an object into one of a given set of categories, called classes. For a specific pattern classification problem, a classifier, which is computer software, is developed so that objects are classified correctly with reasonably good accuracy. Inputs to the classifier are called features, because they are determined so that they represent each class well or so that data belonging to different classes are well separated in the input space.

In general there are two approaches to developing classifiers: a parametric approach [1], in which a priori knowledge of the data distributions is assumed, and a nonparametric approach, in which no a priori knowledge is assumed. Neural networks [2–4], fuzzy systems [5–7], and support vector machines [8, 9] are typical nonparametric classifiers. Through training using input–output pairs, classifiers acquire decision functions that classify an input into one of the given classes.

In this chapter we first classify decision functions for a two-class problem into direct and indirect decision functions. The class boundary given by a direct decision function corresponds to the curve where the function vanishes, while the class boundary given by two indirect decision functions corresponds to the curve where the two functions give the same values. Then we discuss how to define and determine direct decision functions for multiclass problems and list the benchmark data sets used in the book. Finally, we discuss some measures for evaluating the performance of classifiers and function approximators on a given data set.
1.1 Decision Functions

1.1.1 Decision Functions for Two-Class Problems

Consider classifying an m-dimensional vector x = (x1, ..., xm) into one of two classes. Suppose that we are given scalar functions g1(x) and g2(x) for Classes 1 and 2, respectively, and we classify x into Class 1 if

    g1(x) > 0,  g2(x) < 0,        (1.1)

and into Class 2 if

    g1(x) < 0,  g2(x) > 0.        (1.2)

We call these functions decision functions. By the preceding decision functions, if for x

    g1(x) g2(x) > 0               (1.3)

is satisfied, x is not classifiable (see the hatched regions in Fig. 1.1; the arrows show the positive sides of the functions).

Fig. 1.1 Decision functions in a two-dimensional space

To resolve unclassifiable regions, we may change (1.1) and (1.2) as follows. We classify x into Class 1 if

    g1(x) > g2(x)                 (1.4)
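As a concrete illustration of these rules, the following Python sketch (not from the book) implements both the strict rules (1.1) and (1.2) and the comparison rule (1.4), with Class 2 assigned when the comparison fails. The two linear decision functions g1 and g2 and their coefficients are hypothetical choices for a two-dimensional input, made up only for this example.

```python
import numpy as np

# Two hypothetical linear decision functions for a two-dimensional input x.
# The coefficients are arbitrary and chosen only for illustration.
def g1(x):
    return x[0] + x[1] - 1.0    # positive on the Class 1 side

def g2(x):
    return -x[0] + x[1] + 0.5   # positive on the Class 2 side

def classify_strict(x):
    """Rules (1.1) and (1.2): the region where g1(x) g2(x) > 0 is unclassifiable."""
    if g1(x) > 0 and g2(x) < 0:
        return 1
    if g1(x) < 0 and g2(x) > 0:
        return 2
    return None  # unclassifiable (or exactly on a boundary)

def classify_by_comparison(x):
    """Rule (1.4): compare the two decision functions, so every x gets a class."""
    return 1 if g1(x) > g2(x) else 2

x = np.array([1.0, 1.0])            # here g1(x) = 1 and g2(x) = 0.5, both positive
print(classify_strict(x))           # None: x lies in an unclassifiable region
print(classify_by_comparison(x))    # 1: resolved by comparing g1(x) and g2(x)
```

For x = (1, 1), both decision functions are positive, so the strict rules leave x unclassified, whereas the comparison rule assigns it to Class 1.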