2.5 Approaches to Classification

Fig. 2.10 Various approaches in statistical pattern recognition (ranging from Bayes decision theory and "optimal" rules when the class-conditional densities are known, through supervised parametric and nonparametric learning such as plug-in rules, density estimation, and decision-boundary construction (e.g., k-NN, artificial neural networks), to unsupervised mixture resolving and cluster analysis when they are unknown)
the model that characterizes the class-specific expectations for the distribution (Csiszar 1996). Instance-based learning algorithms are lazy-learning algorithms (Mitchell 1997), so called because they delay the induction or generalization process until classification is performed. Lazy-learning algorithms (Aha 1998; De Mantaras and Armengol 1998) require less computation time during the training phase than eager-learning algorithms (such as Bayesian networks, decision trees, or neural networks) but more computation time during the classification process. One of the most straightforward instance-based learning algorithms is the nearest neighbor algorithm.

The relationship between a number of statistical pattern recognition methods is shown in Fig. 2.10. Moving from top to bottom and left to right, less information is available and, as a result, the difficulty of classification increases.

2. Nonmetric approaches (Chap. 3): decision trees, syntactic (or grammatical) methods, and rule-based classifiers.

It is natural and intuitive to classify a pattern by asking a series of questions, in which the next question depends on the answer to the previous question. This approach is particularly useful for nonmetric (or categorical) data, because the questions can be posed to elicit “yes/no” or “true/false” answers, although it can also be used with quantitative data. The sequence of questions can be displayed as a decision tree (Fig. 2.11), a tree structure which has decision nodes that ask a question, with a branch for each possible answer (outcome). These are connected until we reach a terminal or leaf node, which indicates the classification.

In the case of complex patterns, a pattern can be viewed as a hierarchical composite of simple sub-patterns which themselves are built from yet simpler
sub-patterns (Fu 1982). The simplest sub-patterns are called primitives, and the complex pattern is represented in terms of relationships between these primitives, in a way similar to the syntax of a language. The primitives are viewed as the alphabet of a language, and the patterns are sentences generated according to a certain grammar (i.e., set of rules) that is inferred from the available training samples. This approach is appealing in situations where patterns have a definite structure which can be coded by a set of rules (e.g., ECG waveforms, texture in images). However, difficulties in the segmentation of noisy images to detect the primitives, and in the inference of the grammar from training data, often impede its implementation. The syntactic approach may yield a combinatorial explosion of possibilities to be investigated, requiring large training sets and huge computational efforts (Perlovsky 1998).

3. Cognitive approaches, which include neural networks and support vector machines (SVMs).

Neural networks are based on the organization of the human brain, where nerve cells (neurons) are linked together by strands of fiber (axons). Neural networks are massively parallel computing systems consisting of a huge number of simple processors with many interconnections. They are able to learn complex non-linear input–output relationships and use sequential training procedures. However, in spite of the seemingly different underlying principles, most of the neural network models are implicitly similar to statistical pattern recognition methods (Anderson et al. 1990; Ripley 1993). It has been pointed out (Anderson et al. 1990) that “neural networks are statistics for amateurs ... Most NNs conceal the statistics from the user”.

Support vector machines, SVMs (Cristianini and Shawe-Taylor 2000), represent the training examples as points in p-dimensional space, mapped so that the examples of the different classes are separated by a (p − 1)-dimensional hyperplane, which is chosen to maximize the “margins” on either side of the hyperplane (Fig. 2.12).

Fig. 2.11 A decision tree for choosing a mode of transport, with decision nodes (Travel Cost/Km, Gender, Car Ownership) colored blue and leaf nodes (Bus, Train, Car) colored orange
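To make items 2 and 3 concrete, the following is a minimal sketch of both classifier families on a toy two-class dataset. The use of scikit-learn, and all of the data and parameter values, are assumptions for illustration; the chapter itself names no software.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Hypothetical 2-D feature vectors for two well-separated classes.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, (50, 2)),   # class 0
               rng.normal(4.0, 1.0, (50, 2))])  # class 1
y = np.array([0] * 50 + [1] * 50)

# A decision tree classifies by a sequence of threshold "questions"
# on individual features, as in Fig. 2.11.
tree = DecisionTreeClassifier(max_depth=3).fit(X, y)

# A linear SVM places the (p - 1)-dimensional separating hyperplane
# so that the margin on either side is maximized (H2 in Fig. 2.12).
svm = SVC(kernel="linear", C=1.0).fit(X, y)

print(tree.predict([[2.0, 2.0]]), svm.predict([[2.0, 2.0]]))
```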
Fig. 2.12 A linear SVM in 2D feature space. H1 separates the two classes with a small margin, but H2 separates them with the maximum margin (H3 doesn’t separate the two classes at all)

2.6 Examples

2.6.1 Classification by Shape

Figure 2.13a is an image containing both bolts and nuts, some of which lie on their sides. We should be able to distinguish (and therefore classify) the objects on the basis of their shape. The bolts are long, with an end piece, and the nuts either have a hole in them (the “face-on” nuts) or are short and linear (the “end-on” nuts). In this case, pre-processing was not required and automatic segmentation (using Otsu thresholding) produces a simplified, binary image (Fig. 2.13b).

The skeleton of this image shows the essential shape differences between the bolts and the two types of nut (Fig. 2.13c). A skeleton comprises pixels which can be distinguished on the basis of their connectivity to other pixels on the skeleton: end pixels (which have only one neighboring pixel on the skeleton), link pixels (which have two neighbors), and branch pixels (which have three neighbors). Because of their characteristic shape, only the skeletons of the bolts will have branch pixels. If the branch pixels are used as a seed image and conditionally dilated, with the seed image constrained to remain within the bounds of a mask image (the original binary image, Fig. 2.13b), then an image of the bolts alone results (Fig. 2.13d). The nuts can now be obtained (Fig. 2.13e) by logically combining this image with the original binary image (using Fig. 2.13b AND (NOT Fig. 2.13d)). The nuts and bolts can then be joined in a color-coded image (Fig. 2.13f), which presents the nuts and bolts in different pseudocolors.
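As a hedged sketch of how this shape-based pipeline might be reproduced in code, the steps below use scikit-image and SciPy (an assumed toolchain): Otsu thresholding, skeletonization, branch-pixel detection, and conditional dilation implemented as morphological reconstruction.

```python
import numpy as np
from scipy import ndimage as ndi
from skimage.filters import threshold_otsu
from skimage.morphology import skeletonize, reconstruction

def classify_by_shape(gray):
    """Return (bolts, nuts) masks for a grayscale nuts-and-bolts image."""
    # Otsu thresholding (Fig. 2.13b); assumes bright objects on a dark background.
    binary = gray > threshold_otsu(gray)

    # Skeletonize (Fig. 2.13c); branch pixels are skeleton pixels with
    # three (or more) skeleton neighbors in the surrounding 3x3 window.
    skel = skeletonize(binary)
    neighbors = ndi.convolve(skel.astype(int), np.ones((3, 3), int)) - skel
    branches = skel & (neighbors >= 3)

    # Conditional dilation of the branch pixels, constrained to remain
    # within the binary mask, is morphological reconstruction by
    # dilation (Fig. 2.13d).
    bolts = reconstruction(branches.astype(np.uint8), binary.astype(np.uint8),
                           method="dilation").astype(bool)

    # Fig. 2.13e: nuts = binary AND (NOT bolts).
    nuts = binary & ~bolts
    return bolts, nuts
```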
Fig. 2.13 (a) Original image, (b) after Otsu thresholding, (c) after subsequent skeletonization, (d) after conditionally dilating the branch pixels from (c), (e) after logically combining (b) and (d), (f) color coding the nuts and bolts

2.6.2 Classification by Size

An alternative approach to separating the nuts and bolts involves measuring different feature properties, such as the area, perimeter, or length. If we measure the area of the labeled objects in the segmented image (Fig. 2.14a) by counting the pixels belonging to each label and plot these values in one dimension (Fig. 2.14b), then we can see that the nuts and bolts are well discriminated on the basis of area, with the bolts having larger areas. There are three clusters, comprising the bolts with the highest areas, followed by the face-on nuts with intermediate areas, and the edge-on nuts with the lowest areas. If the objects are then re-labeled with their area values (Fig. 2.14c), that image (or the “area” feature) can be thresholded to show just the bolts (Fig. 2.14d): in this particular case, a threshold of 800 (viz., an area of 800 pixels) would work well, although auto-thresholding, using either the isodata (Dubes and Jain 1976) or Otsu (Otsu 1979) algorithm, for example, is preferable since that will preserve generality. The nuts can then be found by logically combining this image with the segmented nuts-and-bolts image as before. Only one feature (area) is used to discriminate the two classes; that is, the feature space (Fig. 2.14b) is one-dimensional.
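A corresponding sketch of the size-based route, under the same assumed toolchain. The threshold of 800 pixels is the value quoted above; in practice it could be replaced by an automatic threshold (e.g., Otsu applied to the per-object areas) to preserve generality, as the text recommends.

```python
import numpy as np
from scipy import ndimage as ndi

def classify_by_size(binary, area_threshold=800):
    """Split a binary nuts-and-bolts image into bolts (large) and nuts (small)."""
    labels, n = ndi.label(binary)        # Fig. 2.14a: labeled objects
    areas = np.bincount(labels.ravel())  # pixel count per label
    areas[0] = 0                         # ignore the background label
    area_image = areas[labels]           # Fig. 2.14c: objects "painted" with area
    bolts = area_image > area_threshold  # Fig. 2.14d: threshold at 800
    nuts = binary & ~bolts
    return bolts, nuts
```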
Fig. 2.14 (a) Segmented, labeled image (using Fig. 2.13a), (b) one-dimensional feature space showing the areas of the features, (c) the features “painted” with grayscales representing their measured areas, and (d) after thresholding image (c) at a value of 800

The two alternative measures, shape and size, could be tested for robustness on other images of nuts and bolts to see which performs better.

2.6.3 More Examples

Figure 2.15a is an image containing a number of electronic components of different shapes and sizes (the transistors are three-legged, the thyristors are three-legged with a hole in them, the electrolytic capacitors have a round body with two legs, and the ceramic capacitors are larger than the resistors). A combination of the shape and size methods can be used to separate the objects into their different classes (Fig. 2.15b); a sketch of such a combined feature vector is given below.

Similar techniques have been used to classify the fruit in Fig. 2.16 into three different classes. Think about the features that would be most discriminating in this case.
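For exercises like the electronic components or the fruit, computing one feature vector per object is one plausible route. The sketch below (same assumed toolchain; the choice of area plus skeleton branch-pixel count is a hypothetical feature set) produces vectors that could be fed to any of the classifiers from Sect. 2.5, such as the decision tree sketched earlier.

```python
import numpy as np
from scipy import ndimage as ndi
from skimage.morphology import skeletonize

def object_features(binary):
    """Per-object feature vectors: [area, number of skeleton branch pixels]."""
    labels, n = ndi.label(binary)
    features = []
    for i in range(1, n + 1):
        obj = labels == i
        skel = skeletonize(obj)
        nb = ndi.convolve(skel.astype(int), np.ones((3, 3), int)) - skel
        branch_pixels = int((skel & (nb >= 3)).sum())
        features.append([int(obj.sum()), branch_pixels])
    return np.array(features)  # shape (n_objects, 2): a 2-D feature space
```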