1.1 What are neural networks?

Let us commence with a provisional definition of what is meant by a "neural network" and follow with simple, working explanations of some of the key terms in the definition.

A neural network is an interconnected assembly of simple processing elements, units or nodes, whose functionality is loosely based on the animal neuron. The processing ability of the network is stored in the interunit connection strengths, or weights, obtained by a process of adaptation to, or learning from, a set of training patterns.

To flesh this out a little we first take a quick look at some basic neurobiology. The human brain consists of an estimated 10¹¹ (100 billion) nerve cells or neurons, a highly stylized example of which is shown in Figure 1.1. Neurons communicate via electrical signals that are short-lived impulses or "spikes" in the voltage of the cell wall or membrane. The interneuron connections are mediated by electrochemical junctions called synapses, which are located on branches of the cell referred to as dendrites. Each neuron typically receives many thousands of connections from other neurons and is therefore constantly receiving a multitude of incoming signals, which eventually reach the cell body. Here, they are integrated or summed together in some way and, roughly speaking, if the resulting signal exceeds some threshold then the neuron will "fire" or generate a voltage impulse in response. This is then transmitted to other neurons via a branching fibre known as the axon.

[Figure 1.1: Essential components of a neuron shown in stylized form.]

In determining whether an impulse should be produced or not, some incoming signals produce an inhibitory effect and tend to prevent firing, while others are excitatory and promote impulse generation. The distinctive processing ability of each neuron is then supposed to reside in the type (excitatory or inhibitory) and strength of its synaptic connections with other neurons.
It is this architecture and style of processing that we hope to incorporate in neural networks and, because of the emphasis on the importance of the interneuron connections, this type of system is sometimes referred to as being connectionist and the study of this general approach as connectionism. This terminology is often the one encountered for neural networks in the context of psychologically inspired models of human cognitive function. However, we will use it quite generally to refer to neural networks without reference to any particular field of application.

The artificial equivalents of biological neurons are the nodes or units in our preliminary definition and a prototypical example is shown in Figure 1.2. Synapses are modelled by a single number or weight so that each input is multiplied by a weight before being sent to the equivalent of the cell body. Here, the weighted signals are summed together by simple arithmetic addition to supply a node activation. In the type of node shown in Figure 1.2, the so-called threshold logic unit (TLU), the activation is then compared with a threshold; if the activation exceeds the threshold, the unit produces a high-valued output (conventionally "1"), otherwise it outputs zero. In the figure, the size of signals is represented by the width of their corresponding arrows, weights are shown by multiplication symbols in circles, and their values are supposed to be proportional to the symbol's size; only positive weights have been used. The TLU is the simplest (and historically the earliest (McCulloch & Pitts 1943)) model of an artificial neuron.

[Figure 1.2: Simple artificial neuron.]
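To make this concrete, here is a minimal sketch of a TLU in Python; the function name and the particular weight and threshold values are illustrative assumptions of ours, not anything specified in the text:

```python
# Minimal sketch of a threshold logic unit (TLU).
# Each input is multiplied by its weight, the weighted signals are summed
# by simple addition to give the activation, and the activation is then
# compared with a threshold.
def tlu(inputs, weights, threshold):
    activation = sum(x * w for x, w in zip(inputs, weights))
    return 1 if activation > threshold else 0

# Illustrative values: two inputs, positive weights, threshold 0.5.
print(tlu([1, 0], [0.4, 0.3], 0.5))  # activation 0.4, below threshold -> 0
print(tlu([1, 1], [0.4, 0.3], 0.5))  # activation 0.7, above threshold -> 1
```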
The term "network" will be used to refer to any system of artificial neurons. This may range from something as simple as a single node to a large collection of nodes in which each one is connected to every other node in the net. One type of network is shown in Figure 1.3. Each node is now shown by only a circle but weights are implicit on all connections. The nodes are arranged in a layered structure in which each signal emanates from an input and passes via two nodes before reaching an output beyond which it is no longer transformed. This feedforward structure is only one of several available and is typically used to place an input pattern into one of several classes according to the resulting pattern of outputs. For example, if the input consists of an encoding of the patterns of light and dark in an image of handwritten letters, the output layer (topmost in the figure) may contain 26 nodes, one for each letter of the alphabet, to flag which letter class the input character is from. This would be done by allocating one output node per class and requiring that only one such node fires whenever a pattern of the corresponding class is supplied at the input.

[Figure 1.3: Simple example of a neural network.]

So much for the basic structural elements and their operation. Returning to our working definition, notice the emphasis on learning from experience. In real neurons the synaptic strengths may, under certain circumstances, be modified so that the behaviour of each neuron can change or adapt to its particular stimulus input. In artificial neurons the equivalent of this is the modification of the weight values. In terms of processing information, there are no computer programs here; the "knowledge" the network has is supposed to be stored in its weights, which evolve by a process of adaptation to stimulus from a set of pattern examples.

In one training paradigm called supervised learning, used in conjunction with nets of the type shown in Figure 1.3, an input pattern is presented to the net and its response then compared with a target output. In terms of our previous letter recognition example, an "A", say, may be input and the network output compared with the classification code for A. The difference between the two patterns of output then determines how the weights are altered. Each particular recipe for change constitutes a learning rule, details of which form a substantial part of subsequent chapters. When the required weight updates have been made another pattern is presented, the output compared with the target, and new changes made. This sequence of events is repeated iteratively many times until (hopefully) the network's behaviour converges so that its response to each pattern is close to the corresponding target. The process as a whole, including any ordering of pattern presentation, criteria for terminating the process, etc., constitutes the training algorithm.
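As a preview of what such a recipe can look like, here is a minimal Python sketch that trains a single TLU with a simple error-driven update (a perceptron-style rule; one possible learning rule among many, not a method prescribed by the text). The function name, learning rate, epoch limit and example data are our own illustrative assumptions:

```python
# Sketch of a supervised training loop for a single TLU, using an
# error-driven (perceptron-style) weight update. Learning rate, epoch
# limit and data are illustrative assumptions, not taken from the text.
def train_tlu(patterns, targets, n_inputs, rate=0.1, epochs=100):
    weights = [0.0] * n_inputs
    threshold = 0.0
    for _ in range(epochs):
        all_correct = True
        for inputs, target in zip(patterns, targets):
            # Present a pattern and compute the net's response.
            activation = sum(x * w for x, w in zip(inputs, weights))
            output = 1 if activation > threshold else 0
            error = target - output  # difference between target and response
            if error != 0:
                all_correct = False
                # Alter each weight in proportion to the error and the input.
                weights = [w + rate * error * x for w, x in zip(weights, inputs)]
                threshold -= rate * error  # the threshold adapts as well
        if all_correct:  # one simple termination criterion
            break
    return weights, threshold

# Example: learn the logical AND of two inputs.
patterns = [(0, 0), (0, 1), (1, 0), (1, 1)]
targets = [0, 0, 0, 1]
weights, threshold = train_tlu(patterns, targets, n_inputs=2)
```

The termination criterion used here, stopping once every training pattern is classified correctly, is just one simple choice; the point of the text is that the whole cycle of present, compare and adjust, repeated over the training set, constitutes the training algorithm.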
What happens if, after training, we present the network with a pattern it hasn't seen before? If the net has learned the underlying structure of the problem domain then it should classify the unseen pattern correctly and the net is said to generalize well. If the net does not have this property it is little more than a classification lookup table for the training set and is of little practical use. Good generalization is therefore one of the key properties of neural networks.
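As a small illustration of the idea, the hypothetical train_tlu sketch above can be trained on patterns drawn from a simple underlying rule and then tested on inputs it has never seen. The rule (output 1 when the single input exceeds 0.5) and the data are made-up assumptions:

```python
# Hypothetical generalization check: the underlying rule is "output 1
# when the input exceeds 0.5", but the net only ever sees these points.
train_patterns = [(0.0,), (0.1,), (0.2,), (0.3,), (0.7,), (0.8,), (0.9,), (1.0,)]
train_targets = [0, 0, 0, 0, 1, 1, 1, 1]
weights, threshold = train_tlu(train_patterns, train_targets, n_inputs=1)

# Unseen inputs well away from the boundary: a net that has captured the
# underlying structure classifies these correctly despite never having
# been trained on them. Inputs near the boundary (say 0.45) are less
# certain, since the training data only pin the boundary down to
# somewhere between 0.3 and 0.7.
for x in [(0.05,), (0.95,)]:
    activation = sum(xi * wi for xi, wi in zip(x, weights))
    print(x, 1 if activation > threshold else 0)  # expect 0, then 1
```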
1.2 Why study neural networks?

This question is pertinent here because, depending on one's motive, the study of connectionism can take place from differing perspectives. It also helps to know what questions we are trying to answer in order to avoid the kind of religious wars that sometimes break out when the words "connectionism" or "neural network" are mentioned.

Neural networks are often used for statistical analysis and data modelling, in which their role is perceived as an alternative to standard nonlinear regression or cluster analysis techniques (Cheng & Titterington 1994). Thus, they are typically used in problems that may be couched in terms of classification or forecasting. Some examples include image and speech recognition, textual character recognition, and domains of human expertise such as medical diagnosis, geological survey for oil, and financial market indicator prediction. This type of problem also falls within the domain of classical artificial intelligence (AI), so that engineers and computer scientists see neural nets as offering a style of parallel distributed computing, thereby providing an alternative to the conventional algorithmic techniques that have dominated in machine intelligence. This is a theme pursued further in the final chapter but, by way of a brief explanation of this term now, the parallelism refers to the fact that each node is conceived of as operating independently of, and concurrently (in parallel) with, the others, and the "knowledge" in the network is distributed over the entire set of weights, rather than focused in a few memory locations as in a conventional computer. The practitioners in this area do not concern themselves with biological realism and are often motivated by the ease of implementing solutions in digital hardware or the efficiency and accuracy of particular techniques. Haykin (1994) gives a comprehensive survey of many neural network techniques from an engineering perspective.

Neuroscientists and psychologists are interested in nets as computational models of the animal brain developed by abstracting what are believed to be those properties of real nervous tissue that are essential for information processing. The artificial neurons that connectionist models use are often extremely simplified versions of their biological counterparts and many neuroscientists are sceptical about the ultimate power of these impoverished models, insisting that more detail is necessary to explain the brain's function. Only time will tell but, by drawing on knowledge about how real neurons are interconnected as local "circuits", substantial inroads have been made in modelling brain functionality. A good introduction to this programme of computational neuroscience is given by Churchland & Sejnowski (1992).

Finally, physicists and mathematicians are drawn to the study of networks from an