weight on the connection between the bias in layer 1 and the second node in layer 2 is given by $b_2^{(1)}$. Remember, these values – $w_{ij}^{(1)}$ and $b_i^{(l)}$ – all need to be calculated in the training phase of the ANN.

Finally, the node output notation is $h_j^{(l)}$, where $j$ denotes the node number in layer $l$ of the network. As can be observed in the three layer network above, the output of node 2 in layer 2 has the notation $h_2^{(2)}$.

Now that we have the notation sorted out, it is time to look at how you calculate the output of the network when the input and the weights are known. The process of calculating the output of the neural network given these values is called the feed-forward pass or process.

1.3 THE FEED-FORWARD PASS

To demonstrate how to calculate the output from the input in neural networks, let's start with the specific case of the three layer neural network that was presented above. Below it is presented in equation form, then it will be demonstrated with a concrete example and some Python code:

$$h_1^{(2)} = f(w_{11}^{(1)} x_1 + w_{12}^{(1)} x_2 + w_{13}^{(1)} x_3 + b_1^{(1)})$$
$$h_2^{(2)} = f(w_{21}^{(1)} x_1 + w_{22}^{(1)} x_2 + w_{23}^{(1)} x_3 + b_2^{(1)})$$
$$h_3^{(2)} = f(w_{31}^{(1)} x_1 + w_{32}^{(1)} x_2 + w_{33}^{(1)} x_3 + b_3^{(1)})$$
$$h_{W,b}(x) = h_1^{(3)} = f(w_{11}^{(2)} h_1^{(2)} + w_{12}^{(2)} h_2^{(2)} + w_{13}^{(2)} h_3^{(2)} + b_1^{(2)})$$

In the equations above, $f(\cdot)$ refers to the node activation function, in this case the sigmoid function. The first line, $h_1^{(2)}$, is the output of the first node in the second layer, and its inputs are $w_{11}^{(1)} x_1$, $w_{12}^{(1)} x_2$, $w_{13}^{(1)} x_3$ and $b_1^{(1)}$. These inputs can be traced in the three-layer connection diagram above. They are simply summed and then passed through the activation function to calculate the output of the first node. Likewise for the other two nodes in the second layer.

The final line is the output of the only node in the third and final layer, which is the ultimate output of the neural network. As can be observed, rather than taking the weighted input variables ($x_1$, $x_2$, $x_3$), the final node takes as input the weighted outputs of the nodes of the second layer ($h_1^{(2)}$, $h_2^{(2)}$, $h_3^{(2)}$), plus the weighted bias. Therefore, you can see in equation form the hierarchical nature of artificial neural networks.
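For reference, the sigmoid activation used throughout this example squashes any summed input into the range $(0, 1)$:

$$f(z) = \frac{1}{1 + e^{-z}}$$

This is the same function that will be implemented as the Python routine f(x) in the code below.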
1.3.1 A feed-forward example

Now, let's do a simple first example of the output of this neural network in Python. First things first, notice that the weights between layer 1 and 2 ($w_{11}^{(1)}, w_{12}^{(1)}, \ldots$) are ideally suited to matrix representation (check out this link to brush up on matrices). Observe:

$$W^{(1)} = \begin{pmatrix} w_{11}^{(1)} & w_{12}^{(1)} & w_{13}^{(1)} \\ w_{21}^{(1)} & w_{22}^{(1)} & w_{23}^{(1)} \\ w_{31}^{(1)} & w_{32}^{(1)} & w_{33}^{(1)} \end{pmatrix}$$

import numpy as np
w1 = np.array([[0.2, 0.2, 0.2], [0.4, 0.4, 0.4], [0.6, 0.6, 0.6]])

If you're not sure about how numpy arrays work, check out the documentation here. Here I have just filled up the layer 1 weight array with some example weights. We can do the same for the layer 2 weight array:

$$W^{(2)} = \begin{pmatrix} w_{11}^{(2)} & w_{12}^{(2)} & w_{13}^{(2)} \end{pmatrix}$$

w2 = np.zeros((1, 3))
w2[0, :] = np.array([0.5, 0.5, 0.5])

We can also set up some dummy values in the layer 1 bias weight array/vector, and the layer 2 bias weight (which is only a single value in this neural network structure – i.e. a scalar):

b1 = np.array([0.8, 0.8, 0.8])
b2 = np.array([0.2])

Finally, before we write the main program to calculate the output from the neural network, it's handy to set up a separate Python function for the activation function:

def f(x):
    return 1 / (1 + np.exp(-x))

1.3.2 Our first attempt at a feed-forward function

Below is a simple way of calculating the output of the neural network, using nested loops in Python. We'll look at more efficient ways of calculating the output shortly.
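As a quick sanity check (a minimal sketch using the arrays just defined), you can confirm that the array shapes match the network structure – a 3×3 weight matrix and 3-element bias vector feeding the hidden layer, and a 1×3 weight matrix and single bias feeding the output layer:

print(w1.shape, b1.shape)  # (3, 3) (3,)
print(w2.shape, b2.shape)  # (1, 3) (1,)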
def simple_looped_nn_calc(n_layers, x, w, b):
    for l in range(n_layers - 1):
        # Setup the input array which the weights will be multiplied by for each layer
        # If it's the first layer, the input array will be the x input vector
        # If it's not the first layer, the input to the next layer will be the
        # output of the previous layer
        if l == 0:
            node_in = x
        else:
            node_in = h
        # Setup the output array for the nodes in layer l + 1
        h = np.zeros((w[l].shape[0],))
        # loop through the rows of the weight array
        for i in range(w[l].shape[0]):
            # setup the sum inside the activation function
            f_sum = 0
            # loop through the columns of the weight array
            for j in range(w[l].shape[1]):
                f_sum += w[l][i][j] * node_in[j]
            # add the bias
            f_sum += b[l][i]
            # finally use the activation function to calculate the
            # i-th output i.e. h1, h2, h3
            h[i] = f(f_sum)
    return h

This function takes as input the number of layers in the neural network, the x input array/vector, then Python tuples or lists of the weights and bias weights of the network, with each element in the tuple/list representing a layer $l$ in the network. In other words, the inputs are set up as follows:

w = [w1, w2]
b = [b1, b2]
# a dummy x input vector
x = [1.5, 2.0, 3.0]

The function first checks what the input is to the layer of nodes/weights being considered. If we are looking at the first layer, the input to the second layer nodes is the input vector $x$ multiplied by the relevant weights. After the first layer, though, the inputs to subsequent layers are the outputs of the previous layers. Finally, there is a nested loop through the relevant $i$ and $j$ values of the weight arrays and the bias. The function uses the dimensions of the weights for each layer to figure out the number of nodes and therefore the structure of the network.
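To make that last point concrete, here is a small illustrative snippet (not part of the original listing) showing how the weight dimensions encode the structure – each w[l] has one row per node in layer l + 2 and one column per input coming from layer l + 1:

for l in range(len(w)):
    rows, cols = w[l].shape
    print("w[{}]: {} nodes, each taking {} inputs".format(l, rows, cols))

Running this on the example network prints 3 nodes of 3 inputs for the hidden layer and 1 node of 3 inputs for the output layer.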
Calling the function:

simple_looped_nn_calc(3, x, w, b)

gives the output of 0.8354. We can confirm this result by manually performing the calculations in the original equations:

$$h_1^{(2)} = f(0.2 \times 1.5 + 0.2 \times 2.0 + 0.2 \times 3.0 + 0.8) = 0.8909$$
$$h_2^{(2)} = f(0.4 \times 1.5 + 0.4 \times 2.0 + 0.4 \times 3.0 + 0.8) = 0.9677$$
$$h_3^{(2)} = f(0.6 \times 1.5 + 0.6 \times 2.0 + 0.6 \times 3.0 + 0.8) = 0.9909$$
$$h_{W,b}(x) = h_1^{(3)} = f(0.5 \times 0.8909 + 0.5 \times 0.9677 + 0.5 \times 0.9909 + 0.2) = 0.8354$$

1.3.3 A more efficient implementation

As was stated earlier, using loops isn't the most efficient way of calculating the feed-forward step in Python, because loops in Python are notoriously slow. An alternative, more efficient mechanism of doing the feed-forward step in Python and numpy will be discussed shortly. We can benchmark how efficient the algorithm is by using the %timeit function in IPython, which runs the function a number of times and returns the average time that the function takes to run:

%timeit simple_looped_nn_calc(3, x, w, b)

Running this tells us that the looped feed-forward pass takes around 40μs. A result in the tens of microseconds sounds very fast, but when applied to very large practical NNs with 100s of nodes per layer, this speed will become prohibitive, especially when training the network, as will become clear later in this tutorial. If we try a four layer neural network using the same code, we get significantly worse performance – 70μs in fact.

1.3.4 Vectorisation in neural networks

There is a way to write the equations even more compactly, and to calculate the feed-forward process in neural networks more efficiently, from a computational perspective. Firstly, we can introduce a new variable $z_i^{(l)}$, which is the summated input into node $i$ of layer $l$, including the bias term. So in the case of the first node in layer 2, $z$ is equal to:

$$z_1^{(2)} = w_{11}^{(1)} x_1 + w_{12}^{(1)} x_2 + w_{13}^{(1)} x_3 + b_1^{(1)} = \sum_{j=1}^{n} w_{1j}^{(1)} x_j + b_1^{(1)}$$

where $n$ is the number of nodes in layer 1. Using this notation, the unwieldy previous set of equations for the example three layer network can be reduced to:
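$$z^{(2)} = W^{(1)} x + b^{(1)}$$
$$h^{(2)} = f(z^{(2)})$$
$$z^{(3)} = W^{(2)} h^{(2)} + b^{(2)}$$
$$h_{W,b}(x) = h^{(3)} = f(z^{(3)})$$

Here $W^{(l)}$ is the weight matrix and $b^{(l)}$ the bias vector for layer $l$, and $f(\cdot)$ is applied element-wise. More generally, the step between any two layers is $z^{(l+1)} = W^{(l)} h^{(l)} + b^{(l)}$ followed by $h^{(l+1)} = f(z^{(l+1)})$.

A vectorised implementation follows directly from these matrix equations. The sketch below (the function name matrix_feed_forward_calc is illustrative, not from the original listing) replaces the nested loops with numpy dot products; it reuses the f(x) activation function defined earlier:

def matrix_feed_forward_calc(n_layers, x, w, b):
    # a minimal vectorised sketch: each layer is a single
    # matrix-vector product followed by the activation function
    h = x
    for l in range(n_layers - 1):
        z = w[l].dot(h) + b[l]  # z(l+1) = W(l) h(l) + b(l)
        h = f(z)                # h(l+1) = f(z(l+1))
    return h

Calling matrix_feed_forward_calc(3, x, w, b) on the same inputs returns the same 0.8354 output, and timing it with %timeit should show a clear speed-up over the looped version – which is the whole point of the vectorisation introduced above.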