Section 2.3 Axioms of Probability

That is, P(E) is defined as the (limiting) proportion of time that E occurs. It is thus the limiting frequency of E.

Although the preceding definition is certainly intuitively pleasing and should always be kept in mind by the reader, it possesses a serious drawback: How do we know that n(E)/n will converge to some constant limiting value that will be the same for each possible sequence of repetitions of the experiment? For example, suppose that the experiment to be repeatedly performed consists of flipping a coin. How do we know that the proportion of heads obtained in the first n flips will converge to some value as n gets large? Also, even if it does converge to some value, how do we know that, if the experiment is repeatedly performed a second time, we shall obtain the same limiting proportion of heads?

Proponents of the relative frequency definition of probability usually answer this objection by stating that the convergence of n(E)/n to a constant limiting value is an assumption, or an axiom, of the system. However, to assume that n(E)/n will necessarily converge to some constant value seems to be an extraordinarily complicated assumption. For, although we might indeed hope that such a constant limiting frequency exists, it does not at all seem to be a priori evident that this need be the case. In fact, would it not be more reasonable to assume a set of simpler and more self-evident axioms about probability and then attempt to prove that such a constant limiting frequency does in some sense exist?

The latter approach is the modern axiomatic approach to probability theory that we shall adopt in this text. In particular, we shall assume that, for each event E in the sample space S, there exists a value P(E), referred to as the probability of E. We shall then assume that all these probabilities satisfy a certain set of axioms, which, we hope the reader will agree, is in accordance with our intuitive notion of probability.
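The stabilizing behavior of n(E)/n that this discussion appeals to is easy to observe numerically. The following Python sketch (an illustration, not part of the text; the function name `relative_frequency` and the seed are our own choices) flips a simulated fair coin n times and reports the proportion of heads for increasing n. In any single seeded run the proportion settles down near one value, though, as the text stresses, it is the axioms below, not such experiments, that justify this.

```python
import random

def relative_frequency(n, seed=1):
    """Return n(E)/n for E = {heads} after n simulated fair-coin flips."""
    rng = random.Random(seed)  # seeded so the run is reproducible
    heads = sum(rng.random() < 0.5 for _ in range(n))
    return heads / n

# The proportion of heads settles down as n grows.
for n in (10, 1_000, 100_000):
    print(n, relative_frequency(n))
```

Running this with several different seeds mimics "repeatedly performing the experiment a second time": each run stabilizes, and all runs stabilize near the same value.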
Consider an experiment whose sample space is S. For each event E of the sample space S, we assume that a number P(E) is defined and satisfies the following three axioms:

Axiom 1
0 ≤ P(E) ≤ 1

Axiom 2
P(S) = 1

Axiom 3
For any sequence of mutually exclusive events E1, E2, ... (that is, events for which EiEj = Ø when i ≠ j),

P(∪∞i=1 Ei) = ∑∞i=1 P(Ei)

We refer to P(E) as the probability of the event E.

Thus, Axiom 1 states that the probability that the outcome of the experiment is an outcome in E is some number between 0 and 1. Axiom 2 states that, with probability 1, the outcome will be a point in the sample space S. Axiom 3 states that, for any sequence of mutually exclusive events, the probability of at least one of these events occurring is just the sum of their respective probabilities.
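For a finite sample space, these axioms can be checked mechanically. Below is a minimal Python sketch (our own illustration; the name `satisfies_axioms` is hypothetical): a probability assignment is given as a dictionary of single-outcome probabilities, and since the single-outcome events are mutually exclusive, Axiom 3 forces P of any event to be the sum of its outcomes' probabilities, so the checks reduce to Axiom 1 on each atom and Axiom 2 on their total.

```python
from fractions import Fraction

def satisfies_axioms(atom_probs):
    """Check Axioms 1 and 2 (with Axiom 3 applied to the mutually
    exclusive single-outcome events) for a finite sample space.
    atom_probs maps each outcome s to P({s})."""
    in_range = all(0 <= p <= 1 for p in atom_probs.values())  # Axiom 1
    total_one = sum(atom_probs.values()) == 1                 # Axiom 2 via additivity
    return in_range and total_one

fair_die = {s: Fraction(1, 6) for s in range(1, 7)}
bad = {1: Fraction(1, 2), 2: Fraction(1, 3)}  # atoms sum to 5/6, not 1
```

Using `Fraction` rather than floats keeps the sums exact, so the Axiom 2 equality test is not disturbed by rounding.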
If we consider a sequence of events E1, E2, ..., where E1 = S and Ei = Ø for i > 1, then, because the events are mutually exclusive and because S = ∪∞i=1 Ei, we have, from Axiom 3,

P(S) = ∑∞i=1 P(Ei) = P(S) + ∑∞i=2 P(Ø)

implying that

P(Ø) = 0

That is, the null event has probability 0 of occurring.

Note that it follows that, for any finite sequence of mutually exclusive events E1, E2, ..., En,

P(∪ni=1 Ei) = ∑ni=1 P(Ei)     (3.1)

This equation follows from Axiom 3 by defining Ei as the null event for all values of i greater than n. Axiom 3 is equivalent to Equation (3.1) when the sample space is finite. (Why?) However, the added generality of Axiom 3 is necessary when the sample space consists of an infinite number of points.

EXAMPLE 3a
If our experiment consists of tossing a coin and if we assume that a head is as likely to appear as a tail, then we would have

P({H}) = P({T}) = 1/2

On the other hand, if the coin were biased and we felt that a head were twice as likely to appear as a tail, then we would have

P({H}) = 2/3,  P({T}) = 1/3

EXAMPLE 3b
If a die is rolled and we suppose that all six sides are equally likely to appear, then we would have P({1}) = P({2}) = P({3}) = P({4}) = P({5}) = P({6}) = 1/6. From Axiom 3, it would thus follow that the probability of rolling an even number would equal

P({2, 4, 6}) = P({2}) + P({4}) + P({6}) = 1/2

The assumption of the existence of a set function P, defined on the events of a sample space S and satisfying Axioms 1, 2, and 3, constitutes the modern mathematical approach to probability theory. Hopefully, the reader will agree that the axioms are natural and in accordance with our intuitive concept of probability as related to chance and randomness. Furthermore, using these axioms we shall be able to prove that if an experiment is repeated over and over again, then, with probability 1, the proportion of time during which any specific event E occurs will equal P(E).
This result, known as the strong law of large numbers, is presented in Chapter 8. In addition, we present another possible interpretation of probability—as being a measure of belief—in Section 2.7.
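The computations in Examples 3a and 3b are instances of finite additivity, Equation (3.1): on a finite sample space, the probability of an event is the sum of the probabilities of its individual outcomes. A minimal Python sketch of this (our own illustration; the helper `P` and the dictionary `atom` are hypothetical names, not anything defined in the text):

```python
from fractions import Fraction

# Fair die of Example 3b: each side has probability 1/6.
atom = {s: Fraction(1, 6) for s in range(1, 7)}

def P(event):
    """P(E) as the sum of atom probabilities, per Equation (3.1)."""
    return sum(atom[s] for s in event)

even = {2, 4, 6}
print(P(even))  # 1/2, matching P({2,4,6}) = P({2}) + P({4}) + P({6})
```

The same two lines, with `atom = {"H": Fraction(2, 3), "T": Fraction(1, 3)}`, reproduce the biased coin of Example 3a.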
Technical Remark. We have supposed that P(E) is defined for all the events E of the sample space. Actually, when the sample space is an uncountably infinite set, P(E) is defined only for a class of events called measurable. However, this restriction need not concern us, as all events of any practical interest are measurable.

2.4 SOME SIMPLE PROPOSITIONS

In this section, we prove some simple propositions regarding probabilities. We first note that, since E and Eᶜ are always mutually exclusive and since E ∪ Eᶜ = S, we have, by Axioms 2 and 3,

1 = P(S) = P(E ∪ Eᶜ) = P(E) + P(Eᶜ)

Or, equivalently, we have Proposition 4.1.

Proposition 4.1.
P(Eᶜ) = 1 − P(E)

In words, Proposition 4.1 states that the probability that an event does not occur is 1 minus the probability that it does occur. For instance, if the probability of obtaining a head on the toss of a coin is 3/8, then the probability of obtaining a tail must be 5/8.

Our second proposition states that if the event E is contained in the event F, then the probability of E is no greater than the probability of F.

Proposition 4.2. If E ⊂ F, then P(E) ≤ P(F).

Proof. Since E ⊂ F, it follows that we can express F as

F = E ∪ EᶜF

Hence, because E and EᶜF are mutually exclusive, we obtain, from Axiom 3,

P(F) = P(E) + P(EᶜF)

which proves the result, since P(EᶜF) ≥ 0.

Proposition 4.2 tells us, for instance, that the probability of rolling a 1 with a die is less than or equal to the probability of rolling an odd value with the die.

The next proposition gives the relationship between the probability of the union of two events, expressed in terms of the individual probabilities, and the probability of the intersection of the events.

Proposition 4.3.
P(E ∪ F) = P(E) + P(F) − P(EF)

Proof. To derive a formula for P(E ∪ F), we first note that E ∪ F can be written as the union of the two disjoint events E and EᶜF.
Thus, from Axiom 3, we obtain

P(E ∪ F) = P(E ∪ EᶜF) = P(E) + P(EᶜF)

Furthermore, since F = EF ∪ EᶜF, we again obtain from Axiom 3

P(F) = P(EF) + P(EᶜF)
or, equivalently,

P(EᶜF) = P(F) − P(EF)

thereby completing the proof.

[Figure 2.4: Venn diagram of events E and F. Figure 2.5: Venn diagram of E ∪ F divided into sections I, II, and III.]

Proposition 4.3 could also have been proved by making use of the Venn diagram in Figure 2.4. Let us divide E ∪ F into three mutually exclusive sections, as shown in Figure 2.5. In words, section I represents all the points in E that are not in F (that is, EFᶜ), section II represents all points both in E and in F (that is, EF), and section III represents all points in F that are not in E (that is, EᶜF).

From Figure 2.5, we see that

E ∪ F = I ∪ II ∪ III
E = I ∪ II
F = II ∪ III

As I, II, and III are mutually exclusive, it follows from Axiom 3 that

P(E ∪ F) = P(I) + P(II) + P(III)
P(E) = P(I) + P(II)
P(F) = P(II) + P(III)

which shows that

P(E ∪ F) = P(E) + P(F) − P(II)

and Proposition 4.3 is proved, since II = EF.

EXAMPLE 4a
J is taking two books along on her holiday vacation. With probability .5, she will like the first book; with probability .4, she will like the second book; and with probability .3, she will like both books. What is the probability that she likes neither book?
Solution. Let Bi denote the event that J likes book i, i = 1, 2. Then the probability that she likes at least one of the books is

P(B1 ∪ B2) = P(B1) + P(B2) − P(B1B2) = .5 + .4 − .3 = .6

Because the event that J likes neither book is the complement of the event that she likes at least one of them, we obtain the result

P(B1ᶜB2ᶜ) = P((B1 ∪ B2)ᶜ) = 1 − P(B1 ∪ B2) = .4

We may also calculate the probability that any one of the three events E, F, and G occurs, namely,

P(E ∪ F ∪ G) = P[(E ∪ F) ∪ G]

which, by Proposition 4.3, equals

P(E ∪ F) + P(G) − P[(E ∪ F)G]

Now, it follows from the distributive law that the events (E ∪ F)G and EG ∪ FG are equivalent; hence, from the preceding equations, we obtain

P(E ∪ F ∪ G)
= P(E) + P(F) − P(EF) + P(G) − P(EG ∪ FG)
= P(E) + P(F) − P(EF) + P(G) − P(EG) − P(FG) + P(EGFG)
= P(E) + P(F) + P(G) − P(EF) − P(EG) − P(FG) + P(EFG)

In fact, the following proposition, known as the inclusion–exclusion identity, can be proved by mathematical induction:

Proposition 4.4.
P(E1 ∪ E2 ∪ ··· ∪ En) = ∑i P(Ei) − ∑i1<i2 P(Ei1Ei2) + ··· + (−1)^(r+1) ∑i1<i2<···<ir P(Ei1Ei2 ··· Eir) + ··· + (−1)^(n+1) P(E1E2 ··· En)

The summation ∑i1<i2<···<ir P(Ei1Ei2 ··· Eir) is taken over all of the (n choose r) possible subsets of size r of the set {1, 2, ..., n}. In words, Proposition 4.4 states that the probability of the union of n events equals the sum of the probabilities of these events taken one at a time, minus the sum of the probabilities of these events taken two at a time, plus the sum of the probabilities of these events taken three at a time, and so on.

Remarks. 1. For a noninductive argument for Proposition 4.4, note first that if an outcome of the sample space is not a member of any of the sets Ei, then its probability does not contribute anything to either side of the equality. Now, suppose that an outcome is in exactly m of the events Ei, where m > 0. Then, since it is in ∪i Ei, its