THE APPLICATION OF BAYESIAN 'smIsncmf f7" * AUDITING: mscmm: VERSUS commons PRIOR DISTRIBUTIONS ~ * Thesis for the Degree a! HI D; ' HICHGAN SWE UMVERSn-‘Y‘ ALBERT WIRE FRANCISEO 19H LIBRARY (nexus DANijgQDISRIOO University This is to certify that the thesis entitled THE APPLICATION OF BAYESIAN STATISTICS TO AUDITING: DISCRETE VERSUS CONTINUOUS PRIOR DISTRIBUTIONS presented by Albert K. Francisco has been accepted towards fulfillment of the requirements for Ph.D degreein Business - Accounting Date May ‘5, 1972 0-7639 that Stud info butiI have such the 1 Bayes with I UOuS J Prior which methOC POOr 5 rates ABSTRACT THE APPLICATION OF BAYESIAN STATISTICS TO AUDITING: DISCRETE VERSUS CONTINUOUS PRIOR DISTRIBUTIONS BY Albert Kenning Francisco It has been suggested in the auditing literature that auditors adept Bayesian statistical techniques. Studies have shown that auditors are willing to provide information that can be used to construct prior distri— butions for this purpose. Practicing auditors, however, have not widely adepted Bayesian techniques, even though such techniques have received considerable exposure in the literature. The articles suggesting the use of Bayesian techniques have all demonstrated such techniques with discrete prior distributions. Although less difficult to work with than contin- uous prior distributions in simple examples, discrete prior distributions have several practical disadvantages which may partly explain the lack of adoption of Bayesian methods by auditors. Discrete prior distributions are poor approximations of the continuous range of possible rates found in most audit attributes sampling situations. -— pl It re; dis USU OVEI is c lati It i: Sketc resea Prior distr mOde) addit a giv a bin< and s Albert Kenning Francisco The derivation of specific discrete prior distributions is difficult, because a number of points must be assigned probabilities, and these probabilities must total 1.00. It is difficult for an auditor to perceive the meaning of a discrete distribution because only a few of the possible points are assigned positive probabilities. Involved com— putations (many multiplication and division Operations) are required for the solution of any practical problem using discrete distributions and Bayesian statistics. This would usually require a computer. Bayesian methods using beta prior distributions can overcome most of these difficulties. Such a distribution is continuous, and fits the large number of possible popu- lation error rates better than does the discrete model. It is easier for an auditor to visualize continuous curves. Sketches of representative distributions deve10ped in this research can allow an auditor to quickly pick the specific prior desired, and write down the numeric parameters of that distribution. Tables of statistics (mean, variance, and mode) of these representative distributions can provide additional information to the auditor seeking a prior for a given audit situation. The revision of a beta prior by a binomial sample result is not difficult. A few addition and subtraction Operations are required. These can be __—J one but. give able quic and tion Whick compu Table moder tions SiMps Can bI generg poster table c adopt; Albert Kenning Francisco performed by an auditor with the aid of nothing more soPhisticated than a piece of paper and a pencil. Once this is done, the auditor can refer again to his tables for a sketch of the posterior distribution (or a very similar one) and for statistics he wishes to know for that distri- bution. Another set of tables deve10ped in this study can give confidence intervals. What would have taken consider- able computer time with a discrete prior distribution is quickly taken care of with this method using a few tables and a quick addition and subtraction Operation. Although tables and sketches of the beta distribu- tion are not currently available, techniques exist with which such information can be generated. The formulas for computing statistics of beta distributions are not complex. Tables of such statistics can be generated in seconds on modern computers. Similarly, sketches of such distribu- tions can be created by plotters attached to computers. Simpson's rule for the evaluation of definite integrals can be used as the basis for computer programs which will generate confidence intervals based upon representative posterior beta distributions. Examples of sketches and tables are included in the study. It is concluded that, although auditors have not adOpted Bayesian statistical methods, such methods hold Albert Kenning Francisco promise. There are more SOphisticated statistical techni— ques available than those demonstrated in the articles which have prOposed Bayesian methods for use in auditing. Such techniques, such as the beta prior distribution dis- cussed here, may allow auditors to benefit from the advan- tages Of Bayesian statistics that have been claimed in the literature. THE APPLICATION CW'BAYESIAN STATISTICS TO AUDITING: DISCRETE VERSUS CONTINUOUS PRIOR DISTRIBUTIONS BY Albert Kenning Francisco A THESIS Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Accounting and Financial Administration 1972 COpyright by ALBERT KENNING FRANCISCO © 1972 — —— comr Me a. and exp: Jone SUPP to p Denv inw Prov Orga Nati. fOr ' natiI ACKNOWLEDGMENTS Thanks are due the members of my dissertation committee, Professors Alvin Arens (chairman), George Mead, and Robert Staudte for their review of my thesis and constructive suggestions. Appreciation is also expressed to Professors James Don Edwards and Gardner Jones who, as department chairmen, ensured financial support during my Ph.D. program. I am also indebted to Professor James E. Sorensen of the University of Denver for arousing my interest in the general area in which the dissertation was done. Appreciation is due the organizations which provided financial support for my Ph.D. program. These organizations include the American Accounting Associa- tion, Ernst & Ernst, and the U. S. Government for its National Defense Education Act fellowship. Finally, appreciation is expressed to my parents, for encouraging me to remain in school when other alter- natives arose, and to my wife Barbara and daughter Eileen for help and encouragement throughout my graduate studies. ii of I would like to thank Mrs. Donna Ford for an excellent job of typing the final manuscript. iii LIE LIE Cha II” TABLE OF CONTENTS Page ACKNOWLEDGEMENTS. . . . . . . . . . . . . . . . . . . ii LIST OF TABLES. . . . . . . . . . . . . . . . . . . . Vii LIST OF FIGURES . . . . . . . . . . . . . . . . . . . viii Chapter I INTRODUCTION. . . . . . . . . . . . . . . . l A. The Auditor in Society. . . . . . . . 1 B. Audit Sampling. . . . . . . . . . . . 3 C. Classical Statistics. . . . . . . . . 6 D. Bayesian Statistics . . . . . . . . . 7 E. Purpose of this Study . . . . . . . . 8 F. Limitations of the Study. . . . . . . 11 II AUDIT EVIDENCE AND AUDIT VARIABLES. . . . . 14 A. Audit Evidence. . . . . . . . . . . . 14 B. Audit Variables . . . . . . . . . . . 16 C. Effect of Variables on Audit. . . . . 20 III ATTRIBUTES SAMPLING AND CLASSICAL STATISTICS. . . . . . . . . . . . . . . . . 24 A. Introduction. . . . . . . . . . . . . 24 B. Classical Statistics as Used by Auditors . . . . . . . . . . . . . . 26 C. Effect of Audit Variables . . . . . . 30 iv Ch VII Chapter IV VI VII ATTRIBUTES SAMPLING USING BAYESIAN STATISTICS: DISCRETE PRIOR DISTRIBWIONS . O C O C C O C O O O O O O A. B. C. Bayes Theorem. . . . . . . . . . . Discrete Prior Distributions . . . Difficulties Found in the Use of Discrete Prior Distributions. . . BAYESIAN STATISTICS AND THE BETA: A CONTINUOUS PRIOR DISTRIBUTIONS . . . . . A. B._ C. D. E. The Beta Distribution. . . . . . . Selection of a Prior Distribution Revision Of a Beta Prior by a Binomial Sample Result. . . . . . Confidence Intervals from Beta Posterior Distributions . . . . . An Example of the Use of a Continuous Prior Distribution . . SUBJECTIVE PROBABILITIES . . . . . . . . A. B. Definition . . . . . . . . . . . . Audit Evidence and Prior Distributions . . . . . . . . . . CONCLUS I ON C O C C O C C O O O O O O O O A. B. C. D. Summary. . . . . . . . . . . . . . Relative Advantages of Various Distributions in Applications of Bayesian Statistics to Auditing. . . . . . . . . . . . . Conclusions. . . . . . . . . . . . Suggestions for Further Research . Page 35 35 37 43 49 49 61 66 68 73 79 79 83 89 89 93 95 98 SE . 1- IIIUIINS APPENDIX Page A. BETA DISTRIBUTION TABLES. . . . . . . . . . . . 100 B. SKETCHES OF BETA DISTRIBUTIONS. . . . . . . . . 105 C. REVISION OF A BETA PRIOR DISTRIBUTION BY A BINOMIAL SAMPLE RESULT . . . . . . . . . . . . 110 D. COMPUTER PROGRAMS . . . . . . . . . . . . . . . 113 E. EVALUATION OF INTEGRALS NOT SOLVABLE BY NORML MEANS C O C O O O O O 0 O O I C O O O C 12 3 SELECTED BIBLIOGRAPHY . . . . . . . . . . . . . . . . 125 vi LIST OF TABLES Table 1 Statistics of Selected Beta Distributions. . . . . . . 2 One Sided Beta Confidence Coefficients . . . . . . . 3 One Sided Beta Confidence Coefficients . . . . . . . 4 One Sided Beta Confidence Coefficients . . . . . . . 5 Computer Program "Table". . 6 Computer Program "Plotit" . 7 Computer PrOgram ”Value”. . vii Page 101 102 103 104 115 119 121 Figure 10 11 12 13 14 15 16 Beta Distribution Graphx°“2(1-x)P'1 + (-1)cx 403-1) (1905‘2 :2 (car-1)x°"2(1--x)'3'1 - x“‘1(p-1)(1-x)9‘2 = 0. (d-1)Xq—2(l-X)B’l = (p-1)§'1(1-x>P'2 E :; x(l-x)-l dram-1+x == fix-x d'l ‘3 (6‘2”)x The mode of the beta distribution occurs at X: 00-1 “+fi-2 To illustrate the computation of the mode of a beta dis- tribution, the mode of the distribution shown in Figure 1 will be computed. o( -1 = 1 «+5-2 1.4 + Mode: = 0.05 The mean, variance, and mode discussed here are useful to an auditor attempting to fit a prior beta 61 distribution to evidence available to him. Once he has determined the specific constants (0(and p ) which deter- mine the prior distribution, it is necessary to revise the prior by the results of a current audit sample. Determina— tion of the prior distribution is discussed in Section B, while the revision of the prior by the results of a current audit sample is explained in Section C, and the procedure used to make inferences from the result is covered in Section D. B. (Selection of a Prior Distribution Before Bayes formula or a posterior confidence interval can be used, a meaningful prior distribution must be selected by the auditor. This prior is based upon evi- dence actually available to the auditor: evidence which relates to the error rate of the pOpulation under con— sideration. It is important that an auditor interested in the use of Bayesian statistics understand the meaning of the prior distribution he selects. To aid in this, various statistics can be computed about representative prior distributions. The mean, mode, and variance have been discussed in this study, and such statistics can be of value to an auditor who has had even limited 62 statistical training. A short table of such statistics computed for selected beta distributions appears in Appendix A (Table l). A useful aid to the auditor faced with the problem of deriving a prior distribution might be a book of sketches of representative beta prior distributions. These might be arranged in a fashion similar to a police department's books of "mug shots" which include a number to identify each portrait. As the victim of a criminal act might thumb through the "mug shots" and note the identifying number of a picture which bears a resemblance to the offender, an auditor attempting to fit a prior distribution to specific circumstances might thumb through the sketches and note the appr0priate o( and ,3 parameters for the distribution which best fits his situation. These sketches might be ordered in sections by mean or by mode to provide the auditor with a starting point. A short example of such a "mug boo " of sketches of Specific beta distributions appears in Appendix B. The use of such sketches would save time. In- stead of listing a number of probability points (the Inethod used by those who have prOposed the use of discrete Iprior distributions in auditing applications of Bayesian 63 statistics), the auditor would merely look through the sketches until the appr0priate prior was located. Marked on that sketch would be the parameters 6! and 5 which deter- mine the function. After noting those parameters, the auditor would be ready to take the current sample and revise the prior with Bayes theorem. Graphs of beta distributions, in the form of the sketches discussed above, can be produced by a plotter attached to a digital computer. The sketches shown in Appendix B were derived in that fashion. The computer programs which resulted in those sketches are shown in Appendix D. Also shown in that appendix are the programs used to produce the beta distribution tables which appear in Appendix A. Several statistics can be used by the auditor in locating the apprOpriate prior distribution for his use. One mark with which he might begin is the mean of his prior expectations. This is not the single rate he feels is most likely to occur, based on his prior expectations, but rather is the expected value of the function. The mean is a useful measure and it is widely employed. However, when faced with the problem of specifying a distribution, the typical auditor might well have 64 difficulty identifying the expected value of the result. He could specify the single error rate he feels is most likely to occur. This is the definition of the mode, another statistic discussed with reference to the beta distribution earlier in this chapter. Because it appears that the mode is less difficult for an auditor to use, it is used to identify the distributions shown in Appendix A and Appendix B of this study, although the mean is given as supplementary information. One factor the user of Bayesian statitics would consider is the relative weights given to the prior dis- tribution and the current sample in the derivation of the posterior. As the sum (Xi-Sis increased, with the mode of the prior held constant, the variance of the prior decreases, implying greater certainty. An auditor who has considerable evidence that an error rate is near 5% may use a prior distribution with a relatively large (when compared to the current sample size):K4-fi3. Another auditor with some smaller amount of evidence which also indicates an error rate near 5% is likely to select a prior in which the sum cup is somewhat smaller than did the first auditor. If each draws the same current sample result, the prior selected by the first auditor will 65 have a greater relative effect upon the posterior than that selected by the second auditor. An auditor with no statistical training could not be expected to derive meaningful prior distributions for use with Bayes theorem. One study has shown that auditors are "willing to quantify their conclusions as probability distributions."6 The same study7 found that different auditors produce different prior distributions from the same data. This implies either a lack of agree- ment between the auditors as to the audit variables under- lying the situation, or a difference in how each auditor subjectively interprets the audit variables. Either alternative suggests that more than a quick introductory lesson would be necessary before several auditors would reach similar independent conclusions from the same data. Once an auditor has selected a prior distribution from a set Of sketches or tables of statistics, he has the constants c: and (3 which are required for the revision of a prior beta distribution by Bayes theorem. These constants are combined with the results of the current audit sample to find the posterior distribution upon which statistical inferences will be based. The Operation of revising a beta prior distribution by a binomial sample result is discussed in the following section. 66 C. Revision of a Beta Prior by a Binomial Sample Result After an auditor has selected a prior distribution and taken the current sample, only two steps remain in the use of Bayesian statistics with beta prior distribu- tions in audit attributes sampling situations: The revi- sion of the prior with Bayes formula and the creation of confidence intervals based upon the posterior beta distri- bution. This section discusses the revision Of the prior distribution, while section D eXplains how a confidence interval can be obtained from a beta posterior distribu- tion. An example illustrating the entire process appears in Section E of this chapter. The revision of a beta prior distribution by a binomial sample result is unexpectedly simple. A beta distribution is determined by only two parameters, re- ferred to as o( and f3 . The result of an audit sample can be specified by n (the sample size) and x (the number of errors found in the sample). If the prior is a beta distribution with parameters at and p , and the sample a binomial of size n with x errors, the posterior will be a beta with parameters a" z o<+ x - 1 and fl'2 P-F n - X - l. 67 The derivation of this result is shown in Appendix C. Here it is important only to note that revision of a beta prior by a binomial sample is a simple arithmetic manipulation requiring no higher mathematics than addi- tion and subtraction, no computer, and no tables. The resulting posterior distribution is another beta, which may be further revised by additional binomial samples, in the same manner. The result will always be another beta distribution. The Bayesian method discussed in the preceding paragraph would be relatively easy for an auditor in the field to use. If he had sketches of a number of possible beta prior distributions and tables of relevant statistics, he could select the prior which best represents the evi- dence available about a given error rate before a current sample is taken. He would then write down the appr0priate constantscx and f for that distribution. Once the current sample has been completed, its outcome can be summarized by two figures, n (the sample size), and x (the number of errors found in the sample). (Note that this x is not the same as the x used to represent the error rate in the beta function. Statisticians use the same term in two different ways.) The determination of the posterior 68 distribution can be quickly accomplished with the aid of a pencil, a piece of paper, and the two formulas discussed previously in this section. The final step in the analysis involves drawing inferences from the posterior distribu- tion thus Obtained. This requires the derivation of confidence intervals from posterior beta distributions. The following section contains an example of the revision of a beta prior distribution by Bayes theorem and explains how confidence intervals can be obtained from beta posterior distributions. D. Confidence Intervals from Beta Posterior Distributions The procedure used to select a prior distribution, summarize the results of the current audit sample, revise the prior by those results, and obtain the posterior beta distribution has been discussed in the three preceding sections. Once this procedure has been followed in an actual audit situation, there remains only the problem of determining confidence intervals from the posterior distribution. In the first section of this chapter, an example of the calculation of a confidence interval from a beta posterior distribution was shown. The method used in 69 that example involved the evaluation of a beta integral through the calculus technique of integrating by parts. That method is not generally applicable to beta integrals in which the constants‘d.and p are not integers. Since auditors would use beta distributions with non-integer constants, an alternative computation technique is required. Mathematicians have deve10ped approximation techniques for the evaluation of functions which can not be computed directly. One of these techniques is Simpson's rule for the approximation of definite integrals. It approximates the area under a given curve by summing the areas Of a number of segments which approximate the area of portions of the desired area. A further discussion of Simpson's rule can be found in Appendix E. Simpson's rule can be used to create tables of confidence intervals for the beta posterior distribution. If a set of representative posterior distributions were used as the basis for a group of confidence interval tables, the auditor with access to such tables could quickly determine the limits of the desired confidence interval, once he had completed the tasks leading to the posterior distribution. An abbreviated set of such 70 tables, which were computed with the use of Simpson's rule, is shown in Appendix A (Table 2, 3, and 4). Those tables are shown as an example. They are not complete enough for use in practical situations. The computer programs which were used to obtain those tables appear in Appendix D. The entire procedure followed by the auditor, from selecting a prior distribution through looking up the limits Of the confidence interval in the tables, would take only a few minutes (aside from the time required to draw and examine sample documents, etc.). The auditor would not require the use of a computer, or even an adding machine. To illustrate the use of a beta prior distribution and a binomial sample result in an application Of Bayesian statistics to auditing, it will be assumed that an auditor has evidence regarding errors in some pOpulation which he feels is represented by a beta distribution with para- meters D(=1.4 and p=8.6. He takes a sample of 92 (n) items and finds 30 (x) errors. What are the parameters of the posterior beta distribution? a' «+X-l. fl :fi+n-x-l II = 1.4'+'30.0 - 1.0 ::8.6 +~92.0 - 30.0 - 1.0 =- 30.4 =69.6 71 Illustrations of the shapes of both the prior and posterior distributions in this example can be found on page 51. It should be noted that the mode of the prior is 0.05, while the information contained in the sample results in the mode of the posterior equaling 0.30, certainly a significant shift for most auditors. The prior in this example has relatively less influence upon the posterior than does the sample result. Confidence intervals can be constructed from the posterior beta distribution by referring to apprOpriate tables, such as those in Appendix A (Tables 2, 3, and 4). To use the confidence interval tables, the appropriate at and p parameters are located along the left side of the tables, while the confidence coefficient desired is located in the body of the table on that same line. The number at the tOp of the column in which the desired confidence coefficient appears is the upper limit of a one sided confidence interval. If the auditor in the example discussed in the preceding paragraph wished to construct a one sided confidence interval with a 97.8% confidence coefficient, the tables would show him that 97.8% of the area under a beta curve with parameters °( = 30.4 and flz69.6 would lie below an error rate of 72 40%. In Chapter III, it was shown that the classical statistics approach resulted in a similar conclusion of a 97.8% confidence coefficient that the pOpulation error rate was below 40%. That result, however, was based upon a sample of 169, while the Bayesian example shown here requires a sample of only 92 to reach the same confidence interval. In each instance the sample error rate was 32.5%. Thus, the additional information contributed by the auditor's prior distribution reduced the sample size necessary to obtain the same assurance as that gained from the classical model. The relative effect upon the posterior distribution by the prior and current sample result is determined by both the "gentleness" of the prior curve and the size of the sample taken. If a prior which is rather noncommittal (has a large variance) is revised by a large sample, the result will be largely determined by the sample. On the other hand, if a prior with a small variance is revised by a small sample, the result will be similar to the prior.8 This matter is further considered in Chapter VI. In summary, the use of a beta prior distribution by auditors in applications of Bayesian statistics to auditing requires several steps. In the first section Of 73 this chapter, beta distributions were discussed and it was demonstrated how computations can be made which yield information about specific beta curves. Several ways of aiding an auditor who is faced with the problem of determining a beta prior distribution for specific circum- stances were discussed in the second section Of this chapter. That was followed by an explanation of the process used to revise a beta prior by the results of a current audit attributes sample, while the confidence intervals necessary to the making of inferences based upon beta posterior distributions were the subject of this section. The last section of this chapter contains an example of the use of this method in a specific auditing situation. E. An Example of the Use of a Continuous Prior Distribution The preceding sections have explained the pro- cedures and computations necessary to arrive at a posterior distribution from a beta prior distribution and a binomial audit sample result. The following example illustrates the application of a beta prior distribution to a common audit situation in which classical methods result in an illogical conclusion. 74 Assume an auditor has been doing the field work on a certain engagement for the past six years. Each year he has sampled 40 of the approximately 6000 sales invoices, and each year has found no errors. There have been no significant changes in the internal control system over sales or in the accounting staff. From the tables in Arking, he determined that there was a 99% confidence coefficient that the actual error rate was less than 10.8%. Bayesian methods allow the auditor to create a prior distribution which includes such information, when desired. In this situation, the auditor would be likely to use a prior with a very small mean and mode, probably in the neighborhood of 1% to 2%, because he feels strongly that the actual rate is at least that small, based upon past experience. Assume that this auditor is attempting to be very cautious in his prior and picks a beta prior with «x: 2.98 and B = 59.02. This means that he feels that "most likely“ error rate is the mode Of «-1 __,_ 2.98 - 1.00 ____ “+fl-2 2.98 + 59.02 - 2.00 3 . 30%: and that the "average" expected value is the mean of , 2.98 2.98 0‘ «+13 " 2.98 + 59.02 60.00 :1 4.83%. 75 These values are both high (conservativex compared to the auditor's expectation of the actual rate. As stated above, the actual sample yielded no errors (x=0) out of a sample of 40 sales invoices (n=40). To determine the posterior distribution, the following computations would be made: at. on—X - 1 fl' ‘: fi-f- n - X - 1 :2.98+ 0 - 1 :59.02+ 40 - 0 - 1 =1.98. 298.02. A one-sided classical confidence limit was found above to yield a 99% probability that the actual rate was less than 10.8%. By referring to the beta table, a similar confi- dence interval can be constructed from the posterior Bayesian distribution (Table II in Appendix A). Such a confidence interval would give a 99% confidence coefficient that the actual error rate lies below 7.0%, instead of the 10.8% which the classical method resulted in. Thus, even a very ”conservative" (in the sense of having mean and mode greater than the actual expectation of the auditor) prior distribution results in a more realistic posterior distri- bution and resulting confidence interval than did the classical methods. A prior which more accurately reflected the expectations of the auditor would result in an even 76 more reasonable confidence interval. It should be noted that an auditor using classical statistical methods in the example discussed above would probably approach the situation by reducing the confidence level he requires in the current year, knowing this will reduce the sample size. But he must make this decision by "feel", with no formal guide as to the effect of this change upon the analysis. The Bayesian method, on the other hand, allows the auditor to be as specific as he desires in making up a prior distribution based upon the results of previous years' audits. The preceding situation is one to which Bayesian methods seem naturally adOpted. Not all auditing situa- tions are so constructed as to fit the Bayesian model in this manner, however. For example, the audit variables which affect the risk of the auditor being subject to sanctions because of the bankruptcy of the client can be considered. It would be difficult for an auditor, even one with considerable training and experience in statis- tical methods, to construct a prior distribution which meaningfully reflected this possibility. It appears that there is a sort of scale of audit variables, with those obviously applicable to Bayesian methods at one end, and 77 others, such as the bankruptcy example mentioned here, at the opposite end. Subjective probabilities, such as those used as prior beta distributions, have been the subject of some controversy. This is the subject of the following chapter. 78 CHAPTER V--FOOTNOTES 1Harold J. Larson, Introduction £9_Probability Theory and Statistical Inference (Belmont, California: John Wiley & Sons, Inc., 1969), p. 358. 2Irvin H. LaValle, Ag Introduction £9 Probability, Decision, and Igference (New York: Holt, Rinehart and Winston, 1970), p. 256. 3Larson, p. 358. 4Ibid., p. 305. 5Ibid., p. 305. 6John C. Corless, "The Assessment of Prior Distri- butions for Applying Bayesian Statistics in Auditing Situ- ations," unpublished Ph.D. dissertation, University of Minnesota, 1971, p. 127. 7Ibid., p. 127. 8Robert Schlaifer, Introduction Egyggatistigg for Business Decisions (New York: McGraw—Hill, 1961), p. 217. 9Herbert Arkin, HandboOk gf Sampling for Auditing and Accounting Volume I Methods (New York: McGraw-Hill, 1963), p. 506. CHAPTER VI SUBJECTIVE PROBABILITIES A. Definition An auditor who has decided to use Bayesian stat— istics must express in a prior distribution the evidence available to him. The mathematics associated with the beta distribution was discussed in Chapter V. But the auditor attempting to use Bayesian statistics needs more than formulas, or tables, or other such aids. He needs an understanding of what lies behind Bayes theorem. This chapter attempts to provide some of that information. Probability distributions are pOpularly thought of as being based upon objective, verifiable evidence. Thus, in the usual sense of the term, a probability dis- tribution would not vary even when derived by different auditors from the same sample. But a qualified auditor may have Opinions as to the distributions underlying the sample taken, in addition to the sample result. Often these Opinions are based upon facts just as real as the sample result, but not directly expressible as a 79 80 probability distribution. These Opinions may give rise to a "personalistic" or "subjectivist" probability dis- tribution based upon the Opinion of the qualified auditor instead of directly upon a sample. Since different audi— tors have varying perceptions and experiences, these sub— jective distributions are not necessarily constant. This makes the information contained in them no less real, how- ever. It simply reflects the differing information con- tained in the Opinions of the various auditors. Information of this type is sometimes criticized because it is not based directly on solid, verifiable evidence. It is said that this type of information is so subjective as to be worthless. But if the auditor generating the information is qualified and has relevant experience, some part of the data will be ignored if this entire class of information is not used.1 If the Bayesian method were employed, the audi— tor would be forced to put down his expectations on paper. At the very least he would be forced to pay attention to this element, and if he were trained to use an input distribution such as the beta which lends it- self to simple adjustments for various subjective distri- butions, it would be possible for him to approximate the 81 sum total of his own experience which is relevant to the decisions being made. Some who criticize the subjective school contend that unlimited repeatability is a prerequisite for an experiment to be associated with a probability distribu- tion. They would limit probabilistic inference to games of chance, problems of social mass phenomena such as life insurance, and mechanical phenomena such as the movement Of molecules in a gas.2 A rebuttal against such criti- cisms is that the formal inclusion of subjective factors in the decision process is better than informal considera- tion outside the statistical sampling model. As a minimum, it provides written evidence of where information arose in the event of a lawsuit or inquiry into the audit after the fact. Another defense is that of usefulness. If one can Obtain better results (in the sense Of more information for the same cost or the same information for a lower cost) through the application Of statistical principles to one time only events, then one is in a more desirable position using such tools than not using them. When an auditor attempts to derive a prior dis- tribution, it is imperative that he understand the variance statistic and use it or a similar measure to insure that his prior is not so clustered about a single 82 point that other possibilities are excluded. In this the traditional conservatism of an auditor may be an asset because the use of a distribution with too small a variance has the result of putting more information in the prior than is justified by the supporting evidence. This may expose the auditor to sanctions for not being cautious enough in his work, while too large a variance in the prior allows a margin of safety for the auditor. As an auditor learns more about the use of statistics, he may approach a smaller variance with less fear when he has reason to justify the greater certainty implicit in such a distribution. The auditor just beginning to use Bayesian methods should be cautious to insure that any errors made will tend toward larger than necessary sample sizes. Once it has been determined that Bayesian stat- istics is to be used, there comes the problem of driving a prior distribution that expresses the information avail- able to the auditor before a current sample is taken. Several approaches have been suggested. In one the audi- tor lists error rates 0.00, 0.01, 0.02, 0.03,......, 0.98, 0.99, 1.00 and determines his "feelings" about the infor- mation available to him by associating a probability with each error rate in such a fashion that the sum of 83 these 101 probabilities totals 1.00. Other approaches use more or fewer possible error rates, and still others utilize continuous distributions to reflect the prior information available. Both continuous and discrete prior distributions must be based upon available evidence. The relationship between the audit variables and the selection of priors is touched upon in the following section. B. Audit Evidence and Prior Distributions It has been shown3 that auditors are willing to provide information about their feelings in an audit case which can be used to construct probability distributions adaptable to priors. One of these prior distributions can be combined with the results of an audit sample using Bayes theorem to produce a posterior probability distribu- tion from which confidence intervals can be constructed or tests of hypotheses can be conducted. A problem that has plagued those considering the use of Bayesian stat- istics in auditing has been the difficulty of translating an auditor's feelings about the audit situation with which he is faced into a statistical distribution usable in the model. The previously cited work with discrete distribu- tions has required the precise listing of a number of 84 sample points and the related probability of each. To avoid a sacrifice of accuracy in the approximation, the number of sample points must be large. This listing is not an easy task for an auditor to undertake, and he is further frustrated by the necessity that the sum of all the prob- abilities totals l.00. Consistency is difficult to obtain with such a method. If continuous distributions are also considered, however, the problem of creating a prior distribution from the auditor's feelings is simplified considerably. Because of the nature of continuous distributions, they are smooth curves which may be plotted so that the auditor can com- pare a specific curve with his feelings in a given situa— tion. Once he has found a "picture" of the curve which best fits his perceptions, the mathematical specifications Of that curve may be noted and used as input to a computer program or tables which have already solved the mathe- matics involved. An auditor attempting to construct prior distri- butions must utilize information from his environment, from the client's environment, and from within the client's organization. Several audit variables from each of these three categories were discussed in Chapter II. They vary in applicability to Bayesian statistics. 85 Information from the auditor's environment is important in the derivation of meaningful prior distri- butions. The auditor's general experience with this and with similar clients aids in determining the variance associated with prior distributions. Specific sample results from other years' work papers provide an excellent source of information about the likely rate in the current year. Such information, it must be remembered, is only a supplement to current test results, even though the infor- mation contained therein is important.4 There are numerous other audit variables from the auditor's environment, such as the relative sizes of the auditor and client, which provide some information useful in the determination of prior distributions. The client's environment is important to the auditor. General economic conditions can determine whether the client succeeds or fails. If the national economy appears to be headed toward a depression, the auditor may be less likely to rely upon information from prior distributions derived in different economic circumstances. There are many other factors, such as the absolute size of the client, the industry, the legal form chosen by the client, listing on stock exchanges, registration with the Securities and Exchange Commission, and the mode of 86 financing chosen by the client, which have some bearing upon the risk faced by the client's owners and creditors, and thus upon the risk that the audit results will be challenged. Such information can be used in determining the variance Of prior distributions and the resulting weight contributed to the posterior by the prior distri- bution and current sample results. Information from within the client's organization is especially important to the auditor attempting to derive a specific prior distribution. If the client has an in— ternal auditing staff, this fact, as well as the results Of tests by the internal audit staff, can be very helpful in determining rates which will probably be found in the samples taken, and the variability to be associated with the prior distributions. Other internal control factors also contribute information useful in the derivation of prior distributions. Finally, the auditor is interested in facts about a specific area when making inferences about that area. In deriving a prior distribution, he needs information about such things as the materiality of the items on the financial statements to which the specific sample relates. Many more items of information from the audit variables can be useful to the auditor in choosing prior distributions for the audit, but is is not 87 the purpose of this study to explore such items in depth. In addition to being reflected in the prior, the audit variables can be used in Bayesian statistics to determine the confidence coefficient and interval limits just as in classical statistics. This gives an additional input to the model for such information. In general, a smaller current sample size will be required with Bayesian statistics to produce a given confidence coefficient and interval limit as compared to classical statistics. The ultimate decision as to whether Bayesian statistics or classical statistics will be used in auditing probably depends upon the ability of auditors to develOp prior distributions. It was attempted, in this study, to deve10p more feasible approaches to Bayesian statistics in auditing than had before been prOposed. Although the pro- cess of choosing a prior distribution in a Specific audit situation is beyond the scope of this thesis, methods have been suggested which may aid in that process. This area is important to the development of Bayesian statistics in auditing, and requires study beyond that which has been devoted to the subject at this point in time. 88 CHAPTER VI--FOOTNOTES lSir Ronald Fisher, "The Nature of Probability," The Centennial Review, Summer, 1958, p. 272. 2Richard Von Ness, "Probability: An Objectivist View," reprinted in Edwin Mansfield, Elementary Statis- tics for Economics and Business (New York: Norton, 1970), p. 61. 3John C. Corless, "The Assessment of Prior Dis— tributions for Applying Bayesian Statistics in Auditing Situations," unpublished Ph.D. disseration, University Of Minnesota, 1971, p. 127. 4James E. Sorensen, "Bayesian Analysis in Auditing," The Accounting Review, July, 1969, p. 561. CHAPTER VII CONCLUSION A. Summary The objective of an audit by an independent accountant is to render an Opinion on the fairness of financial statements prepared from the records being audited. The Opinion must be based upon evidence ob- tained by the application of auditing procedures. Audit variables are the elements of an audit situation which should be considered in the decision about Which proce- dures to apply and how extensively they should be applied. Some variables determine the minimum audit program and others govern modifications to that program. The auditor's objective is to obtain sufficient evidence to justify an Opinion at the least cost. Non- statistical methods, referred to as judgment sampling methods, have been used since the early 1900's when audit- ing changed from a check upon every transaction entered in the accounting records to a sampling process. More recently, classical statistics has also become commonly 89 90 used in auditing to determine sample sizes and to construct confidence intervals about attributes, among other uses. Classical statistics as used by auditors is not necessarily classical statistics. The difference lies in the interpretation of confidence intervals. Classical statistics considers pOpulation parameters, even though unknown, as fixed. Auditors, however, typically make statements about confidence intervals which refer to the population parameter as an unknown, implying that it is drawn from a group of possible values. This indicates that a distribution of pOpulation parameters must exist. Information about this distribution can be considered in the Bayesian formula as a prior distribution and can be used to make inferences about pOpulation parameters. Bayesian statistics differs from classical methods in that information from two sources (the prior distribu- tion and the current audit sample) is combined into a single distribution. Classical auditing statistics admits only information from the current sample into the model. The inferences made from classical auditing statistical models approximate those arising from Bayesian methods in which uniform prior distributions are used. Auditors typically have information from prior audits, similar 91 engagements, other audit tests, internal control evalua— tions, and other sources which provide prior evidence about attributes tests. This prior information would seldom imply a uniform prior distribution, but instead would indicate certain rates that are more likely to occur than others. Auditors are taking larger samples than necessary to the extent that they rely upon classical auditing statistical methods when prior evidence indicates that some possible rates are more likely than others. Several articles have shown how Bayesian statisti- cal methods utilizing discrete distributions can be applied to auditing situations. Both the binomial and hypergeome— tric distributions have been discussed. Little has been said in the literature, however, about applications of continuous probability functions to Bayesian audit statistics. Other fields of study, especially the natural sciences, have found the normal distribution to be of value in a wide variety of situations. When the normal distri— bution is revised with Bayes formula, the posterior distri- bution is sometimes very difficult to evaluate. There- fore, it is not well suited to applications of Bayesian statistics to auditing. A distribution which is adaptable 92 to such uses is the beta. When a beta prior distribution (continuous) is revised by Bayes formula and a binomial sample (discrete), the result is always another beta dis— tribution. This may be further revised by another binomial distribution, if desired. The combination of continuous and discrete distributions in this manner models the typical attributes auditing sample. Revision of a beta prior distribution by a bi- nomial sample result with Bayes theorem requires only a few addition and subtraction Operations. NO computer or tables are necessary. This makes the technique feasible for use by auditors who have had the necessary training, and who have tables and graphs Of the prior and posterior distributions. Although these are not currently available to auditors, methods for producing the necessary tables and sketches do exist. To produce the tables required for the derivation of confidence intervals from posterior beta distributions, it is necessary to evaluate beta integrals. Such integrals cannot be evaluated with the methods of calculus. They can, however, be approximated to any desired degree of accuracy with Simpson's rule for the approximation of definite integrals. Even though this approximation is 93 not practical if done by hand, modern digital computers can perform the necessary calculations in seconds, at a very low cost. Some applications of Bayesian statistics have been criticized because subjective probabilities were used. It is argued that statistics should be applied only to situa- tions that can be repeated a large number of times. Suffi— cient justification for the use of such methods in the analysis of one time only events exists, however, if the application of these methods provides a basis for better decisions at a reasonable cost. In addition, the formal consideration of subjective factors in a statistical model is preferable to informal consideration. B. Relative Advantages of Various Distributions In Applications of Bayesian Statistics to Auditing If Bayesian statistics is to be used by auditors, a decision must be reached as to which of the many stat- istical distributions available best fits the auditing situation and is most efficient in actual use. Discrete distributions have been found less difficult for those not trained in higher mathematics to use than continuous distributions. But this study has shown that the actual revision of a prior distribution by a sample result is 94 less complex if a continuous prior distribution, the beta, is used. Continuous distributions are also more exact representations of the prior information available to auditors. In situations where discrete prior distributions are used, it was discovered that the hypergeometric pro- vides more information from samples which are large rela— tive to the size of the pOpulation than the binomial distribution. But this advantage was found to be out- weighed if the sample was small compared to the pOpula- tion size because of the additional complexity of computations involving the hypergeometric. The combination of a beta (continuous) prior distribution and a binomial (discrete) sample result with Bayes formula was found to closely fit the situation found in attributes audit sampling. Once the prior has been determined by the auditor and the sample taken, the actual Operation of obtaining the posterior beta distri- bution is easily accomplished in a few minutes by simple addition and subtraction. This was not found to be the case with situations in which discrete prior distributions are used. If discrete priors are used, lengthy computa— tions, generally involving a computer, are required to 95 obtain the posterior in practical auditing situations. Evaluation of the beta prior and posterior distributions is the only difficult part of a Bayesian analysis using beta prior and posterior distributions and a binomial sample result. Graphs and tables of representative distri— butions can be prepared in advance, which would make the use of this technique feasible in actual auditing situa- tions, even if no computer were available. C. Conclusions Judgment sampling was once relied upon almost exclusively in auditing, but has been supplemented by, and in some cases, replaced by, classical statistical methods. Statistical methods provide more consistent answers and are more readily defended in the event of a challenge to the results of the audit than judgmental methods. Some audit situations do not lend themselves to statistical methods, due to the small size Of the audit, or the organization of the records, and in such cases judgmental methods should continue to be used. It has been suggested that Bayesian statistics may be used in some auditing situations. Discrete distributions (the binomial and hypergeometric) have been used in examples where Bayesian statistics was suggested 96 for use in attributes audit sampling. Discrete distri- butions are, however, unusable in practical auditing situations. They are poor approximations of the contin- uous range of possible values found in most audit attri- butes samples. The derivation of specific discrete prior distributions is difficult, because a number of points must be assigned probabilities, and these probabilities must total 1.00. Involved computations (many multiplica- tion and division Operations) are required for the solu- tion of any practical problem using discrete prior dis- tributions and Bayesian statistics. This would require a computer and a means for getting information about the prior and current sample result into the computer without error. These are usually not available in the field. At this time auditors have not adOpted these methods, even though such methods have received rather wide exposure in the literature. Bayesian methods using a beta prior distribution can overcome most of these difficulties. Such a distri— bution is continuous, and therefore fits the large number of possible pOpulation error rates better than does the discrete model. Visualization of the priors is easier because continuous curves can be drawn (plotted on a l u ‘t'... 1... Jill. ul. [1»).— I 1“: 97 computer). Sketches of representative distributions can allow an auditor to quickly pick the specific prior he wants and write down the numeric parameters of that distribution. Tables of statistics (mean, mode, variance, etc.) can be prepared for these representative distribu- tions. The revision Of a beta prior by a binomial sample result is not difficult. Once this has been done, the auditor can refer again to his tables for a sketch of the posterior distribution (or a very similar one) and for statistics he wishes to know for that distribution. Another set of tables can give confidence intervals from that distribution (one sided or two sided) at several confidence levels. What would have taken considerable computer time with a discrete prior distribution is quickly taken care of with this method by using a few tables in a book and a quick addition and subtraction Operation. This study has attempted to help convert an infeasible technique with a great deal of potential into a method feasible for use by auditors. It has been limited in several respects, however. NO attempt was made to discuss variables sampling in auditing. A 98 proposal was made suggesting the use of a continuous prior distribution in auditing applications of Bayesian statistics. For this it was necessary that tables and sketches of representative beta distributions be deve10ped. These were restricted to illustrative models. Additional resources would be necessary to the develOpment of complete sets of these tables and sketches. D. Suggestions for Further Research More questions arose during this study than could possibly be answered by this project. One area yet Open for investigation is the application of Bayesian statisti- cal methods to variables sampling. In addition, there are numerous statistical distributions and combinations of distributions which may prove applicable to auditing applications and have not yet been investigated. Some studies have been done which used Bayesian statistics with discrete prior distributions in "real world" audit situations. Much can be learned by more studies of this type, especially where applications of classical statistics and Bayesian statistics utilizing various distributions are compared. 99 The derivation of prior distributions from evidence available to the auditor is an area about which little is known. Studies drawing On the behavioral sciences are needed to make prior distributions in Bayesian statistical applications more meaningful to auditors. Comparisons can also be made between the various types of discrete and continuous distributions used as priors to determine the relative efficiency of each in arriving at correct deci— sions from the information available to auditors. Finally, it would be helpful to have information as to how some Of the many audit variables fit into the Bayesian model. This could provide insight as to the types of audit situations in which judgmental sampling, classical statistics, and Bayesian statistics are most applicable. APPENDICES APPENDIX A BETA DISTRIBUTION TABLES Table l Of this Appendix gives statistics of selected beta distributions. The mode, mean, and vari- ance of each distribution are shown. If explanation is necessary, any introductory statistics discusses these measures. Also shown are the parameters a and p , and their sum. The sum ov+p roughly corresponds to the sample size that would be required if the beta distribution were derived from a sample, while the value of zquz m a m.< woo: mZOHBDmHmBmHD dfimm QMBUMAMW m0 mUHBmHfidem H mqmdb ll- 1 ills"... j 1... 1. 102 ooooo. «mooo. noooo. msmoo. noooo. m:o~o. «oooo. summo. Noaoo. :mmoo. esssc. mm:~«. yNoms. momQN. .LMJG. \ .uJMJ. ”mosm. m:po:. meoo. ”Noam. «omoo. mswmm. obmoo. mgssm. o«.o ooooo. mmooo. ooooo. obnoo. oooao. nnoao. ooooo. 0:033. omooo. mmoso. mumao. momsa. moomw. «:~:~. nmumo. o:«mn. «Nnmm. mom~:. «swam. oooms. mammo. Naboa. Ooowm. nNnnm. mo.o ooooo. «nooo. ooooo. Jnmoo. ooooo. omnao. ooooo. Oummo. moooo. maaoo. moboo. wnkaa. bNmoa. oooow. momma. omosn. oooao. nmokn. s:nbo. owsds. ooooo. maoma. ooumw. names. oo.o ooooo. haooo. ooooo. mndoo. ooooo. nomoo. ooooo. amumo. aoooo. mnm:o. ooaoo. ss«0o. nnnoo. Newsa. wmnas. owoon. Nnnno. omann. saomo. :omon. moosm. osaoa. Nonwo. mamn3. so.o ooooo. noooo. ooooo. osooo. ooooo. ammoo. ooooo. condo. ooooo. :ouno. onooo. mmcoo. oaaso. manna. nsoom. nowmm. ankws. oomow. oaomo. women. nnnam. boo:n. «msoo. :oson. po.o ooooo. noooo. ooooo. unooo. ooooo. :Hnoo. ooaoo. onooo. ooooo. o~o~o. moooo. Nan:o. oowao. :nnod. kmamn. ossom. nwoom. mman~. 00:45. oomwm. somoo. mw5mw. wnwoo. buonn. mo.o ooaoo. «oooo. ooooo. maooo. ooooo. :mdoo. ooooo. o::oo. ooooo. odwao. oooso. ooono. «bNoo. ::Nuo. nocam. «nomd. mmsun. :noou. somum. nwnow. :«qou. amunm. :«Naw. mmosw. :o .o oooao. ooooo. ooooo. moooo. ooooo. «oooo. ooooo. owaoo. ooooo. moooo. ooaoo. mdudo. owooo. omm:o. oomoo. nacoa. «whoa. «onwa. umomn. anama. nonom. :«nma. meson. osmow. no.o ooooo. ooooo. ooooo. «oooo. ooooo. odooo. ooooo. «booo. ooooo. «wmao. oaooo. onsoo. «oooo. Nomwo. sooao. nosoo. Nonmo. ouoso. :nm:«. nmmoo. omswn. onmaa. ~N~om. ssona. ~o.o ooooo. ooooo. ooooo. ooooo. ooooo. Noooo. ooooo. coooo. ooooo. onooo. ocooo. obaoo. ooooo. boson. Namao. Nomwo. mKnoo. mmmno. :mqao. «cho. noaoo. :onmo. macaw. onooo. «o.o Q<>mmEZH WUZMQHmZOU ho EHZHQ mmmmD .mfidm mommm mEZmHUHmmMOU mUZMQHmZOU demm QMQHm mZO N mqmdfi oooou.om ooooo.m ooooo.om ooooc.m ooooo.oo ooooo.o oooom.:s ooooo.~ oooo¢.o~ oooo:.~ oooon.:o ooooo.~ oooo~.oo oaoo~.o oooa«.3¢ ooooo.o ooooo.mo ooooo.o ooooo.bw oooo~.c ooo:o.~o ooo:o.n ooo~o.oo ooo~0.o m ooooo.am ooooo.m. oooo~.o: oooo~.: oooo:.on oooo:.n oooom.mN 3oooo.n ooooo.o~ ooosb.~ oao:~.m« oooom.m ooooo.o« oooao.« aooo$.m oooo:.« ooomo.: ooomn.« ooo:o.n ooo:N.« oooco.~ oooo«.« oooom.« oooso.« a 103 coooo. omoao. ooaoo. nomoa. ~m~oo. ob:n«. anwo. oouom. oosbJ. conun. -.Lo. mauCJ. mm:mm. :m mo. n¢.mm. o~ rs. oo.oo. Nfiqvuo mm._m. :mpec. ooaoo. ::zmo. anwwo ”d33c. om.o ooooo. mumaa. ooooo. Nnmmo. «mnoo. omnma. owooo. nnmnN. nnaon. :ooen. «o«~o. ~omo:. ammon. ~:«oo. mmomo. ndsns. oaaoo. oooms. oommo. o-o~. ooooo. aJmoo. nomom. noomo. o«.o v4 .ooooo. omwdo. ooooo. sooao. o:«oo. wnsna. :nsno. oaodu. nmmow. moaan. «boss. omnnc. Nooow. ooosm. ooomo. «noon. somwo. omsns. omomo. cwobs. ooooo. «0:05. moomw. assoc. o«.o H ooooo. chooo. ooooo. conno. smooo. cooaa. «ooao. :«oou. suooa. omacm. momma. :Naoa. somoo. Noonm. «Nmmo. nsaoo. noowa. momou. ooqoo. «owns. ooooo.« «macs. mmqom. Nomos. ~«.o osooo. assoc. ooooo. moono. owooo. nomoo. ooooo. womb“. munwa. newmw. mmsmm. Nnoon. onwam. mmmom. mnmmw. nmmmo. Nomwm. ouuoc. momma. mmoon. ooooo. oasnu. owomw. xenon. oa.o d ooooo. nmmoo. ooooo. «::No. coooo. onnmo. osnoo. moo3d. NoJKo. oo:NN. nosss. ommnn. 0::oo. ~o«~:. mJoom. moamo. smooo. nwamo. omoom. moaoo. 09000. «nods. Noooa. s~ons. m«.o A¢>mmBZH mUZmQHmZOU m0 EHZHA ooooo. :d:oo. ooooo. somuo. Noooo. moooo. «:«oo. buawd. ono3o. omboa. oNonn. nowon. sh04o. ommns. mmomo. mo~om. cocoa. bdmdm. msoom. wnomo. omoco. nadcm. nomad. :nads. 4« .o mmmmb .medm mommm ooooo. Ow~oo. ooooo. om:«o. oaooo. unomo. s:ooo. nOood. mooao. :«osa. busnw. oawom. o~:~s. bacon. -~mo. oommm. mosmw. omsom. onmmm. amnao. momma. Nmo:o. smoom. Noaob. n«.o ooooo. 0o~oo. ooooo. hooao. ooooo. ooéco. naooo. ownoo. noooo. oom:«. hawma. buonN. «ache. moaon. «smom. «nmdm. anawo. :no3m. nmooo. ohaom. Gmomw. :nmuo. momma. non:o. ~«.o mBZWHUHmmmov MUZMQHmZOU 49mm QMQHm mZO m mqmdfi cocoa. nndoo. ooooo. co~oo. ooooo. nosno. noooo. mNKOo. :Nnoo. :nawa. mssoo. snmom. noomm. usnmn. oNosm. smm~:. onooo. ~mmam. :moom. osnsm. owoom. o:o~m. somwm. annao. ««.o ooooa.om ooooo.m ooooo.mm ooooo.m ooooo.oo ooooo.o onoom.:s oooo:.s oooo:.w~ oooo:.~ ooaon.:n ooooo.~ oooo~.oo oooo~.o oooo«.:o ooooo.n ooooo.mo ooooo.n ooobo.oo oooo~.o oooao.~o oo::n.o ooo~o.oo ooo~0.o m ooooo.om ooooo.m. ooo=~.o: oaso~.: oooo:.on ossss.n coasm.m~ oooao.n oaooo.o~ onaso.~ oass~.m« oooo~.~ ooooo.oa oussc.4 oaaoo.m oooo:.« asswm.: ossmn.« ooaso.n ooo:~.« ossom.~ aasmu.« ooooo.« ooooo.« c 104 ooooo.a ooooo.« ooooo.« ooooo.« ooooo.a ooooo.« ooooo.« ooooo.« ooooo.« ooooo.a ooooo.« o5ooo.« ooooo.« ooooo.« ooooo.a ooooo.« ooooo.« ooooo.« ooooo.« ooooo.« ooooo.« ooooo.« ooooo.« ooooo.« old ooooo.a «“000. ooooo.« 00000. oaaoo.« 00000. ooooo.a ooooo.a ooaoo.« ooooo.« ooaoo.a ooaoo.a ooaoo.« NA000. 00000. «0000. mom00. 00:00. oa000. o0a00. ooooo.« ~Nno0. 00mo0. «on00. 0.o Q<>mMBZH mUZMQHmZOU m0 EHZHH mmamb smfifim mommm ooooo.« ~3o00. ooooo.« a3000. oooon.« ka0m0. cacao.“ 00000. oooao.« o0000. nsoao.« 00000. ooooo.« «0000. ooooo.a non0. 0:000. 0:0m0. namm0. 0Hn00. ooooo.« mo0o0. ~«n00. m~mo0. o.o 00000. 0aao0. ooooo.« 0&000. ooooo.« 0uo00. ooooo.« akm00. ooaoo.« 0ao00. oopoo.« om000. ooooo.a «0000. oosoa.« ~m~00. n~000. 0‘000. 0m000. :M300. ooooo.« ~3400. 00000. ::~00. s.o Nacho. "snag. ~0000. m3nko. o5oso.« omsmo. onosn.« ~03~0. ossoo.a knaom. 63040.“ mas00. osooo.« mskwm. assoo.« nmaom. @0000. enema. mm000. wssmo. on.so.« pmmmm. ::Nom. ~nmom. 0.9 eeosm. oseom. mamas. noses. Rooms. Nehms. esoos.« chasm. cooss.« mkkso. oesso.« mamas. cocoa.“ «nope. cases.“ memos. memos. «use». obese. comma. asses.“ asses. whose. omens. m.o n0a~o. ~000N. 0500:. s~n0:. naon0. 0~J~0. 00000. «moon. 00000. n~0:0. oraoo.a 0N000. ooooo.« 0nos0. oooao.« 00:50. 00000. Nmsm0. 00000. «00s0. ooooo.a :«400. nn000. n0ao0. :.o Noooo. «0m0o. Opmao. Nosmw. 009K3. :Nn~:. 000:0. saxnm. 30:00. ooam0. ~00m0. 0000s. ooooo.« «40:0. ooauo.« onna0. ooooo.d annm0. ©0000. 0ann0. oqooa.« nn«:0. n~000. odoam. n.o mEZmHUHmmmOU mUZmQHmZOU 48mm QmQHm mZO v mqmdfi ooooo.om ooooo.m oaoao.0m oaoo0.m oooo0.m0 oooo0.0 oauam.:~ oasgo.~ oooo:.0~ oooo:.~ oaain.70 Onoon.~ oooom.0o oaqu~.o 84.35..o oooom.0 oooco.m0 o~500.o oooco.00 ooo0~.o 044:0.a0 045:0.» oss~o.oo ooo~0.n o oooo:.om ooooo.m oaom~.o: opos~.: oooo:.on oooo:.n oaoam.m~ owowu.m oooo0.am oso_m.~ oona~.ma onoum.m oooon.od o:a.0.a on..0.m oooor.u ooom0.: oawmn.a ooo:0.n oooa~.a oooom.~ 0530A.“ oooo0.« ooomo.a a APPENDIX B SKETCHES OF BETA DISTRIBUTIONS Plots of selected beta distributions are shown in this Appendix. Certain statistics related to each of the curves shown here were given in Appendix A. Two curves, with the same mode, are shown on each plate. The error rate corresponding to the highest point on each of the two curves is the same. indicating they share the same mode. These plots were done by the CDC 3600 computer in the Michigan State University Computer Center. The actual program, written in the FORTRAN IV language, is shown in Appendix D. 105 106 L J, g4/0(=1.98 6:98.02 }. I . \ ' 1 P (X) , 7“: \_ /0(=l.08 (5:8.92 k \W '1‘ “‘T“‘--~— 1 J 0 0.25 0.50 0.75 Error Rate FIGURE 5. TWO BETA CURVES WITH MODE OF 0.01 b A )1 \/ O(=2.96 5:97.04 1 A r" (\‘(cleue {3=8.84 .- “'0-” M- l n...- 0.25 P (x) 0 0.50 Error Rate FIGURE 6. TWO BETA CURVES WITH MODE OF 0.02 ' \ P(x) "/o(=3.94 {5:96.06 1-1. .. o<=1.24 =8.76 / \“‘\’:» fl 0 I 0.25“ 0.50 0.75 Error Rate FIGURE 7. TWO BETA CURVES WITH MODE OF 0.03 107 “l‘ P(x) / 004.92 fi=95.08 I} 'N"\‘if*<.:%1_°32 p=B.68 ‘ ‘A— 0 0.25 0.50 0.75 Error Rate FIGURE 8. TWO BETA CURVES WITH MODE OF 0.04 P(x) /0(=5.90 5:94.10 .. «=1.40 =8.60 \. ‘7“*-~—— . 0 0.25 0.50 0.75 Error Rate FIGURE 9. TWO BETA CURVES WITH MODE OF 0.05 P(x) . A =10.80 fi=89.20 { < 001.80 ,6:8.20 / 6““:1 +— 1 0 0.25 0.50 0.75 Error Rate FIGURE 10. TWO BETA CURVES WITH MODE OF 0.10 108 P(x) + /9<=15.70 8:84.30 7 \ 7 . =2.20 =7.8 ',,,/./~«—--—~e..-.-....-“__.L {ff 4 ° 0 0.25 0.50 0.75 Error Rate FIGURE 11. TWO BETA CURVES WITH MODE OF 0.15 P(x) (0:20.60 [3:79.40 L. /°(=2.60 ’627.40 4 __M‘ L 0 0.25 0.50 0.75 Error Rate FIGURE 12. TWO BETA CURVES WITH MODE OF 0.20 P(X) /o(=25.50 8:74.50 ///\\\‘ x/<><=3.00 p:7.00 A’ L ‘ ‘~~._:~‘"“"T--—~— l 0 0.25 0.50 0.75 Error Rate FIGURE 13. TWO BETA CURVES WITH MODE OF 0.25 P(x) P (x) P (X) .__,4-vf- ., w---h_. p..." _/ 1 - . l 109 [0:30 .40 3:69.60 l/z/‘7~~,\\ /d:3.40 fl36.60 4"..- ’—_—---“ _ - _f’v-‘I,’ "".--\—...-.~‘.-. 0.25 0.50 0.75 Error Rate FIGURE 14. TWO BETA CURVES WITH MODE OF 0.30 /6<=40.20 [3:59.80 . = .2 = . .//\_§. /o( 4 0 )3 5 80 ,be»**::37”."_fig Ah?“"‘“*-w-— - so 0.25 0.50 0.75 Error Rate FIGURE 15. TWO BETA CURVES WITH MODE OF 0.40 «=50 .00 3:50.00 /~<. f/o<==5.00 /3=5.00 .Ivrrofir::7*I——n I\\——““**1 504 0.25 0.50 0.75 Error Rate FIGURE 16. TWO BETA CURVES WITH MODE OF 0.50 APPENDIX C REVISION OF A BETA PRIOR DISTRIBUTION BY A BINOMIAL SAMPLE RESULT Given the prior distribution with mean, mode, variance, etc. chosen to reflect a belief about the under- lying pOpulation parameter 0, and expressed as _ “-1 B-1 001 339(0) _. C19 (1-0) on where C1 is chosen so that 1 (K-1 -1 Cl J6, 0 (1—0)8 d0 = 1 then the prior distribution can be revised using the re- sults of a sample of n items, x of which were errors. _' n x _ n-x fX|9(X19) —-(X)9 (l 9) The joint (unconditional) distribution of X and 8 is simply the product fX 9(x,0) = fx|9(X\9)f9(9) I __ OK-l _ fl—l n x _ n-X fX,Q(X'9) _. C18 (1 8) (X)8 (l 8) __ n OH-X-l B-tn-x-l The margional density of X can be found by inte- grating the joint density fX e(x,8) over the range of 9 (0 to l). 110 111 fx(X):: /fX 9(X.8) d0 Range of 9 —_- l n 0H-X-1 _ 3+n-x-l A cl(x)0 (1 8) d0 :0 (n) 10“+X’1(1-8)B+n_x’ 100 l x O The posterior function f8|x(9lx) is found by divid- ing the joint distribution by the margional distribution. f . x 9(X 9) fg‘XWIX): fx(x) C1 (:2) e«H-x-l (1_9)A+n-x-1 r1 71;q+x—1 B+n—x41 C1(X)/O 8 (1-0) d9 901+X-1(1_9)5+n-X—1 ‘/019“+X-1(l_9)fl+n-X‘ld9 However, the denominator is a definite integral, equal to a constant, C2. 02 :.-. [018“+X'1(1-8)3*“’X'lde ._ F (OH-x) Fig +n—XL I r“ (006”!) Since we wish the integral (cumulative distribution function) of the revised distribution to be equal to 1 over the range (0,1) 1 _ _ _ [1 —— 0"”X 1(1-0)’Mn X 180 2 1 0 02 then C3 may be set equal to %— and 2 112 ottx-l __ B+n—x-—l f9|X(9|x) -_-. c 8 (1 8) 3 which is another beta distribution, with parameters o('= d-i-x-l (3': fl-tn-x-l. APPENDIX D COMPUTER PROGRAMS This appendix contains the computer programs used to produce the beta distribution tables in Appendix A and the plots in Appendix B. All programs were written in FORTRAN IV. The plotting program was run on the Michigan State University CDC 3600 machine, while the other routines were run on the CDC 6500. These programs are given here for reference only since certain software features used are not commonly available on other machines and changes would be necessary before they could be run on most other computers. This set Of programs was written as a group. Program TABLE computes statistics for a beta distribution with various modes and with A+B (map) of either 10 or 100. It punches a deck of cards, one card for each mode and A+B combination, which gives A, B, and the constant c for the basic beta distribution formula discussed in Chapter V. In addition, TABLE prints a table of statistics giving the mean, mode,and variance of each distribution. The main program determines A, B, and the mode. Subroutine WORK computes the mean and variance of that function and prints 113 114 the table. Subroutine SIMPSON uses the Simpson rule for the approximation of definite integrals to find the con- stant Of integration, c, which will make the value of the integral over the range X30 to X=l equal to 1.00. Sub- routine ONE finds the Y value Of the beta function at each specific X value called for by SIMPSON. Program TABLE and its subroutines follow on the next three pages. 115 2 mnHmdrHu .. Cm CO OF 06 “WC".FJ6FOFV ua .& PC¥$$¢FNPCP .mOO$.FOFv¥003 JJ¢Euflh L.Cfi.Cun .qu .m .4 if: u If. IlliJXu-fl Ill ..ccca\.JXlecnI - , :z . illobquXlllilllli-! O . "HEX .cttflw. 1| cocn m 4 m+< A .ll. It‘ll]! .- l. Ao.u:00v m mumoX.U. .uZC m2~t400m_.u . .Czwfllilillt- I 11' 2.03.700 :-:3,a.rhinitistxcm+c4ur¢b¢+um+lw¢3a0u1¢tmlllllll U.¢.4.CCN IUZZQ . :- w42~.829:(.i 9|. A+4ua4u4ua4. -I “0.4.>.x.ucmzo 4440 i: i .. -- :xux .n\I*.4*>+ 4&04 u 4EH4 .s--i2 .- .3.m.<.>.x.0»t283444u. -I IIDXuX -:.w:z+8280I¢tiIIII!.I .m\I*>#.m+4wa4u*.¢+4u.a4¢40,0.4- is .m.4.>.x.ucmzo 4440 4.0.ncoov m mqm48 112.11.-."— ~ ‘51 mi; 118 Program PLOTIT uses the cards punched by TABLE as input data and directs a routine on file in the Michi- gan State University Computer Program Library to plot the curves in Appendix B. One plot is produced for each run of the program and several curves may be plotted on the same set of X and Y axes by running the apprOpriate in— put data cards together in one run. PLOTIT first initial- izes the plotter pointer, then directs the drawing Of the X and Y axes, and finally computes and plots the apprOp- riate X and Y values for the beta curve determined by the A, B,and C values found on the input card being read. The last call to PLOT prepares the plotter mechanism for the call of the next user. Program PLOTIT appears on the following page. 119 TABLE 6. COMPUTER PROGRAM ”PLOTIT" FDCTFQ -LOTIT 9“ FORMAT (?F10050F40020) 1? ”FA” PSOAORQC -- :s—+Feswen+—tsnwss--—ww~—~ QR C=lOoO**C ——-~*~w~chLt*PLOTlhufiqfiofionQ1000!innnOn’ CALL pLOT(OOOO‘loO¢?) ~*-~-—-~CALL~PLOT(0.0.0.0.0r ----- X3000 -——————-——69—+¢0—+k$tw4 XAalA --n——»—-xsx4¢a.o~w-~u~_~-L~ CALL PLOT(0.OQX01) ' fifiLtrpLOTtnafitXrtlum-m“"* 140 CALL PLOTIO.OQX.1) ‘—————————€Atb—Pt9¥+91fiTfivfiT+}v ~0" Y=OOO oo—taswts:rrtfi—~ ”"” X8218 -.- ¥=¥RHfiofi—----—————--- --- -~ —- -' -- CALL PLOT(Y900091) cast 9L9¥+¥Tgtfi+ff§——mwmm‘“—“"““‘ 1a: CALL PL0T(Y900001) “~W"-_~-~CALL PLOTfOonafiofioll- DO 150 1:1 100 ‘---'-<-~-~-—XI=I--~ *- - ~ -* 3“: x=xI/lnn.0 _.___—————¥:€#+y44+4-}184+%e(1.0-X)** EQUOOUQ U ********* ¥****** 11 *u- *.x.*- ....*****.... #1.. 41:1..- x»). .Vu...1r*.u.m.fl..u¢** .x. “#302“ flCCNtho PDOFDO o H§$+mmfi44>1¢aqfimv€flfl 1 :mqu>: Edmwomm mmBDmSOU .h mqm -.11 511111+m.<.>px.uvlmzo-mz+ktoam3m -1 czw 11 l1.-.11112u:rmfl -1 wzzakzcu (a :11-:T ¢w« h14wu« cm or 00 .c.u.hJ.+.x.uvmzo JJqu .11. 111--..- ..111113Xflwm 1 .n\I*oc*>+ (mad u *.m+.x.uew201uu«b -1 I+Xux .-111111 .m\I*}*bc+fim¢«h«wmq .1 am.<.>.x.u.uzo Judo I+an 5.1.2... 1.3. . A 1.» in. (Do 1. .M\>*Iu.x.v+mz®:flu