To bring error itself under law might seem beyond human power. He who errs surely diverges from law, and it might be deemed hopeless out of error to draw truth. One of the most remarkable achievements of the human intellect is the establishment of a general theory which not only enables us among discrepant results to approximate to the truth, but to assign the degree of probability which fairly attaches to this conclusion. It would be a mistake indeed to suppose that this law is necessarily the best guide under all circumstances. Every measuring instrument and every form of experiment may have its own special law of error; there may in one instrument be a tendency in one direction and in another in the opposite direction. Every process has its peculiar liabilities to disturbance, and we are never relieved from the necessity of providing against special difficulties. The general Law of Error is the best guide only when we have exhausted all other means of approximation, and still find discrepancies, which are due to unknown causes. We must treat such residual differences in some way or other, since they will occur in all accurate experiments, and as their origin is assumed to be unknown, there is no reason why we should treat them differently in different cases. Accordingly the ultimate Law of Error must be a uniform and general one.
It is perfectly recognised by mathematicians that in each case a special Law of Error may exist, and should be discovered if possible. “Nothing can be more unlikely than that the errors committed in all classes of observations should follow the same law,”276 and the special Laws of Error which will apply to certain instruments, as for instance the repeating circle, have been investigated by Bravais.277 He concludes that every distinct cause of error gives rise to a curve of possibility of errors, which may have any form,—a curve which we may either be able or unable to discover, and which in the first case may be determined by à priori considerations on the peculiar nature of this cause, or which may be determined à posteriori by observation. Whenever it is practicable and worth the labour, we ought to investigate these special conditions of error; nevertheless, when there are a great number of different sources of minute error, the general resultant will always tend to obey that general law which we are about to consider.
Mathematicians agree far better as to the form of the Law of Error than they do as to the manner in which it can be deduced and proved. They agree that among a number of discrepant results of observation, that mean quantity is probably the best approximation to the truth which makes the sum of the squares of the errors as small as possible. But there are three principal ways in which this law has been arrived at respectively by Gauss, by Laplace and Quetelet, and by Sir John Herschel. Gauss proceeds much upon assumption; Herschel rests upon geometrical considerations; while Laplace and Quetelet regard the Law of Error as a development of the doctrine of combinations. A number of other mathematicians, such as Adrain of New Brunswick, Bessel, Ivory, Donkin, Leslie Ellis, Tait, and Crofton have either attempted independent proofs or have modified or commented on those here to be described. For full accounts of the literature of the subject the reader should refer either to Mr. Todhunter’s History of the Theory of Probability or to the able memoir of Mr. J. W. L. Glaisher.278
According to Gauss the Law of Error expresses the comparative probability of errors of various magnitude, and partly from experience, partly from à priori considerations, we may readily lay down certain conditions to which the law will certainly conform. It may fairly be assumed as a first principle to guide us in the selection of the law, that large errors will be far less frequent and probable than small ones. We know that very large errors are almost impossible, so that the probability must rapidly decrease as the amount of the error increases. A second principle is that positive and negative errors shall be equally probable, which may certainly be assumed, because we are supposed to be devoid of any knowledge as to the causes of the residual errors. It follows that the probability of the error must be a function of an even power of the magnitude, that is of the square, or the fourth power, or the sixth power, otherwise the probability of the same amount of error would vary according as the error was positive or negative. The even powers x2, x4, x6, &c., are always intrinsically positive, whether x be positive or negative. There is no à priori reason why one rather than another of these even powers should be selected. Gauss himself allows that the fourth or sixth power would fulfil the conditions as well as the second;279 but in the absence of any theoretical reasons we should prefer the second power, because it leads to formulæ of great comparative simplicity. Did the Law of Error necessitate the use of the higher powers of the error, the complexity of the necessary calculations would much reduce the utility of the theory.
By mathematical reasoning which it would be undesirable to attempt to follow in this book, it is shown that under these conditions, the facility of occurrence, or in other, words, the probability of error is expressed by a function of the general form ε–h2 x2, in which x represents the variable amount of errors. From this law, to be more fully described in the following sections, it at once follows that the most probable result of any observations is that which makes the sum of the squares of the consequent errors the least possible. Let a, b, c, &c., be the results of observation, and x the quantity selected as the most probable, that is the most free from unknown errors: then we must determine x so that (a - x)2 + (b - x)2 + (c - x)2 + . . . shall be the least possible quantity. Thus we arrive at the celebrated Method of Least Squares, as it is usually called, which appears to have been first distinctly put in practice by Gauss in 1795, while Legendre first published in 1806 an account of the process in his work, entitled, Nouvelles Méthodes pour la Détermination des Orbites des Comètes. It is worthy of notice, however, that Roger Cotes had long previously recommended a method of equivalent nature in his tract, “Estimatio Erroris in Mixta Mathesi.”280
A second way of arriving at the Law of Error was proposed by Herschel, and although only applicable to geometrical cases, it is remarkable as showing that from whatever point of view we regard the subject, the same principle will be detected. After assuming that some general law must exist, and that it is subject to the principles of probability, he supposes that a ball is dropped from a high point with the intention that it shall strike a given mark on a horizontal plane. In the absence of any known causes of deviation it will either strike that mark, or, as is infinitely more probable, diverge from it by an amount which we must regard as error of unknown origin. Now, to quote the words of Herschel,281 “the probability of that error is the unknown function of its square, i.e. of the sum of the squares of its deviations in any two rectangular directions. Now, the probability of any deviation depending solely on its magnitude, and not on its direction, it follows that the probability of each of these rectangular deviations must be the same function of its square. And since the observed oblique deviation is equivalent to the two rectangular ones, supposed concurrent, and which are essentially independent of one another, and is, therefore, a compound event of which they are the simple independent constituents, therefore its probability will be the product of their separate probabilities. Thus the form of our unknown function comes to be determined from this condition, viz., that the product of such functions of two independent elements is equal to the same function of their sum. But it is shown in every work on algebra that this property is the peculiar characteristic of, and belongs only to, the exponential or antilogarithmic function. This, then, is the function of the square of the error, which expresses the probability of committing that error. That probability decreases, therefore, in geometrical progression, as the square of the error increases in arithmetical.”
However much presumption the modes of determining the Law of Error, already described, may give in favour of the law usually adopted, it is difficult to feel that the arguments are satisfactory. The law adopted is chosen rather on the grounds of convenience and plausibility, than because it can be seen to be the necessary law. We can however approach the subject from an entirely different point of view, and yet get to the same result.
Let us assume that a particular observation is subject to four chances of error, each of which will increase the result one inch if it occurs. Each of these errors is to be regarded as an event independent of the rest and we can therefore assign, by the theory of probability, the comparative probability and frequency of each conjunction of errors. From the Arithmetical Triangle (pp. 182–188) we learn that no error at all can happen only in one way; an error of one inch can happen in 4 ways; and the ways of happening of errors of 2, 3 and 4 inches respectively, will be 6, 4 and 1 in number.
We may infer that the error of two inches is the most likely to occur, and will occur in the long run in six cases out of sixteen. Errors of one and three inches will be equally likely, but will occur less frequently; while no error at all, or one of four inches will be a comparatively rare occurrence. If we now suppose the errors to act as often in one direction as the other, the effect will be to alter the average error by the amount of two inches, and we shall have the following results:—
| Negative error of 2 inches | 1 |
way. |
| Negative error of 1 inch | 4 |
ways. |
| No error at all | 6 |
ways. |
| Positive error of 1 inch | 4 |
ways. |
| Positive error of 2 inches | 1 |
way. |
We may now imagine the number of causes of error increased and the amount of each error decreased, and the arithmetical triangle will give us the frequency of the resulting errors. Thus if there be five positive causes of error and five negative causes, the following table shows the numbers of errors of various amount which will be the result:—
| Direction of Error. | Positive Error. |
Negative Error. |
|
| Amount of Error. | 5, 4, 3, 2, 1 |
0 |
1, 2, 3, 4, 5 |
| Number of such Errors. | 1, 10, 45, 120, 210 |
252 |
210 120, 45, 10, 1 |
It is plain that from such numbers I can ascertain the probability of any particular amount of error under the conditions supposed. The probability of a positive error of exactly one inch is 2101024, in which fraction the numerator is the number of combinations giving one inch positive error, and the denominator the whole number of possible errors of all magnitudes. I can also, by adding together the appropriate numbers get the probability of an error not exceeding a certain amount. Thus the probability of an error of three inches or less, positive or negative, is a fraction whose numerator is the sum of 45 + 120 + 210 + 252 + 210 + 120 + 45, and the denominator, as before, giving the result 10021024. We may see at once that, according to these principles, the probability of small errors is far greater than of large ones: the odds are 1002 to 22, or more than 45 to 1, that the error will not exceed three inches; and the odds are 1022 to 2 against the occurrence of the greatest possible error of five inches.
If any case should arise in which the observer knows the number and magnitude of the chief errors which may occur, he ought certainly to calculate from the Arithmetical Triangle the special Law of Error which would apply. But the general law, of which we are in search, is to be used in the dark, when we have no knowledge whatever of the sources of error. To assume any special number of causes of error is then an arbitrary proceeding, and mathematicians have chosen the least arbitrary course of imagining the existence of an infinite number of infinitely small errors, just as, in the inverse method of probabilities, an infinite number of infinitely improbable hypotheses were submitted to calculation (p. 255).
The reasons in favour of this choice are of several different kinds.
1. It cannot be denied that there may exist infinitely numerous causes of error in any act of observation.
2. The law resulting from the hypothesis of a moderate number of causes of error, does not appreciably differ from that given by the hypothesis of an infinite number of causes of error.
3. We gain by the hypothesis of infinity a general law capable of ready calculation, and applicable by uniform rules to all problems.
4. This law, when tested by comparison with extensive series of observations, is strikingly verified, as will be shown in a later section.
When we imagine the existence of any large number of causes of error, for instance one hundred, the numbers of combinations become impracticably large, as may be seen to be the case from a glance at the Arithmetical Triangle, which proceeds only up to the seventeenth line. Quetelet, by suitable abbreviating processes, calculated out a table of probability of errors on the hypothesis of one thousand distinct causes;282 but mathematicians have generally proceeded on the hypothesis of infinity, and then, by the devices of analysis, have substituted a general law of easy treatment. In mathematical works upon the subject, it is shown that the standard Law of Error is expressed in the formula
in which x is the amount of the error, Y the maximum ordinate of the curve of error, and c a number constant for each series of observations, and expressing the amount of the tendency to error, varying between one series of observations and another. The letter ε is the mathematical constant, the sum of ratios between the numbers of permutations and combinations, previously referred to (p. 330).
To show the close correspondence of this general law with the special law which might be derived from the supposition of a moderate number of causes of error, I have in the accompanying figure drawn a curved line representing accurately the variation of y when x in the above formula is taken equal 0, 12, 1, 32, 2, &c., positive or negative, the arbitrary quantities Y and c being each assumed equal to unity, in order to simplify the calculations. In the same figure are inserted eleven dots, whose heights above the base line are proportional to the numbers in the eleventh line of the Arithmetical Triangle, thus representing the comparative probabilities of errors of various amounts arising from ten equal causes of error. The correspondence of the general and the special Law of Error is almost as close as can be exhibited in the figure, and the assumption of a greater number of equal causes of error would render the correspondence far more close.
It may be explained that the ordinates NM, nm, n′m′, represent values of y in the equation expressing the Law of Error. The occurrence of any one definite amount of error is infinitely improbable, because an infinite number of such ordinates might be drawn. But the probability of an error occurring between certain limits is finite, and is represented by a portion of the area of the curve. Thus the probability that an error, positive or negative, not exceeding unity will occur, is represented by the area Mmnn′m′, in short, by the area standing upon the line nn′. Since every observation must either have some definite error or none at all, it follows that the whole area of the curve should be considered as the unit expressing certainty, and the probability of an error falling between particular limits will then be expressed by the ratio which the area of the curve between those limits bears to the whole area of the curve.
The mere fact that the Law of Error allows of the possible existence of errors of every assignable amount shows that it is only approximately true. We may fairly say that in measuring a mile it would be impossible to commit an error of a hundred miles, and the length of life would never allow of our committing an error of one million miles. Nevertheless the general Law of Error would assign a probability for an error of that amount or more, but so small a probability as to be utterly inconsiderable and almost inconceivable. All that can, or in fact need, be said in defence of the law is, that it may be made to represent the errors in any special case to a very close approximation, and that the probability of large and practically impossible errors, as given by the law, will be so small as to be entirely inconsiderable. And as we are dealing with error itself, and our results pretend to nothing more than approximation and probability, an indefinitely small error in our process of approximation is of no importance whatever.
It is worthy of notice that this Law of Error, abstruse though the subject may seem, is really founded upon the simplest principles. It arises entirely out of the difference between permutations and combinations, a subject upon which I may seem to have dwelt with unnecessary prolixity in previous pages (pp. 170, 189). The order in which we add quantities together does not affect the amount of the sum, so that if there be three positive and five negative causes of error in operation, it does not matter in which order they are considered as acting. They may be intermixed in any arrangement, and yet the result will be the same. The reader should not fail to notice how laws or principles which appeared to be absurdly simple and evident when first noticed, reappear in the most complicated and mysterious processes of scientific method. The fundamental Laws of Identity and Difference gave rise to the Logical Alphabet which, after abstracting the character of the differences, led to the Arithmetical Triangle. The Law of Error is defined by an infinitely high line of that triangle, and the law proves that the mean is the most probable result, and that divergencies from the mean become much less probable as they increase in amount. Now the comparative greatness of the numbers towards the middle of each line of the Arithmetical Triangle is entirely due to the indifference of order in space or time, which was first prominently pointed out as a condition of logical relations, and the symbols indicating them (pp. 32–35), and which was afterwards shown to attach equally to numerical symbols, the derivatives of logical terms (p. 160).
The theory of error which we have been considering rests entirely upon an assumption, namely that when known sources of disturbances are allowed for, there yet remain an indefinite, possibly an infinite number of other minute sources of error, which will as often produce excess as deficiency. Granting this assumption, the Law of Error must be as it is usually taken to be, and there is no more need to verify it empirically than to test the truth of one of Euclid’s propositions mechanically. Nevertheless, it is an interesting occupation to verify even the propositions of geometry, and it is still more instructive to try whether a large number of observations will justify our assumption of the Law of Error.
Encke has given an excellent instance of the correspondence of theory with experience, in the case of observations of the differences of Right Ascension of the sun and two stars, namely α Aquilæ and α Canis minoris. The observations were 470 in number, and were made by Bradley and reduced by Bessel, who found the probable error of the final result to be only about one-fourth part of a second (0·2637). He then compared the numbers of errors of each magnitude from 0·1 second upwards, as actually given by the observations, with what should occur according to the Law of Error.
The results were as follow:—283
Magnitude of the errors in parts of a second. |
Number of errors of each magnitude according to |
|||
Observation. |
Theory. |
|||
0·0 |
to |
0·1 | 94 |
95 |
·1 |
" |
·2 | 88 |
89 |
·2 |
" |
·3 | 78 |
78 |
·3 |
" |
·4 | 58 |
64 |
·4 |
" |
·5 | 51 |
50 |
·5 |
" |
·6 | 36 |
36 |
·6 |
" |
·7 | 26 |
24 |
·7 |
" |
·8 | 14 |
15 |
·8 |
" |
·9 | 10 |
9 |
·9 |
" |
1·0 | 7 |
5 |
above |
1·0 | 8 |
5 | |
The reader will remark that the correspondence is very close, except as regards larger errors, which are excessive in practice. It is one objection, indeed, to the theory of error, that, being expressed in a continuous mathematical function, it contemplates the existence of errors of every magnitude, such as could not practically occur; yet in this case the theory seems to under-estimate the number of large errors.
Another comparison of the law with observation was made by Quetelet, who investigated the errors of 487 determinations in time of the Right Ascension of the Pole-Star made at Greenwich during the four years 1836–39. These observations, although carefully corrected for all known causes of error, as well as for nutation, precession, &c., are yet of course found to differ, and being classified as regards intervals of one-half second of time, and then proportionately increased in number, so that their sum may be one thousand, give the following results as compared with what Quetelet’s theory would lead us to expect:—284
Magnitude of error in tenths of a second. |
Number of Errors |
Magnitude of error in tenths of a second. |
Number of Errors |
||
by Observation. |
by Theory. |
by Observation. |
by Theory. |
||
0·0 |
168 |
163 |
– |
– |
– |
+0·5 |
148 |
147 |
–0·5 |
150 |
152 |
+1·0 |
129 |
112 |
–1·0 |
126 |
121 |
+1·5 |
78 |
72 |
–1·5 |
74 |
82 |
+2·0 |
33 |
40 |
–2·0 |
43 |
46 |
+2·5 |
10 |
19 |
–2·5 |
25 |
22 |
+3·0 |
2 |
10 |
–3·0 |
12 |
10 |
– |
– |
– |
–3·5 |
2 |
4 |
In this instance also the correspondence is satisfactory, but the divergence between theory and fact is in the opposite direction to that discovered in the former comparison, the larger errors being less frequent than theory would indicate. It will be noticed that Quetelet’s theoretical results are not symmetrical.
One immediate result of the Law of Error, as thus stated, is that the mean result is the most probable one; and when there is only a single variable this mean is found by the familiar arithmetical process. An unfortunate error has crept into several works which allude to this subject. Mill, in treating of the “Elimination of Chance,” remarks in a note285 that “the mean is spoken of as if it were exactly the same thing as the average. But the mean, for purposes of inductive inquiry, is not the average, or arithmetical mean, though in a familiar illustration of the theory the difference may be disregarded.” He goes on to say that, according to mathematical principles, the most probable result is that for which the sums of the squares of the deviations is the least possible. It seems probable that Mill and other writers were misled by Whewell, who says286 that “The method of least squares is in fact a method of means, but with some peculiar characters.... The method proceeds upon this supposition: that all errors are not equally probable, but that small errors are more probable than large ones.” He adds that this method “removes much that is arbitrary in the method of means.” It is strange to find a mathematician like Whewell making such remarks, when there is no doubt whatever that the Method of Means is only an application of the Method of Least Squares. They are, in fact, the same method, except that the latter method may be applied to cases where two or more quantities have to be determined at the same time. Lubbock and Drinkwater say,287 “If only one quantity has to be determined, this method evidently resolves itself into taking the mean of all the values given by observation.” Encke says,288 that the expression for the probability of an error “not only contains in itself the principle of the arithmetical mean, but depends so immediately upon it, that for all those magnitudes for which the arithmetical mean holds good in the simple cases in which it is principally applied, no other law of probability can be assumed than that which is expressed by this formula.”
When we draw a conclusion from the numerical results of observations we ought not to consider it sufficient, in cases of importance, to content ourselves with finding the simple mean and treating it as true. We ought also to ascertain what is the degree of confidence we may place in this mean, and our confidence should be measured by the degree of concurrence of the observations from which it is derived. In some cases the mean may be approximately certain and accurate. In other cases it may really be worth little or nothing. The Law of Error enables us to give exact expression to the degree of confidence proper in any case; for it shows how to calculate the probability of a divergence of any amount from the mean, and we can thence ascertain the probability that the mean in question is within a certain distance from the true number. The probable error is taken by mathematicians to mean the limits within which it is as likely as not that the truth will fall. Thus if 5·45 be the mean of all the determinations of the density of the earth, and ·20 be approximately the probable error, the meaning is that the probability of the real density of the earth falling between 5·25 and 5·65 is 12. Any other limits might have been selected at will. We might calculate the limits within which it was one hundred or one thousand to one that the truth would fall; but there is a convention to take the even odds one to one, as the quantity of probability of which the limits are to be estimated.
Many books on probability give rules for making the calculations, but as, in the progress of science, persons ought to become more familiar with these processes, I propose to repeat the rules here and illustrate their use. The calculations, when made in accordance with the directions, involve none but arithmetic or logarithmic operations.
The following are the rules for treating a mean result, so as thoroughly to ascertain its trustworthiness.
1. Draw the mean of all the observed results.
2. Find the excess or defect, that is, the error of each result from the mean.
3. Square each of these reputed errors.
4. Add together all these squares of the errors, which are of course all positive.
5. Divide by one less than the number of observations. This gives the square of the mean error.
6. Take the square root of the last result; it is the mean error of a single observation.
7. Divide now by the square root of the number of observations, and we get the mean error of the mean result.
8. Lastly, multiply by the natural constant 0·6745 (or approximately by 0·674, or even by 23), and we arrive at the probable error of the mean result.
Suppose, for instance, that five measurements of the height of a hill, by the barometer or otherwise, have given the numbers of feet as 293, 301, 306, 307, 313; we want to know the probable error of the mean, namely 304. Now the differences between this mean and the above numbers, paying no regard to direction, are 11, 3, 2, 3, 9; their squares are 121, 9, 4, 9, 81, and the sum of the squares of the errors consequently 224. The number of observations being 5, we divide by 1 less, or 4, getting 56. This is the square of the mean error, and taking its square root we have 7·48 (say 7 12), the mean error of a single observation. Dividing by 2·236, the square root of 5, the number of observations, we find the mean error of the mean result to be 3·35, or say 3 13, and lastly, multiplying by ·6745, we arrive at the probable error of the mean result, which is found to be 2·259, or say 2 14. The meaning of this is that the probability is one half, or the odds are even that the true height of the mountain lies between 301 34 and 306 14 feet. We have thus an exact measure of the degree of credibility of our mean result, which mean indicates the most likely point for the truth to fall upon.
The reader should observe that as the object in these calculations is only to gain a notion of the degree of confidence with which we view the mean, there is no real use in carrying the calculations to any great degree of precision; and whenever the neglect of decimal fractions, or even the slight alteration of a number, will much abbreviate the computations, it may be fearlessly done, except in cases of high importance and precision. Brodie has shown how the law of error may be usefully applied in chemical investigations, and some illustrations of its employment may be found in his paper.289
The experiments of Benzenberg to detect the revolution of the earth, by the deviation of a ball from the perpendicular line in falling down a deep pit, have been cited by Encke290 as an interesting illustration of the Law of Error. The mean deviation was 5·086 lines, and its probable error was calculated by Encke to be not more than ·950 line, that is, the odds were even that the true result lay between 4·136 and 6·036. As the deviation, according to astronomical theory, should be 4·6 lines, which lies well within the limits, we may consider that the experiments are consistent with the Copernican system of the universe.
It will of course be understood that the probable error has regard only to those causes of errors which in the long run act as much in one direction as another; it takes no account of constant errors. The true result accordingly will often fall far beyond the limits of probable error, owing to some considerable constant error or errors, of the existence of which we are unaware.
We ought always to bear in mind that the mean of any series of observations is the best, that is, the most probable approximation to the truth, only in the absence of knowledge to the contrary. The selection of the mean rests entirely upon the probability that unknown causes of error will in the long run fall as often in one direction as the opposite, so that in drawing the mean they will balance each other. If we have any reason to suppose that there exists a tendency to error in one direction rather than the other, then to choose the mean would be to ignore that tendency. We may certainly approximate to the length of the circumference of a circle, by taking the mean of the perimeters of inscribed and circumscribed polygons of an equal and large number of sides. The length of the circular line undoubtedly lies between the lengths of the two perimeters, but it does not follow that the mean is the best approximation. It may in fact be shown that the circumference of the circle is very nearly equal to the perimeter of the inscribed polygon, together with one-third part of the difference between the inscribed and circumscribed polygons of the same number of sides. Having this knowledge, we ought of course to act upon it, instead of trusting to probability.
We may often perceive that a series of measurements tends towards an extreme limit rather than towards a mean. In endeavouring to obtain a correct estimate of the apparent diameter of the brightest fixed stars, we find a continuous diminution in estimates as the powers of observation increased. Kepler assigned to Sirius an apparent diameter of 240 seconds; Tycho Brahe made it 126; Gassendi 10 seconds; Galileo, Hevelius, and J. Cassini, 5 or 6 seconds. Halley, Michell, and subsequently Sir W. Herschel came to the conclusion that the brightest stars in the heavens could not have real discs of a second, and were probably much less in diameter. It would of course be absurd to take the mean of quantities which differ more than 240 times; and as the tendency has always been to smaller estimates, there is a considerable presumption in favour of the smallest.291
In many experiments and measurements we know that there is a preponderating tendency to error in one direction. The readings of a thermometer tend to rise as the age of the instrument increases, and no drawing of means will correct this result. Barometers, on the other hand, are likely to read too low instead of too high, owing to the imperfection of the vacuum and the action of capillary attraction. If the mercury be perfectly pure and no appreciable error be due to the measuring apparatus, the best barometer will be that which gives the highest result. In determining the specific gravity of a solid body the chief danger of error arises from bubbles of air adhering to the body, which would tend to make the specific gravity too small. Much attention must always be given to one-sided errors of this kind, since the multiplication of experiments does not remove the error. In such cases one very careful experiment is better than any number of careless ones.
When we have reasonable grounds for supposing that certain experimental results are liable to grave errors, we should exclude them in drawing a mean. If we want to find the most probable approximation to the velocity of sound in air, it would be absurd to go back to the old experiments which made the velocity from 1200 to 1474 feet per second; for we know that the old observers did not guard against errors arising from wind and other causes. Old chemical experiments are valueless as regards quantitative results. The old chemists found the atmosphere in different places to differ in composition nearly ten per cent., whereas modern accurate experimenters find very slight variations. Any method of measurement which we know to avoid a source of error is far to be preferred to others which trust to probabilities for the elimination of the error. As Flamsteed says,292 “One good instrument is of as much worth as a hundred indifferent ones.” But an instrument is good or bad only in a comparative sense, and no instrument gives invariable and truthful results. Hence we must always ultimately fall back upon probabilities for the selection of the final mean, when other precautions are exhausted.
Legendre, the discoverer of the method of Least Squares, recommended that observations differing very much from the results of his method should be rejected. The subject has been carefully investigated by Professor Pierce, who has proposed a criterion for the rejection of doubtful observations based on the following principle:293′“—observations should be rejected when the probability of the system of errors obtained by retaining them is less than that of the system of errors obtained by their rejection multiplied by the probability of making so many and no more abnormal observations.” Professor Pierce’s investigation is given nearly in his own words in Professor W. Chauvenet’s “Manual of Spherical and Practical Astronomy,” which contains a full and excellent discussion of the methods of treating numerical observations.294
Very difficult questions sometimes arise when one or more results of a method of experiment diverge widely from the mean of the rest. Are we or are we not to exclude them in adopting the supposed true mean result of the method? The drawing of a mean result rests, as I have frequently explained, upon the assumption that every error acting in one direction will probably be balanced by other errors acting in an opposite direction. If then we know or can possibly discover any causes of error not agreeing with this assumption, we shall be justified in excluding results which seem to be affected by this cause.
In reducing large series of astronomical observations, it is not uncommon to meet with numbers differing from others by a whole degree or half a degree, or some considerable integral quantity. These are errors which could hardly arise in the act of observation or in instrumental irregularity; but they might readily be accounted for by misreading of figures or mistaking of division marks. It would be absurd to trust to chance that such mistakes would balance each other in the long run, and it is therefore better to correct arbitrarily the supposed mistake, or better still, if new observations can be made, to strike out the divergent numbers altogether. When results come sometimes too great or too small in a regular manner, we should suspect that some part of the instrument slips through a definite space, or that a definite cause of error enters at times, and not at others. We should then make it a point of prime importance to discover the exact nature and amount of such an error, and either prevent its occurrence for the future or else introduce a corresponding correction. In many researches the whole difficulty will consist in this detection and avoidance of sources of error. Professor Roscoe found that the presence of phosphorus caused serious and almost unavoidable errors in the determination of the atomic weight of vanadium.295 Herschel, in reducing his observations of double stars at the Cape of Good Hope, was perplexed by an unaccountable difference of the angles of position as measured by the seven-feet equatorial and the twenty-feet reflector telescopes, and after a careful investigation was obliged to be contented with introducing a correction experimentally determined.296
When observations are sufficiently numerous it seems desirable to project the apparent errors into a curve, and then to observe whether this curve exhibits the symmetrical and characteristic form of the curve of error. If so, it may be inferred that the errors arise from many minute independent sources, and probably compensate each other in the mean result. Any considerable irregularity will indicate the existence of one-sided or large causes of error, which should be made the subject of investigation.
Even the most patient and exhaustive investigations will sometimes fail to disclose any reason why some results diverge from others. The question again recurs—Are we arbitrarily to exclude them? The answer should be in the negative as a general rule. The mere fact of divergence ought not to be taken as conclusive against a result, and the exertion of arbitrary choice would open the way to the fatal influence of bias, and what is commonly known as the “cooking” of figures. It would amount to judging fact by theory instead of theory by fact. The apparently divergent number may prove in time to be the true one. It may be an exception of that valuable kind which upsets our false theories, a real exception, exploding apparent coincidences, and opening a way to a new view of the subject. To establish this position for the divergent fact will require additional research; but in the meantime we should give it some weight in our mean conclusions, and should bear in mind the discrepancy as one demanding attention. To neglect a divergent result is to neglect the possible clue to a great discovery.
When two or more unknown quantities are so involved that they cannot be separately determined by the Simple Method of Means, we can yet obtain their most probable values by the Method of Least Squares, without more difficulty than arises from the length of the arithmetical computations. If the result of each observation gives an equation between two unknown quantities of the form