It cannot indeed be denied that the methods we employ must have some connection with the arrangement of the facts to which they are applied; but the two things are none the less distinct in their nature, and in this case the connection does not seem at all a necessary one, but at most one of propriety and convenience. The Method of Least Squares is usually applied, no doubt, to the most familiar and common form of the Law of Error, namely the exponential form with which we have been recently occupied. But other forms of laws of error may exist, and, if they did, the method in question might equally well be applied to them. I am not asserting that it would necessarily be the best method in every case, but it would be a possible one; indeed we may go further and say, as will be shown in a future chapter, that it would be a good method in almost every case. But its particular merits or demerits do not interfere with its possible employment in every case in which we may choose to resort to it. It will be seen therefore, even from the few remarks that can be made upon the subject here, that the fact that one and the same method is very commonly employed with satisfactory results affords little or no proof that the errors to which it is applied must be arranged according to one fixed law.

§ 13. So much then for the attempt to prove the prevalence, in all cases, of this particular law of divergence. The next point in Quetelet's treatment of the subject which deserves attention as erroneous or confusing, is the doctrine maintained by him and others as to the existence of what he terms a type in the groups of things in question. This is a not unnatural consequence from some of the data and conclusions of the last few paragraphs. Refer back to two of the three classes of things already mentioned in § 4. If it really were the case that in arranging in order a series of incorrect observations or attempts of our own, and a collection of natural objects belonging to some one and the same species or class, we found that the law of their divergence was in each case identical in the long run, we should be naturally disposed to apply the same expression ‘Law of Error’ to both instances alike, though in strictness it could only be appropriate to the former. When we perform an operation ourselves with a clear consciousness of what we are aiming at, we may quite correctly speak of every deviation from this as being an error; but when Nature presents us with a group of objects of any kind, it is using a rather bold metaphor to speak in this case also of a law of error, as if she had been aiming at something all the time, and had like the rest of us missed her mark more or less in almost every instance.[13]

Suppose we make a long succession of attempts to measure accurately the precise height of a man, we should from one cause or another seldom or never succeed in doing so with absolute accuracy. But we have no right to assume that these imperfect measurements of ours would be found so to deviate according to one particular law of error as to present the precise counterpart of a series of actual heights of different men, supposing that these latter were assigned with absolute precision. What might be the actual law of error in a series of direct measurements of any given magnitude could hardly be asserted beforehand, and probably the attempt to determine it by experience has not been made sufficiently often to enable us to ascertain it; but upon general grounds it seems by no means certain that it would follow the so-called exponential law. Be this however as it may, it is rather a licence of language to talk as if nature had been at work in the same way as one of us; aiming (ineffectually for the most part) at a given result, that is at producing a man endowed with a certain stature, proportions, and so on, who might therefore be regarded as the typical man.

§ 14. Stated as above, namely, that there is a fixed invariable human type to which all individual specimens of humanity may be regarded as having been meant to attain, but from which they have deviated in one direction or another; according to a law of deviation capable of à priori determination, the doctrine is little else than absurd. But if we look somewhat closer at the facts of the case, and the probable explanation of these facts, we may see our way to an important truth. The facts, on the authority of Quetelet's statistics (the great interest and value of which must be frankly admitted), are very briefly as follows: if we take any element of our physical frame which admits of accurate measurement, say the height, and determine this measure in a great number of different individuals belonging to any tolerably homogeneous class of people, we shall find that these heights do admit of an orderly arrangement about a mean, after the fashion which has been already repeatedly mentioned. What is meant by a homogeneous class? is a pertinent and significant enquiry, but applying this condition to any simple cases its meaning is readily stated. It implies that the mean in question will be different according to the nationality of the persons under measurement. According to Quetelet,[14] in the case of Englishmen the mean is about 5 ft. 9 in.; for Belgians about 5 ft. 7 in.; for the French about 5 ft. 4 in. It need hardly be added that these measures are those of adult males.

§ 15. It may fairly be asked here what would have been the consequence, had we, instead of keeping the English and the French apart, mixed the results of our measurements of them all together? The question is an important one, as it will oblige us to understand more clearly what we mean by homogeneous classes. The answer that would usually be given to it, though substantially correct, is somewhat too decisive and summary. It would be said that we are here mixing distinctly heterogeneous elements, and that in consequence the resultant law of error will be by no means of the simple character previously exhibited. So far as such an answer is to be admitted its grounds are easy to appreciate. In accordance with the usual law of error the divergences from the mean grow continuously less numerous as they increase in amount. Now, if we mix up the French and English heights, what will follow? Beginning from the English mean of 5 feet 9 inches, the heights will at first follow almost entirely the law determined by these English conditions, for at this point the English data are very numerous, and the French by comparison very few. But, as we begin to approach the French mean, the numbers will cease to show that continual diminution which they should show, according to the English scale of arrangement, for here the French data are in turn very numerous, and the English by comparison few. The result of such a combination of heterogeneous elements is illustrated by the figure annexed, of course in a very exaggerated form.

Superposition of two Gaussians

§ 16. In the above case the nature of the heterogeneity, and the reasons why the statistics should be so collected and arranged as to avoid it, seemed tolerably obvious. It will be seen still more plainly if we take a parallel case drawn from artificial proceedings. Suppose that after a man had fired a few thousand shots at a certain spot, say a wafer fixed somewhere on a wall, the position of the spot at which he aims were shifted, and he fired a few thousand more shots at the wafer in its new position. Now let us collect and arrange all the shots of both series in the order of their departure from either of the centres, say the new one. Here we should really be mingling together two discordant sets of elements, either of which, if kept apart from the other, would have been of a simple and homogeneous character. We should find, in consequence, that the resultant law of error betrayed its composite or heterogeneous origin by a glaring departure from the customary form, somewhat after the fashion indicated in the above diagram.

The instance of the English and French heights resembles the one just given, but falls far short of it in the stringency with which the requisite conditions are secured. The fact is we have not here got the most suitable requirements, viz. a group consisting of a few fixed causes supplemented by innumerable little disturbing influences. What we call a nation is really a highly artificial body, the members of which are subject to a considerable number of local or occasional disturbing causes. Amongst Frenchmen were included, presumably, Bretons, Provençals, Alsatians, and so on, thus commingling distinctions which, though less than those between French and English, regarded as wholes, are very far from being insignificant. And to these differences of race must be added other disturbances, also highly important, dependent upon varying climate, food and occupation. It is plain, therefore, that whatever objections exist against confusing together French and English statistics, exist also, though of course in a less degree, against confusing together those of the various provincial and other components which make up the French people.

§ 17. Out of the great variety of important causes which influence the height of men, it is probable that those which most nearly fulfil the main conditions required by the ‘Law of Error’ are those about which we know the least. Upon the effects of food and employment, observation has something to say, but upon the purely physiological causes by which the height of the parents influences the height of the offspring, we have probably nothing which deserves to be called knowledge. Perhaps the best supposition we can make is one which, in accordance with the saying that ‘like breeds like’, would assume that the purely physiological causes represent the constant element; that is, given a homogeneous race of people to begin with, who freely intermarry, and are subject to like circumstances of climate, food, and occupation, the standard would remain on the whole constant.[15] In such a case the man who possessed the mean height, mean weight, mean strength, and so on, might then be called, in a sort of way, a ‘type’. The deviations from this type would then be produced by innumerable small influences, partly physiological, partly physical and social, acting for the most part independently of one another, and resulting in a Law of Error of the usual description. Under such restrictions and explanations as these, there seems to be no reasonable objection to speaking of a French or English type or mean. But it must always be remembered that under the present circumstances of every political nation, these somewhat heterogeneous bodies might be subdivided into various smaller groups, each of which would frequently exhibit the characteristics of such a type in an even more marked degree.

§ 18. On this point the reports of the Anthropometrical Committee, already referred to, are most instructive. They illustrate the extent to which this subdivision could be carried out, and prove,—if any proof were necessary,—that the discovery of Quetelet's homme moyen would lead us a long chase. So far as their results go the mean ‘English’ stature (in inches) is 67.66. But this is composed of Scotch, Irish, English and Welsh constituents, the separate means of these being, respectively; 68.71, 67.90, 67.36, and 66.66. But these again may be subdivided; for careful observation shows that the mean English stature is distinctly greater in certain districts (e.g. the North-Eastern counties) than in others. Then again the mean of the professional classes is considerably greater than that of the labourers; and that of the honest and intelligent is very much greater than that of the criminal and lunatic constituents of the population. And, so far as the observations are extensive enough for the purpose, it appears that every characteristic in respect of the grouping about a mean which can be detected in the more extensive of these classes can be detected also in the narrower. Nor is there any reason to suppose that the same process of subdivision could not be carried out as much farther as we chose to prolong it.

§ 19. It need hardly be added to the above remarks that no one who gives the slightest adhesion to the Doctrine of Evolution could regard the type, in the above qualified sense of the term, as possessing any real permanence and fixity. If the constant causes, whatever they may be, remain unchanged, and if the variable ones continue in the long run to balance one another, the results will continue to cluster about the same mean. But if the constant ones undergo a gradual change, or if the variable ones, instead of balancing each other suffer one or more of their number to begin to acquire a preponderating influence, so as to put a sort of bias upon their aggregate effect, the mean will at once begin, so to say, to shift its ground. And having once begun to shift, it may continue to do so, to whatever extent we recognize that Species are variable and Development is a fact. It is as if the point on the target at which we aim, instead of being fixed, were slowly changing its position as we continue to fire at it; changing almost certainly to some extent and temporarily, and not improbably to a considerable extent and permanently.

§ 20. Our examples throughout this chapter have been almost exclusively drawn from physical characteristics, whether of man or of inanimate things; but it need not be supposed that we are necessarily confined to such instances. Mr Galton, for instance, has proposed to extend the same principles of calculation to mental phenomena, with a view to their more accurate determination. The objects to be gained by so doing belong rather to the inferential part of our subject, and will be better indicated further on; but they do not involve any distinct principle. Like other attempts to apply the methods of science in the region of the mind, this proposal has met with some opposition; with very slight reason, as it seems to me. That our mental qualities, if they could be submitted to accurate measurement, would be found to follow the usual Law of Error, may be assumed without much hesitation. The known extent of the correlation of mental and bodily characteristics gives high probability to the supposition that what is proved to prevail, at any rate approximately, amongst most bodily elements which have been submitted to measurement, will prevail also amongst the mental elements.

To what extent such measurements could be carried out practically, is another matter. It does not seem to me that it could be done with much success; partly because our mental qualities are so closely connected with, indeed so run into one another, that it is impossible to isolate them for purposes of comparison.[16] This is to some extent indeed a difficulty in bodily measurements, but it is far more so in those of the mind, where we can hardly get beyond what can be called a good guess. The doctrine, therefore, that mental qualities follow the now familiar law of arrangement can scarcely be grounded upon anything more than a strong analogy. Still this analogy is quite strong enough to justify us in accepting the doctrine and all the conclusions which follow from it, in so far as our estimates and measurements can be regarded as trustworthy. There seems therefore nothing unreasonable in the attempt to establish a system of natural classification of mankind by arranging them into a certain number of groups above and below the average, each group being intended to correspond to certain limits of excellency or deficiency.[17] All that is necessary for such a purpose is that the rate of departure from the mean should be tolerably constant under widely different circumstances: in this case throughout all the races of man. Of course if the law of divergence is the same as that which prevails in inanimate nature we have a still wider and more natural system of classification at hand, and one which ought to be familiar, more or less, to every one who has thus to estimate qualities.

§ 21. Perhaps one of the best illustrations of the legitimate application of such principles is to be found in Mr Galton's work on Hereditary Genius. Indeed the full force and purport of some of his reasonings there can hardly be appreciated except by those who are familiar with the conceptions which we have been discussing in this chapter. We can only afford space to notice one or two points, but the student will find in the perusal, of at any rate the more argumentive parts, of that volume[18] an interesting illustration of the doctrines now under discussion. For one thing it may be safely asserted, that no one unfamiliar with the Law of Error would ever in the least appreciate the excessive rapidity with which the superior degrees of excellence tend to become scarce. Every one, of course, can see at once, in a numerical way at least, what is involved in being ‘one of a million’; but they would not at all understand, how very little extra superiority is to be looked for in the man who is ‘one of two million’. They would confound the mere numerical distinction, which seems in some way to imply double excellence, with the intrinsic superiority, which would mostly be represented by a very small fractional advantage. To be ‘one of ten million’ sounds very grand, but if the qualities under consideration could be estimated in themselves without the knowledge of the vastly wider area from which the selection had been made, and in freedom therefore from any consequent numerical bias, people would be surprised to find what a very slight comparative superiority was, as a rule, thus obtained.

§ 22. The point just mentioned is an important one in arguments from statistics. If, for instance, we find a small group of persons, connected together by blood-relationship, and all possessing some mental characteristic in marked superiority, much depends upon the comparative rarity of such excellence when we are endeavouring to decide whether or not the common possession of these qualities was accidental. Such a decision can never be more than a rough one, but if it is to be made at all this consideration must enter as a factor. Again, when we are comparing one nation with another,[19] say the Athenian with any modern European people, does the popular mind at all appreciate what sort of evidence of general superiority is implied by the production, out of one nation, of such a group as can be composed of Socrates, Plato, and a few of their contemporaries? In this latter case we are also, it should be remarked, employing the ‘Law of Error’ in a second way; for we are assuming that where the extremes are great so will also the means be, in other words we are assuming that every amount of departure from the mean occurs with a (roughly) calculable degree of relative frequency. However generally this truth may be accepted in a vague way, its evidence can only be appreciated by those who know the reasons which can be given in its favour.

But the same principles will also supply a caution in the case of the last example. They remind us that, for the mere purpose of comparison, the average man of any group or class is a much better object for selection than the eminent one. There may be greater difficulties in the way of detecting him, but when we have done so we have got possession of a securer and more stable basis of comparison. He is selected, by the nature of the case, from the most numerous stratum of his society; the eminent man from a thinly occupied stratum. In accordance therefore with the now familiar laws of averages and of large numbers the fluctuations amongst the former will generally be very few and small in comparison with those amongst the latter.


1 Essai de Physique Sociale, 1869. Anthropométrie, 1870.

2 As regards later statistics on the same subject the reader can refer to the Reports of the Anthropometrical Committee of the British Association (1879, 1880, 1881, 1883;—especially this last). These reports seem to me to represent a great advance on the results obtained by Quetelet, and fully to justify the claim of the Secretary (Mr C. Roberts) that their statistics are “unique in range and numbers”. They embrace not merely military recruits—like most of the previous tables—but almost every class and age, and both sexes. Moreover they refer not only to stature but to a number of other physical characteristics.

3 As every mathematician knows, the relative numbers of each of these possible throws are given by the successive terms of the expansion of (1 + 1)10, viz. 1, 10, 45, 120, 210, 252, 210, 120, 45, 10, 1.

4 That is they will be more densely aggregated. If a space the size of the bull's-eye be examined in each successive circle, the number of shot marks which it contains will be successively less. The actual number of shots which strike the bull's-eye will not be the greatest, since it covers so much less surface than any of the other circles.

5 Commonly called the exponential law; its equation being of the form y = Aehx2. The curve corresponding to it cuts the axis of y at right angles (expressing the fact that near the mean there are a large number of values approximately equal); after a time it begins to slope away rapidly towards the axis of x (expressing the fact that the results soon begin to grow less common as we recede from the mean); and the axis of x is an asymptote in both directions (expressing the fact that no magnitude, however remote from the mean, is strictly impossible; that is, every deviation, however excessive, will have to be encountered at length within the range of a sufficiently long experience). The curve is obviously symmetrical, expressing the fact that equal deviations from the mean, in excess and in defect, tend to occur equally often in the long run.

Gaussian normal distributions

A rough graphic representation of the curve is given above. For the benefit of those unfamiliar with mathematics one or two brief remarks may be here appended concerning some of its properties. (1) It must not be supposed that all specimens of the curve are similar to one another. The dotted lines are equally specimens of it. In fact, by varying the essentially arbitrary units in which x and y are respectively estimated, we may make the portion towards the vertex of the curve as obtuse or as acute as we please. This consideration is of importance; for it reminds us that, by varying one of these arbitrary units, we could get an ‘exponential curve’ which should tolerably closely resemble any symmetrical curve of error, provided that this latter recognized and was founded upon the assumption that extreme divergences were excessively rare. Hence it would be difficult, by mere observation, to prove that the law of error in any given case was not exponential; unless the statistics were very extensive, or the actual results departed considerably from the exponential form. (2) It is quite impossible by any graphic representation to give an adequate idea of the excessive rapidity with which the curve after a time approaches the axis of x. At the point R, on our scale, the curve would approach within the fifteen-thousandth part of an inch from the axis of x, a distance which only a very good microscope could detect. Whereas in the hyperbola, e.g. the rate of approach of the curve to its asymptote is continually decreasing, it is here just the reverse; this rate is continually increasing. Hence the two, viz. the curve and the axis of x, appear to the eye, after a very short time, to merge into one another.

6 As by Quetelet: noted, amongst others, by Herschel, Essays, page 409.

7 Proc. R. Soc. Oct. 21, 1879.

8 We are here considering, remember, the case of a finite amount of statistics; so that there are actual limits at each end.

9 It must be admitted that experience has not yet (I believe) shown this asymmetry in respect of heights.

10 The above reasoning will probably be accepted as valid at this stage of enquiry. But in strictness, assumptions are made here, which however justifiable they may be in themselves, involve somewhat of an anticipation. They demand, and in a future chapter will receive, closer scrutiny and criticism.

11 A definite numerical example of this kind of concentration of frequency about the mean was given in the note to § 4. It was of a binomial form, consisting of the successive terms of the expansion of (1 + 1)m. Now it may be shown (Quetelet, Letters, p. 263; Liagre, Calcul des Probabilités, § 34) that the expansion of such a binomial, as m becomes indefinitely great, approaches as its limit the exponential form; that is, if we take a number of equidistant ordinates proportional respectively to 1, m, m(m − 1)/1·2 &c., and connect their vertices, the figure we obtain approximately represents some form of the curve y = Aehx2, and tends to become identical with it, as m is increased without limit. In other words, if we suppose the errors to be produced by a limited number of finite, equal and independent causes, we have an approximation to the exponential Law of Error, which merges into identity as the causes are increased in number and diminished in magnitude without limit. Jevons has given (Principles of Science, p. 381) a diagram drawn to scale, to show how rapid this approximation is. One point must be carefully remembered here, as it is frequently overlooked (by Quetelet, for instance). The coefficients of a binomial of two equal terms—as (1 + 1)m, in the preceding paragraph—are symmetrical in their arrangement from the first, and very speedily become indistinguishable in (graphical) outline from the final exponential form. But if, on the other hand, we were to consider the successive terms of such a binomial as (1 + 4)m (which are proportional to the relative chances of 0, 1, 2, 3, … failures in m ventures, of an event which has one chance in its favour to four against it) we should have an unsymmetrical succession. If however we suppose m to increase without limit, as in the former supposition, the unsymmetry gradually disappears and we tend towards precisely the same exponential form as if we had begun with two equal terms. The only difference is that the position of the vertex of the curve is no longer in the centre: in other words, the likeliest term or event is not an equal number of successes and failures but successes and failures in the ratio of 1 to 4.

12 ‘Law of Error’ is the usual technical term for what has been elsewhere spoken of above as a Law of Divergence from a mean. It is in strictness only appropriate in the case of one, namely the third, of the three classes of phenomena mentioned in § 4, but by a convenient generalization it is equally applied to the other two; so that we term the amount of the divergence from the mean an ‘error’ in every case, however it may have been brought about.

13 This however seems to be the purport, either by direct assertion or by implication, of two elaborate works by Quetelet, viz. his Physique Sociale and his Anthropométrie.

14 He scarcely, however, professes to give these as an accurate measure of the mean height, nor does he always give precisely the same measure. Practically, none but soldiers being measured in any great numbers, the English stature did not afford accurate data on any large scale. The statistics given a few pages further on are probably far more trustworthy.

15 This statement will receive some explanation and correction in the next chapter.

16 I am not speaking here of the now familiar results of Psychophysics, which are mainly occupied with the measurement of perceptions and other simple states of consciousness.

17 Perhaps the best brief account of Mr Galton's method is to be found in a paper in Mind (July, 1880) on the statistics of Mental Imagery. The subject under comparison here—viz. the relative power, possessed by different persons, of raising clear visual images of objects no longer present to us—is one which it seems impossible to ‘measure’, in the ordinary sense of the term. But by arranging all the answers in the order in which the faculty in question seems to be possessed we can, with some approach to accuracy, select the middlemost person in the row and use him as a basis of comparison with the corresponding person in any other batch. And similarly with those who occupy other relative positions than that of the middlemost.

18 I refer to the introductory and concluding chapters: the bulk of the book is, from the nature of the case, mainly occupied with statistical and biographical details.

19 See Galton's Hereditary Genius, pp. 336–350, “On the comparative worth of different races.”

CHAPTER III.

ON THE CAUSAL PROCESS BY WHICH THE GROUPS OR SERIES OF PROBABILITY ARE BROUGHT ABOUT.

§ 1. In discussing the question whether all the various groups and series with which Probability is concerned are of precisely one and the same type, we made some examination of the process by which they are naturally produced, but we must now enter a little more into the details of this process. All events are the results of numerous and complicated antecedents, far too numerous and complicated in fact for it to be possible for us to determine or take them all into account. Now, though it is strictly true that we can never determine them all, there is a broad distinction between the case of Induction, in which we can make out enough of them, and with sufficient accuracy, to satisfy a reasonable certainty, and Probability, in which we cannot do so. To Induction we shall return in a future chapter, and therefore no more need be said about it here.

We shall find it convenient to begin with a division which, though not pretending to any philosophical accuracy, will serve as a preliminary guide. It is the simple division into objects, and the agencies which affect them. All the phenomena with which Probability is concerned (as indeed most of those with which science of any kind is concerned) are the product of certain objects natural and artificial, acting under the influence of certain agencies natural and artificial. In the tossing of a penny, for instance, the objects would be the penny or pence which were successively thrown; the agencies would be the act of throwing, and everything which combined directly or indirectly with this to make any particular face come uppermost. This is a simple and intelligible division, and can easily be so extended in meaning as to embrace every class of objects with which we are concerned.

Now if, in any two or more cases, we had the same object, or objects indistinguishably alike, and if they were exposed to the influence of agencies in all respects precisely alike, we should expect the results to be precisely similar. By one of the applications of the familiar principle of the uniformity of nature we should be confident that exact likeness in the antecedents would be followed by exact likeness in the consequents. If the same penny, or similar pence, were thrown in exactly the same way, we should invariably find that the same face falls uppermost.

§ 2. What we actually find is, of course, very far removed from this. In the case of the objects, when they are artificial constructions, e.g. dice, pence, cards, it is true that they are purposely made as nearly as possible indistinguishably alike. We either use the same thing over and over again or different ones made according to precisely the same model. But in natural objects nothing of the sort prevails. In fact when we come to examine them, we find reproduced in them precisely the same characteristics as those which present themselves in the final result which we were asked to explain, so that unless we examine them a stage further back, as we shall have to do to some extent at any rate, we seem to be merely postulating again the very peculiarity of the phenomena which we were undertaking to explain. They will be found, for instance, to consist of large classes of objects, throughout all the individual members of which a general resemblance extends. Suppose that we were considering the length of life. The objects here are the human beings, or that selected class of them, whose lives we are considering. The resemblance existing among them is to be found in the strength and soundness of their principal vital organs, together with all the circumstances which collectively make up what we call the goodness of their constitutions. It is true that most of these circumstances do not admit of any approach to actual measurement; but, as was pointed out in the last chapter, very many of the circumstances which do admit of such measurement have been measured, and found to display the characteristics in question. Hence, from the known analogy and correlation between our various organs, there can be no reasonable doubt that if we could arrange human constitutions in general, or the various elements which compose them in particular, in the order of their strength, we should find just such an aggregate regularity and just such groupings about the mean, as the final result (viz. in this case the length of their lives) presents to our notice.

§ 3. It will be observed therefore that for this purpose the existence of natural kinds or groups is necessary. In our games of chance of course the same die may be thrown, or a card be drawn from the same pack, as often as we please; but many of the events which occur to human beings either cannot be repeated at all, or not often enough to secure in the case of the single individual any sufficient statistical uniformity. Such regularity as we trace in nature is owing, much more than is often suspected, to the arrangement of things in natural kinds, each of them containing a large number of individuals. Were each kind of animals or vegetables limited to a single pair, or even to but a few pairs, there would not be much scope left for the collection of statistical tables amongst them. Or to take a less violent supposition, if the numbers in each natural class of objects were much smaller than they are at present, or the differences between their varieties and sub-species much more marked, the consequent difficulty of extracting from them any sufficient length of statistical tables, though not fatal, might be very serious. A large number of objects in the class, together with that general similarity which entitles the objects to be fairly comprised in one class, seem to be important conditions for the applicability of the theory of Probability to any phenomenon. Something analogous to this excessive paucity of objects in a class would be found in the attempt to apply special Insurance offices to the case of those trades where the numbers are very limited, and the employment so dangerous as to put them in a class by themselves. If an insurance society were started for the workmen in gunpowder mills alone, a premium would have to be charged to avoid possible ruin, so high as to illustrate the extreme paucity of appropriate statistics.

§ 4. So much (at present) for the objects. If we turn to what we have termed the agencies, we find much the same thing again here. By the adjustment of their relative intensity, and the respective frequency of their occurrence, the total effects which they produce are found to be also tolerably uniform. It is of course conceivable that this should have been otherwise. It might have been found that the second group of conditions so exactly corrected the former as to convert the merely general uniformity into an absolute one; or it might have been found, on the other hand, that the second group should aggravate or disturb the influence of the former to such an extent as to destroy all the uniformity of its effects. Practically neither is the case. The second condition simply varies the details, leaving the uniformity on the whole of precisely the same general description as it was before. Or if the objects were supposed to be absolutely alike, as in the case of successive throws of a penny, it may serve to bring about a uniformity. Analysis will show these agencies to be thus made up of an almost infinite number of different components, but it will detect the same peculiarity that we have so often had occasion to refer to, pervading almost all these components. The proportions in which they are combined will be found to be nearly, though not quite, the same; the intensity with which they act will be nearly though not quite equal. And they will all unite and blend into a more and more perfect regularity as we proceed to take the average of a larger number of instances.

Take, for instance, the length of life. As we have seen, the constitutions of a very large number of persons selected at random will be found to present much the same feature; general uniformity accompanied by individual irregularity. Now when these persons go out into the world, they are exposed to a variety of agencies, the collective influence of which will assign to each the length of life allotted to him. These agencies are of course innumerable, and their mutual interaction complicated beyond all power of analysis to extricate. Each effect becomes in its turn a cause, is interwoven inextricably with an indefinite number of other causes, and reacts upon the final result. Climate, food, clothing, are some of these agencies, or rather comprise aggregate groups of them. The nature of a man's work is also important. One man overworks himself, another follows an unhealthy trade, a third exposes himself to infection, and so on.

The result of all this interaction between what we have thus called objects and agencies is that the final outcome presents the same general characteristics of uniformity as may be detected separately in the two constituent elements. Or rather, as we shall proceed presently to show, it does so in the great majority of cases.

§ 5. It may be objected that such an explanation as the above does not really amount to anything deserving of the name, for that instead of explaining how a particular state of things is caused it merely points out that the same state exists elsewhere. There is a uniformity discovered in the objects at the stage when they are commonly submitted to calculation; we then grope about amongst the causes of them, and after all only discover a precisely similar uniformity existing amongst these causes. This is to some extent true, for though part of the objection can be removed, it must always remain the case that the foundations of an objective science will rest in the last resort upon the mere fact that things are found to be of such and such a character.

§ 6. This division, into objects and the agencies which affect them, is merely intended for a rough practical arrangement, sufficient to point out to the reader the immediate nature of the causes which bring about our familiar uniformities. If we go back a step further, it might fairly be maintained that they may be reduced to one, namely, to the agencies. The objects, as we have termed them, are not an original creation in the state in which we now find them. No one supposes that whole groups or classes were brought into existence simultaneously, with all their general resemblances and particular differences fully developed. Even if it were the case that the first parents of each natural kind had been specially created, instead of being developed out of pre-existing forms, it would still be true that amongst the numbers of each that now present themselves the characteristic differences and resemblances are the result of what we have termed agencies. Take, for instance, a single characteristic only, say the height; what determines this as we find it in any given group of men? Partly, no doubt, the nature of their own food, clothing, employment, and so on, especially in the earliest years of their life; partly also, very likely, similar conditions and circumstances on the part of their parents at one time or another. No one, I presume, in the present state of knowledge, would attempt to enumerate the remaining causes, or even to give any indication of their exact nature; but at the same time few would entertain any doubt that agencies of this general description have been the determining causes at work.

If it be asked again, Into what may these agencies themselves be ultimately analysed? the answer to this question, in so far as it involves any detailed examination of them, would be foreign to the plan of this essay. In so far as any general remarks, applicable to nearly all classes alike of such agencies, are called for, we are led back to the point from which we started in the previous chapter, when we were discussing whether there is necessarily one fixed law according to which all our series are formed. We there saw that every event might be regarded as being brought about by a comparatively few important causes, of the kind which comprises all of which ordinary observation takes any notice, and an indefinitely numerous group of small causes, too numerous, minute, and uncertain in their action for us to be able to estimate them or indeed to take them individually into account at all. The important ones, it is true, may also in turn be themselves conceived to be made up of aggregates of small components, but they are still best regarded as being by comparison simple and distinct, for their component parts act mostly in groups collectively, appearing and disappearing together, so that they possess the essential characteristics of unity.

§ 7. Now, broadly speaking, it appears to me that the most suitable conditions for Probability are these: that the important causes should be by comparison fixed and permanent, and that the remaining ones should on the average continue to act as often in one direction as in the other. This they may do in two ways. In the first place we may be able to predicate nothing more of them than the mere fact that they act[1] as often in one direction as the other; what we should then obtain would be merely the simple statistical uniformity that is described in the first chapter. But it may be the case, and in practice generally is so more or less approximately, that these minor causes act also in independence of one another. What we then get is a group of uniformities such as was explained and illustrated in the second chapter. Every possible combination of these causes then occurring with a regular degree of frequency, we find one peculiar kind of uniformity exhibited, not merely in the mere fact of excess and defect (of whatever may be the variable quality in question), but also in every particular amount of excess and defect. Hence, in this case, we get what some writers term a ‘mean’ or ‘type,’ instead of a simple average. For instance, suppose a man throwing a quoit at a mark. Here our fixed causes are his strength, the weight of the quoit, and the intention of aiming at a given point. These we must of course suppose to remain unchanged, if we are to obtain any such uniformity as we are seeking. The minor and variable causes are all those innumerable little disturbing influences referred to in the last chapter. It might conceivably be the case that we were only able to ascertain that these acted as often in one direction as in the other; what we should then find was that the quoit tended to fall short of the mark as often as beyond it. But owing to these little causes being mostly independent of one another, and more or less equal in their influence, we find also that every amount of excess and defect presents the same general characteristics, and that in a large number of throws the quantity of divergences from the mark, of any given amount, is a tolerably determinate function, according to a regular law, of that amount of divergence.[2]

§ 8. The necessity of the conditions just hinted at will best be seen by a reference to cases in which any of them happen to be missing. Thus we know that the length of life is on the whole tolerably regular, and so are the numbers of those who die in successive years or centuries of most of the commoner diseases. But it does not seem to be the case with all diseases. What, for instance, of the Sweating Sickness, the Black Death, the Asiatic Cholera? The two former either do not recur, or, if they do, recur in such a mild form as not to deserve the same name. What in fact of any of the diseases which are epidemic rather than endemic? All these have their causes doubtless, and would be produced again by the recurrence of the conditions which caused them before. But some of them apparently do not recur at all. They seem to have depended upon such rare conditions that their occurrence was almost unique. And of those which do recur the course is frequently so eccentric and irregular, often so much dependent upon human will or want of will, as to entirely deprive their results (that is, the annual number of deaths which they cause) of the statistical uniformity of which we are speaking.

The explanation probably is that one of the principal causes in such cases is what we commonly call contagion. If so, we have at once a cause which so far from being fixed is subject to the utmost variability. Stringent caution may destroy it, carelessness may aggravate it to any extent. The will of man, as finding its expression either on the part of government, of doctors, or of the public, may make of it pretty nearly what is wished, though against the possibility of its entrance into any community no precautions can absolutely insure us.

§ 9. If it be replied that this want of statistical regularity only arises from the fact of our having confined ourselves to too limited a time, and that we should find irregularity disappear here, as elsewhere, if we kept our tables open long enough, we shall find that the answer will suggest another case in which the requisite conditions for Probability are wanting. Such a reply would only be conclusive upon the supposition that the ways and thoughts of men are in the long run invariable, or if variable, subject to periodic changes only. On the assumption of a steady progress in society, either for the better or the worse, the argument falls to the ground at once. From what we know of the course of the world, these fearful pests of the past may be considered as solitary events in our history, or at least events which will not be repeated. No continued uniformity would therefore be found in the deaths which they occasion, though the registrar's books were kept open for a thousand years. The reason here is probably to be sought in the gradual alteration of those indefinitely numerous conditions which we term collectively progress or civilization. Every little circumstance of this kind has some bearing upon the liability of any one to catch a disease. But when a kind of slow and steady tide sets in, in consequence of which these influences no longer remain at about the same average strength, warring on about equal terms with hostile influences, but on the contrary show a steady tendency to increase their power, the statistics will, with consequent steadiness and permanence, take the impress of such a change.

§ 10. Briefly then, if we were asked where the distinctive characteristics of Probability are most prominently to be found, and where they are most prominently absent, we might say that (1) they prevail principally in the properties of natural kinds, both in the ultimate and in the derivative or accidental properties. In all the characteristics of natural species, in all they do and in all which happens to them, so far as it depends upon their properties, we seldom fail to detect this regularity. Thus in men; their height, strength, weight, the age to which they live, the diseases of which they die; all present a well-known uniformity. Life insurance tables offer the most familiar instance of the importance of these applications of Probability.

(2) The same peculiarity prevails again in the force and frequency of most natural agencies. Wind and weather are seen to lose their proverbial irregularity when examined on a large scale. Man's work therefore, when operated on by such agencies as these, even though it had been made in different cases absolutely alike to begin with, afterwards shows only a general regularity. I may sow exactly the same amount of seed in my field every year. The yield may one year be moderate, the next year be abundant through favourable weather, and then again in turn be destroyed by hail. But in the long run these irregularities will be equalized in the result of my crops, because they are equalized in the power and frequency of the productive agencies. The business of underwriters, and offices which insure the crops against hail, would fall under this class; though, as already remarked, there is no very profound distinction between them and the former class.

The reader must be reminded again that this fixity is only temporary, that is, that even here the series belong to the class of those which possess a fluctuating type. Those indeed who believe in the fixity of natural species will have the best chance of finding a series of the really permanent type amongst them, though even they will admit that some change in the characteristic is attainable in length of time. In the case of the principal natural agencies, it is of course incontestable that the present average is referable to the present geological period only. Our average temperature and average rainfall have in former times been widely different from what they now are, and doubtless will be so again.

Any fuller investigation of the process by which, on the Theory of Evolution, out of a primeval simplicity and uniformity the present variety was educed, hardly belongs to the scope of the present work: at most, a few hints must suffice.

§ 11. The above, then, are instances of natural objects and natural agencies. There seems reason to believe that it is in such things only, as distinguished from things artificial, that the property in question is to be found. This is an assertion that will need some discussion and explanation. Two instances, in apparent opposition, will at once occur to the mind of some readers; one of which, from its great intrinsic importance, and the other, from the frequency of the problems which it furnishes, will demand a few minutes' separate examination.

(1) The first of these is the already mentioned case of instrumental observations. In the use of astronomical and other instruments the utmost possible degree of accuracy is often desired, a degree which cannot be reasonably hoped for in any one single observation. What we do therefore in these cases is to make a large number of successive observations which are naturally found to differ somewhat from each other in their results; by means of these the true value (as explained in a future chapter, on the Method of Least Squares) is to be determined as accurately as possible. The subjects then of calculation here are a certain number of elements, slightly incorrect elements, given by successive observations. Are not these observations artificial, or the direct product of voluntary agency? Certainly not: or rather, the answer depends on what we understand by voluntary. What is really intended and aimed at by the observer, is of course, perfect accuracy, that is, the true observation, or the voluntary steps and preliminaries on which this observation depends. Whether voluntary or not, this result only can be called intentional. But this result is not obtained. What we actually get in its place is a series of deviations from it, containing results more or less wide of the truth. Now by what are these deviations caused? By just such agencies as we have been considering in some of the earlier sections in this chapter. Heat and its irregular warping influence, draughts of air producing their corresponding effects, dust and consequent friction in one part or another, the slight distortion of the instrument by strains or the slow uneven contraction which continues long after the metal was cast; these and such as these are some of the causes which divert us from the truth. Besides this group, there are others which certainly do depend upon human agency, but which are not, strictly speaking, voluntary. They are such as the irregular action of the muscles, inability to make our various organs and members execute precisely the purposes we have in mind, perhaps different rates in the rapidity of the nervous currents, or in the response to stimuli, in the same or different observers. The effect produced by some of these, and the allowance that has in consequence to be made, are becoming familiar even to the outside world under the name of the ‘personal equation’ in astronomical, psychophysical, and other observations.

§ 12. (2) The other example, alluded to above, is the stock one of cards and dice. Here, as in the last case, the result is remotely voluntary, in the sense that deliberate volition presents itself at one stage. But subsequently to this stage, the result is produced or affected by so many involuntary agencies that it owes its characteristic properties to these. The turning up, for example, of a particular face of a die is the result of voluntary agency, but it is not an immediate result. That particular face was not chosen, though the fact of its being chosen was the remote consequence of an act of choice. There has been an intermediate chaos of conflicting agencies, which no one can calculate before or distinguish afterwards. These agencies seem to show a uniformity in the long run, and thence to produce a similar uniformity in the result. The drawing of a card from a pack is indeed more directly volitional, as in cutting for partners in a game of whist. But no one continues to do this long without having the pack well shuffled in the interval, whereby a host of involuntary influences are let in.

§ 13. The once startling but now familiar uniformities exhibited in the cases of suicides and misdirected letters, do not belong to the same class. The final resolution, or want of it, which leads to these results, is in each case indeed an important ingredient in the individual's action or omission; but, in so far as volition has anything to do with the results as a whole, it instantly disturbs them. If the voice of the Legislature speaks out, or any great preacher or moralist succeeds in deterring, or any impressive example in influencing, our moral statistics are instantly tampered with. Some further discussion will be devoted to this subject in a future chapter; it need only be remarked here that (always excluding such common or general influence as those just mentioned) the average volition, potent as it is in each separate case, is on the whole swayed by non-voluntary conditions, such as those of health, the casualties of employment, &c., in fact the various circumstances which influence the length of a man's life.

§ 14. Such distinctions as those just insisted on may seem to some persons to be needless, but serious errors have occasionally arisen from the neglect of them. The immediate products of man's mind, so far indeed as we can make an attempt to obtain them, do not seem to possess this essential characteristic of Probability. Their characteristic seems rather to be, either perfect mathematical accuracy or utter want of it, either law unfailing or mere caprice. If, e.g., we find the trees in a forest growing in straight lines, we unhesitatingly conclude that they were planted by man as they stand. It is true on the other hand, that if we find them not regularly planted, we cannot conclude that they were not planted by man; partly because the planter may have worked without a plan, partly because the subsequent irregularities brought on by nature may have obscured the plan. Practically the mind has to work by the aid of imperfect instruments, and is subjected to many hindrances through various and conflicting agencies, and by these means the work loses its original properties. Suppose, for instance, that a man, instead of producing numerical results by imperfect observations or by the cast of dice, were to select them at first hand for himself by simply thinking of them at once; what sort of series would he obtain? It would be about as difficult to obtain in this way any such series as those appropriate to Probability as it would be to keep his heart or pulse working regularly by direct acts of volition, supposing that he had the requisite control over these organs. But the mere suggestion is absurd. A man must have an object in thinking, he must think according to a rule or formula; but unless he takes some natural series as a copy, he will never be able to construct one mentally which shall permanently imitate the originals. Or take another product of human efforts, in which the intention can be executed with tolerable success. When any one builds a house, there are many slight disturbing influences at work, such as shrinking of bricks and mortar, settling of foundations, &c. But the effect which these disturbances are able to produce is so inappreciably small, that we may fairly consider that the result obtained is the direct product of the mind, the accurate realization of its intention. What is the consequence? Every house in the row, if designed by one man and at one time, is of exactly the same height, width, &c. as its neighbours; or if there are variations they are few, definite, and regular. The result offers no resemblance whatever to the heights, weights, &c. of a number of men selected at random. The builder probably had some regular design in contemplation, and he has succeeded in executing it.

§ 15. It may be replied that if we extend our observations, say to the houses of a large city, we shall then detect the property under discussion. The different heights of a great number, when grouped together, might be found to resemble those of a great number of human beings under similar treatment. Something of this kind might not improbably be found to be the case, though the resemblance would be far from being a close one. But to raise this question is to get on to different ground, for we were speaking (as remarked above) not of the work of different minds with their different aims, but of that of one mind. In a multiplicity of designs, there may be that variable uniformity, for which we may look in vain in a single design. The heights which the different builders contemplated might be found to group themselves into something of the same kind of uniformity as that which prevails in most other things which they should undertake to do independently. We might then trace the action of the same two conditions,—a uniformity in the multitude of their different designs, a uniformity also in the infinite variety of the influences which have modified those designs. But this is a very different thing from saying that the work of one man will show such a result as this. The difference is much like that between the tread of a thousand men who are stepping without thinking of each other, and their tread when they are drilled into a regiment. In the former case there is the working, in one way or another, of a thousand minds; in the latter, of one only.

The investigations of this and the former chapter constitute a sufficiently close examination into the detailed causes by which the peculiar form of statistical results with which we are concerned is actually produced, to serve the purpose of a work which is occupied mainly with the methods of the Science of Probability. The great importance, however, of certain statistical or sociological enquiries will demand a recurrence in a future chapter to one particular application of these statistics, viz. to those concerned with some classes of human actions.

§ 16. The only important addition to, or modification of, the foregoing remarks which I have found occasion to make is due to Mr Galton. He has recently pointed out,—and was I believe the first to do so,—that in certain cases some analysis of the causal processes can be effected, and is in fact absolutely necessary in order to account for the facts observed. Take, for instance, the heights of the population of any country. If the distribution or dispersion of these about their mean value were left to the unimpeded action of those myriad productive agencies alluded to above, we should certainly obtain such an arrangement in the posterity of any one generation as had already been exhibited in the parents. That is, we should find repeated in the previous stage the same kind of order as we were trying to account for in the following stage.

But then, as Mr Galton insists, if such agencies acted freely and independently, though we should get the same kind of arrangement or distribution, we should not get the same degree of it: there would, on the contrary, be a tendency towards further dispersion. The ‘curve of facility’ (v. the diagram on p. 29) would belong to the same class, but would have a different modulus. We shall see this at once if we take for comparison a case in which similar agencies work their way without any counteraction whatever. Suppose, for instance, that a large number of persons, whose fortunes were equal to begin with, were to commence gambling or betting continually for some small sum. If we examine their circumstances after successive intervals of time, we should expect to find their fortunes distributed according to the same general law,—i.e. the now familiar law in question,—but we should also expect to find that the poorest ones were slightly poorer, and the richest ones slightly richer, on each successive occasion. We shall see more about this in a future chapter (on Gambling), but it may be taken for granted here that there is nothing in the laws of chance to resist this tendency towards intensifying the extremes.

Now it is found, on the contrary, in the case of vital phenomena,—for instance in that of height, and presumably of most of the other qualities which are in any way characteristic of natural kinds,—that there is, through a number of successive generations, a remarkable degree of fixity. The tall men are not taller, and the short men are not shorter, per cent. of the population in successive generations: always supposing of course that some general change of circumstances, such as climate, diet, &c. has not set in. There must therefore here be some cause at work which tends, so to say, to draw in the extremes and thus to check the otherwise continually increasing dispersion.

§ 17. The facts were first tested by careful experiment. At the date of Mr Galton's original paper on the subject,[3] there were no available statistics of heights of human beings; so a physical element admitting of careful experiment (viz. the size or weight of certain seeds) was accurately estimated. From these data the actual amount of reversion from the extremes, that is, of the slight pressure continually put upon the extreme members with the result of crowding them back towards the mean, was determined, and this was compared with what theory would require in order to keep the characteristics of the species permanently fixed. Since then, statistics have been obtained to a large extent which deal directly with the heights of human beings.

The general conclusion at which we arrive is that there are several causes at work which are neither slight nor independent. There is, for instance, the observed fact that the extremes are as a rule not equally fertile with the means, nor equally capable of resisting death and disease. Hence as regards their mere numbers, there is a tendency for them somewhat to thin out. Then again there is a distinct positive cause in respect of ‘reversion.’ Not only are the offspring of the extremes less numerous, but these offspring also tend to cluster about a mean which is, so to say, shifted a little towards the true centre of the whole group; i.e. towards the mean offspring of the mean parents.

§ 18. For a full discussion of these characteristics, and for a variety of most ingenious illustrations of their mode of agency and of their comparative efficacy, the reader may be referred to Mr Galton's original articles. For our present purpose it will suffice to say that these characteristics tend towards maintaining the fixity of species; and that though they do not affect what may be called the general nature of the ‘probability curve’ or ‘law of facility’, they do determine its precise value in the cases in question. If, indeed, it be asked why there is no need for any such corrective influence in the case of, say, firing at a mark: the answer is that there is no opening for it except where a cumulative influence is introduced. The reason why the fortunes of our betting party showed an ever increasing divergency, and why some special correction was needed in order to avert such a tendency in the case of vital phenomena, was that the new starting-point at every step was slightly determined by the results of the previous step. The man who has lost a shilling one time starts, next time, worse off by just a shilling; and, but for the corrections we have been indicating, the man who was born tall would, so to say, throw off his descendants from a vantage ground of superior height. The true parallel in the case of the marksmen would be to suppose that their new points of aim were always shifted a little in the direction of the last divergence. The spreading out of the shot-marks would then continue without limit, just as would the divergence of fortunes of the supposed gamblers.