The proposer treated this as a ‘bag and balls’ problem, analogous to the following: 10 balls from one bag gave 7 white and 3 black, 14 from another bag gave 9 white and 5 black: what is the chance that the actual ratio of white to black balls was greater in the former than in the latter?—this actual ratio being of course considered a true indication of what would be the ultimate proportions of white and black drawings. This seems to me to be the only reasonable way of treating the problem, if it is to be considered capable of numerical solution at all.

Of course the inevitable assumption has to be made here about the equal prevalence of the different possible kinds of bag,—or, as the supporters of the justice of the calculation would put it, of the obligation to assume the equal à priori likelihood of each kind,—but I think that in this particular example the arbitrariness of the assumption is less than usual. This is because the problem discusses simply a balance between two extremely similar cases, and there is a certain set-off against each other of the objectionable assumptions on each side. Had one set of experiments only been proposed, and had we been asked to evaluate the probability of continued repetition of them confirming their verdict, I should have felt all the scruples I have already mentioned. But here we have got two sets of experiments carried on under almost exactly similar circumstances, and there is therefore less arbitrariness in assuming that their unknown conditions are tolerably equally prevalent.

§ 18. Examples of the description commonly introduced seem objectionable enough, but if we wish to realize to its full extent the vagueness of some of the problems submitted to this Inverse Probability, we have not far to seek. In natural as in artificial examples, where statistics are unattainable the enquiry becomes utterly hopeless, and all attempts at laying down rules for calculation must be abandoned. Take, for instance, the question which has given rise to some discussion,[9] whether such and such groups of stars are or are not to be regarded as the results of an accidental distribution; or the still wider and vaguer question, whether such and such things, or say the world itself, have been produced by chance?

In cases of this kind the insuperable difficulty is in determining what sense exactly is to be attached to the words ‘accidental’ and ‘random’ which enter into the discussion. Some account was given, in the fourth chapter, of their scientific and conventional meaning in Probability. There seem to be the same objections to generalizing them out of such relation, as there is in metaphysics to talking of the Infinite or the Absolute. Infinite magnitude, or infinite power, one can to some extent comprehend, or at least one may understand what is being talked about, but ‘the infinite’ seems to me a term devoid of meaning. So of anything supposed to have been produced at random: tell us the nature of the agency, the limits of its randomness and so on, and we can venture upon the problem, but without such data we know not what to do. The further consideration of such a problem might, I think, without arrogance be relegated to the Chapter on Fallacies. Accordingly any further remarks which I have to make upon the subject will be found there, and at the conclusion of the chapter on Causation and Design.

1 It might be more accurate to speak of ‘incompatible hypotheses with respect to any individual case’, or ‘mutually exclusive classes of events’.

2 The examples, of this kind, referring to human mortality are taken from the Carlisle tables. These differ considerably, as is well known, from other tables, but we have the high authority of De Morgan for regarding them as the best representative of the average mortality of the English middle classes at the present day.

3 I say, almost any proportion, because, as may easily be seen, arithmetic imposes certain restrictions upon the assumptions that can be made. We could not, for instance, suppose that all the black-haired men are short-sighted, for in any given batch of men the former are more numerous. But the range of these restrictions is limited, and their existence is not of importance in the above discussion.

4 Essay on Probabilities, p. 53. I have been reminded that in his article on Probability in the Encyclopædia Metropolitana he has stated that such rules involve no new principle.

5 This point will be fully discussed in a future chapter, after the general stand-point of an objective system of logic has been explained and illustrated.

6 Whitworth's Choice and Chance, Ed. II., p. 123. See also Boole's Laws of Thought, p. 370.

7 Opinions differ about the defence of such suppositions, as they do about the nature of them. Some writers, admitting the above assumption to be doubtful, call it the most impartial hypothesis. Others regard it as a sort of mean hypothesis.

8 Educational Times; Reprint, Vol. xxxvii. p. 40. The question was proposed by Dr. Macalister and gave rise to considerable controversy. As usual with problems of this inverse kind hardly any two of the writers were in agreement as to the assumptions to be made, or therefore as to the numerical estimate of the odds.

9 See Todhunter's History, pp. 333, 4.

There is an interesting discussion upon this question by the late J. D. Forbes in a paper in the Philosophical Magazine for Dec. 1850. It was replied to in a subsequent number by Prof. Donkin.

CHAPTER VIII.

THE RULE OF SUCCESSION.[*]

* A word of apology may be offered here for the introduction of a new name. The only other alternative would have been to entitle the rule one of Induction. But such a title I cannot admit, for reasons which will be almost immediately explained.

§ 1. In the last chapter we discussed at some length the nature of the kinds of inference in Probability which correspond to those termed, in Logic, immediate and mediate inferences. We ascertained what was the meaning of saying, for example, that the chance of any given man A. B. dying in a year is ¹/₃, when concluded from the general proposition that one man out of three in his circumstances dies. We also discussed the nature and evidence of rules of a more completely inferential character. But to stop at this point would be to take a very imperfect view of the subject. If Probability is a science of real inference about things, it must surely lead up to something more than such merely formal conclusions; we must be able, if not by means of it, at any rate by some means, to step beyond the limits of what has been actually observed, and to draw conclusions about what is as yet unobserved. This leads at once to the question, What is the connection of Probability with Induction? This is a question into which it will be necessary to enter now with some minuteness.

That there is a close connection between Probability and Induction, must have been observed by almost every one who has treated of either subject; I have not however seen any account of this connection that seemed to me to be satisfactory. An explicit description of it should rather be sought in treatises upon the narrower subject, Probability; but it is precisely here that the most confusion is to be found. The province of Probability being somewhat narrow, incursions have been constantly made from it into the adjacent territory of Induction. In this way, amongst the arithmetical rules discussed in the last chapter, others have been frequently introduced which ought not in strictness to be classed with them, as they rest on an entirely different basis.

§ 2. The origin of such confusion is easy of explanation; it arises, doubtless, from the habit of laying undue stress upon the subjective side of Probability, upon that which treats of the quantity of our belief upon different subjects and the variations of which that quantity is susceptible. It has been already urged that this variation of belief is at most but a constant accompaniment of what is really essential to Probability, and is moreover common to other subjects as well. By defining the science therefore from this side these other subjects would claim admittance into it; some of these, as Induction, have been accepted, but others have been somewhat arbitrarily rejected. Our belief in a wider proposition gained by Induction is, prior to verification, not so strong as that of the narrower generalization from which it is inferred. This being observed, a so-called rule of probability has been given by which it is supposed that this diminution of assent could in many instances be calculated.

But time also works changes in our conviction; our belief in the happening of almost every event, if we recur to it long afterwards, when the evidence has faded from the mind, is less strong than it was at the time. Why are not rules of oblivion inserted in treatises upon Probability? If a man is told how firmly he ought to expect the tide to rise again, because it has already risen ten times, might he not also ask for a rule which should tell him how firm should be his belief of an event which rests upon a ten years' recollection?[1] The infractions of a rule of this latter kind could scarcely be more numerous and extensive, as we shall see presently, than those of the former confessedly are. The fact is that the agencies, by which the strength of our conviction is modified, are so indefinitely numerous that they cannot all be assembled into one science; for purposes of definition therefore the quantity of belief had better be omitted from consideration, or at any rate regarded as a mere appendage, and the science, defined from the other or statistical side of the subject, in which, as has been shown, a tolerably clear boundary-line can be traced.

§ 3. Induction, however, from its importance does merit a separate discussion; a single example will show its bearing upon this part of our subject. We are considering the prospect of a given man, A. B. living another year, and we find that nine out of ten men of his age do survive. In forming an opinion about his surviving, however, we shall find that there are in reality two very distinct causes which aid in determining the strength of our conviction; distinct, but in practice so intimately connected that we are very apt to overlook one, and attribute the effect entirely to the other.

(I.) There is that which strictly belongs to Probability; that which (as was explained in Chap VI.) measures our belief of the individual case as deduced from the general proposition. Granted that nine men out of ten of the kind to which A. B. belongs do live another year, it obviously does not follow at all necessarily that he will. We describe this state of things by saying, that our belief of his surviving is diminished from certainty in the ratio of 10 to 9, or, in other words, is measured by the fraction ⁹/₁₀.

(II.) But are we certain that nine men out of ten like him will live another year? we know that they have so survived in time past, but will they continue to do so? Since A. B. is still alive it is plain that this proposition is to a certain extent assumed, or rather obtained by Induction. We cannot however be as certain of the inductive inference as we are of the data from which it was inferred. Here, therefore, is a second cause which tends to diminish our belief; in practice these two causes always accompany each other, but in thought they can be separated.

The two distinct causes described above are very liable to be confused together, and the class of cases from which examples are necessarily for the most part drawn increases this liability. The step from the statement ‘all men have died in a certain proportion’ to the inference ‘they will continue to die in that proportion’ is so slight a step that it is unnoticed, and the diminution of conviction that should accompany it is unsuspected. In what are called à priori examples the step is still slighter. We feel so certain about the permanence of the laws of mechanics, that few people would think of regarding it as an inference when they believe that a die will in the long run turn up all its faces equally often, because other dice have done so in time past.

§ 4. It has been already pointed out (in Chapter VI.) that, so far as concerns that definition of Probability which regards it as the science which discusses the degree and modifications of our belief, the question at issue seems to be simply this:—Are the causes alluded to above in (II.) capable of being reduced to one simple coherent scheme, so that any universal rules for the modification of assent can be obtained from them? If they are, strong grounds will have been shown for classing them with (I.), in other words, for considering them as rules of probability. Even then they would be rules practically of a very different kind, contingent instead of necessary (if one may use these terms without committing oneself to any philosophical system), but this objection might perhaps be overruled by the greater simplicity secured by classing them together. This view is, with various modifications, generally adopted by writers on Probability, or at least, as I understand the matter, implied by their methods of definition and treatment. Or, on the other hand, must these causes be regarded as a vast system, one might almost say a chaos, of perfectly distinct agencies; which may indeed be classified and arranged to some extent, but from which we can never hope to obtain any rules of perfect generality which shall not be subject to constant exception? If so, but one course is left; to exclude them all alike from Probability. In other words, we must assume the general proposition, viz. that which has been described throughout as our starting-point, to be given to us; it may be obtained by any of the numerous rules furnished by Induction, or it may be inferred deductively, or given by our own observation; its value may be diminished by its depending upon the testimony of witnesses, or its being recalled by our own memory. Its real value may be influenced by these causes or any combinations of them; but all these are preliminary questions with which we have nothing directly to do. We assume our statistical proposition to be true, neglecting the diminution of its value by the process of attainment; we take it up first at this point and then apply our rules to it. We receive it in fact, if one may use the expression, ready-made, and ask no questions about the process or completeness of its manufacture.

§ 5. It is not to be supposed, of course, that any writers have seriously attempted to reduce to one system of calculation all the causes mentioned above, and to embrace in one formula the diminution of certainty to which the inclusion of them subjects us. But on the other hand, they have been unwilling to restrain themselves from all appeal to them. From an early period in the study of the science attempts have been made to proceed, by the Calculus of Probability, from the observed cases to adjacent and similar cases. In practice, as has been already said, it is not possible to avoid some extension of this kind. But it should be observed, that in these instances the divergence from the strict ground of experience is not in reality recognized, at least not as a part of our logical procedure. We have, it is true, wandered somewhat beyond it, and so obtained a wider proposition than our data strictly necessitated, and therefore one of less certainty. Still we assume the conclusion given by induction to be equally certain with the data, or rather omit all notice of the divergence from consideration. It is assumed that the unexamined instances will resemble the examined, an assumption for which abundant warrant may exist; the theory of the calculation rests upon the supposition that there will be no difference between them, and the practical error is insignificant simply because this difference is small.

§ 6. But the rule we are now about to discuss, and which may be called the Rule of Succession, is of a very different kind. It not only recognizes the fact that we are leaving the ground of past experience, but takes the consequences of this divergence as the express subject of its calculation. It professes to give a general rule for the measure of expectation that we should have of the reappearance of a phenomenon that has been already observed any number of times. This rule is generally stated somewhat as follows: “To find the chance of the recurrence of an event already observed, divide the number of times the event has been observed, increased by one, by the same number increased by two.”

§ 7. It will be instructive to point out the origin of this rule; if only to remind the reader of the necessity of keeping mathematical formulæ to their proper province, and to show what astonishing conclusions are apt to be accepted on the supposed warrant of mathematics. Revert then to the example of Inverse Probability on p. 182. We saw that under certain assumptions, it would follow that when a single white ball had been drawn from a bag known to contain 10 balls which were white or black, the chance could be determined that there was only one white ball in it. Having done this we readily calculate ‘directly’ the chance that this white ball will be drawn next time. Similarly we can reckon the chances of there being two, three, &c. up to ten white balls in it, and determine on each of these suppositions the chance of a white ball being drawn next time. Adding these together we have the answer to the question:—a white ball has been drawn once from a bag known to contain ten balls, white or black; what is the chance of a second time drawing a white ball?

So far only arithmetic is required. For the next step we need higher mathematics, and by its aid we solve this problem:—A white ball has been drawn m times from a bag which contains any number, we know not what, of balls each of which is white or black, find the chance of the next drawing also yielding a white ball. The answer is

^m + 1/_m + 2.

Thus far mathematics. Then comes in the physical assumption that the universe may be likened to such a bag as the above, in the sense that the above rule may be applied to solve this question:—an event has been observed to happen m times in a certain way, find the chance that it will happen in that way next time. Laplace, for instance, has pointed out that at the date of the writing of his Essai Philosophique, the odds in favour of the sun's rising again (on the old assumption as to the age of the world) were 1,826,214 to 1. De Morgan says that a man who standing on the bank of a river has seen ten ships pass by with flags should judge it to be 11 to 1 that the next ship will also carry a flag.

§ 8. It is hard to take such a rule as this seriously, for there does not seem to be even that moderate confirmation of it which we shall find to hold good in the case of the application of abstract formulæ to the estimation of the evidence of witnesses. If however its validity is to be discussed there appear to be two very distinct lines of enquiry along which we may be led.

(1) In the first place we may take it for what it professes to be, and for what it is commonly understood to be, viz. a rule which assigns the measure of expectation we ought to entertain of the recurrence of the event under the circumstances in question. Of course, on the view adopted in this work, we insist on enquiring whether it is really true that on the average events do thus repeat their performance in accordance with this law. Thus tested, no one surely would attempt to defend such a formula. So far from past occurrence being a ground for belief in future recurrence, there are (as will be more fully pointed out in the Chapter on Fallacies) plenty of cases in which the direct contrary holds good. Then again a rule of this kind is subject to the very serious perplexity to be explained in our next chapter, arising out of the necessary arbitrariness of such inverse reference. That is, when an event has happened but a few times, we have no certain guide; and when it has happened but once,[2] we have no guide whatever, as to the class of cases to which it is to be referred. In the example above, about the flags, why did we stop short at this notion simply, instead of specifying the size, shape, &c. of the flags?

De Morgan, it must be remembered, only accepts this rule in a qualified sense. He regards it as furnishing a minimum value for the amount of our expectation. He terms it “the rule of probability of a pure induction,” and says of it, “The probabilities shown by the above rules are merely minima which may be augmented by other sources of knowledge.” That is, he recognizes only those instances in which our belief in the Uniformity of Nature and in the existence of special laws of causation comes in to supplement that which arises from the mere frequency of past occurrence. This however does not meet those cases in which past occurrence is a positive ground of disbelief in future recurrence.

§ 9. (2) There is however another and very different view which might be taken of such a rule. It is one, an obscure recognition of which has very likely had much to do with the acceptance which the rule has received.

What we might suppose ourselves to be thus expressing is,—not the measure of rational expectation which might be held by minds sufficiently advanced to be able to classify and to draw conscious inferences, but,—the law according to which the primitive elements of belief were started and developed. Of course such an interpretation as this would be equivalent to quitting the province of Logic altogether and crossing over into that of Psychology; but it would be a perfectly valid line of enquiry. We should be attempting nothing more than a development of the researches of Fechner and his followers in psychophysical measurement. Only then we ought, like them, not to start with any analogy of a ballot box and its contents, but to base our enquiry on careful determination of the actual mental phenomena experienced. We know how the law has been determined in accordance with which the intensity of the feeling of light varies with that of its objective source. We see how it is possible to measure the growth of memory according to the number of repetitions of a sentence or a succession of mere syllables. In this latter case, for instance, we just try experiments, and determine how much better a man can remember any utterances after eight hearings than after seven.[3]

Now this case furnishes a very close parallel to our supposed attempt to measure the increase of intensity of belief after repeated recurrence. That is, if it were possible to experiment in this order of mental phenomena, we ought simply to repeat a phenomenon a certain number of times and then ascertain by actual introspection or by some simple test, how fast the belief was increasing. Thus viewed the problem seems to me a hopeless one. The difficulties are serious enough, when we are trying to measure our simple sensations, of laying aside the effects of past training, and of attempting, as it were, to leave the mind open and passive to mere reception of stimuli. But if we were to attempt in this way to measure our belief these difficulties would become quite insuperable. We can no more divest ourselves of past training here than we can of intelligence or thought. I do not see how any one could possibly avoid classing the observed recurrences with others which he had experienced, and of being thus guided by special analogies and inductions instead of trusting solely to De Morgan's ‘pure induction’. The same considerations tend to rebut another form of defence for the rule in question. It is urged, for instance, that we may at least resort to it in those cases in which we are in entire ignorance as to the number and nature of the antecedents. This is a position to which I can hardly conceive it possible that we should ever be reduced. However remote or exceptional may be the phenomenon selected we may yet bring it into relation with some accepted generalizations and thus draw our conclusions from these rather than from purely à priori considerations.

§ 10. Since then past acquisitions cannot be laid aside or allowed for, the only remaining resource would be to experiment upon the infant mind. One would not like to pronounce that any line of enquiry is impossible; but the difficulties would certainly be enormous. And interesting as the facts would be, supposing that we had succeeded in securing them, they would not be of the slightest importance in Logic. However the question were settled:—whether, for instance, we proved that the sentiment or emotion of belief grew up slowly and gradually from a sort of zero point under the impress of repetition of experience; or whether we proved that a single occurrence produced complete belief in the repetition of the event, so that experience gradually untaught us and weakened our convictions;—in no case would the mature mind gain any aid as to what it ought to believe.

I cannot but think that some such view as this must occasionally underlie the acceptance which this rule has received. For instance, Laplace, though unhesitatingly adopting it as a real, that is, objective rule of inference, has gone into so much physiological and psychological matter towards the end of his discussion (Essai philosophique) as to suggest that what he had in view was the natural history of belief rather than its subsequent justification.

Again, the curious doctrine adopted by Jevons, that the principles of Induction rest entirely upon the theory of Probability,—a very different doctrine from that which is conveyed by saying that all knowledge of facts is probable only, i.e. not necessary,—seems unintelligible except on some such interpretation. We shall have more to say on this subject in our next chapter. It will be enough here to remark that in our present reflective and rational stage we find that every inference in Probability involves some appeal to, or support from, Induction, but that it is impossible to base either upon the other. However far back we try to push our way, and however disposed we might be to account for our ultimate beliefs by Association, it seems to me that so long as we consider ourselves to be dealing with rules of inference we must still distinguish between Induction and Probability.

1 John Craig, in his often named work, Theologiæ Christianæ Principia Mathematica (Lond. 1699) attempted something in this direction when he proposed to solve such problems as:—Quando evanescet probabilitas cujusvis Historiæ, cujus subjectum est transiens, vivâ tantum voce transmissæ, determinare.

2 When m = 1 the fraction becomes ²/₃; i.e. the odds are 2 to 1 in favour of recurrence. And there are writers who accept this result. For instance, Jevons (Principles of Science p. 258) says “Thus on the first occasion on which a person sees a shark, and notices that it is accompanied by a little pilot fish, the odds are 2 to 1 that the next shark will be so accompanied.” To say nothing of the fact that recognizing and naming the fish implies that they have often been seen before, how many of the observed characteristics of that single ‘event’ are to be considered essential? Must the pilot precede; and at the same distance? Must we consider the latitude, the ocean, the season, the species of shark, as matter also of repetition on the next occasion? and so on. I cannot see how the Inductive problem can be even intelligibly stated, for quantitative purposes, on the first occurrence of any event.

3 See in Mind (x. 454) Mr Jacob's account of the researches of Herr Ebbinghaus as described in his work Ueber das Gedächtniss.

CHAPTER IX.

INDUCTION AND ITS CONNECTION WITH PROBABILITY.

§ 1. We were occupied, during the last chapter, with the examination of a rule, the object of which was to enable us to make inferences about instances as yet unexamined. It was professedly, therefore, a rule of an inductive character. But, in the form in which it is commonly expressed, it was found to fail utterly. It is reasonable therefore to enquire at this point whether Probability is entirely a formal or deductive science, or whether, on the other hand, we are able, by means of it, to make valid inferences about instances as yet unexamined. This question has been already in part answered by implication in the course of the last two chapters. It is proposed in the present chapter to devote a fuller investigation to this subject, and to describe, as minutely as limits will allow, the nature of the connection between Probability and Induction. We shall find it advisable for clearness of conception to commence our enquiry at a somewhat early stage. We will travel over the ground, however, as rapidly as possible, until we approach the boundary of what can properly be termed Probability.

§ 2. Let us then conceive some one setting to work to investigate nature, under its broadest aspect, with the view of systematizing the facts of experience that are known, and thence (in case he should find that this is possible) discovering others which are at present unknown. He observes a multitude of phenomena, physical and mental, contemporary and successive. He enquires what connections are there between them? what rules can be found, so that some of these things being observed I can infer others from them? We suppose him, let it be observed, deliberately resolving to investigate the things themselves, and not to be turned aside by any prior enquiry as to there being laws under which the mind is compelled to judge of the things. This may arise either from a disbelief in the existence of any independent and necessary mental laws, and a consequent conviction that the mind is perfectly competent to observe and believe anything that experience offers, and should believe nothing else, or simply from a preference for investigations of the latter kind. In other words, we suppose him to reject Formal Logic, and to apply himself to a study of objective existences.

It must not for a moment be supposed that we are here doing more than conceiving a fictitious case for the purpose of more vividly setting before the reader the nature of the inductive process, the assumptions it has to make, and the character of the materials to which it is applied. It is not psychologically possible that any one should come to the study of nature with all his mental faculties in full perfection, but void of all materials of knowledge, and free from any bias as to the uniformities which might be found to prevail around him. In practice, of course, the form and the matter—the laws of belief or association, and the objects to which they are applied—act and react upon one another, and neither can exist in any but a low degree without presupposing the existence of the other. But the supposition is perfectly legitimate for the purpose of calling attention to the requirements of such a system of Logic, and is indeed nothing more than what has to be done at almost every step in psychological enquiry.[1]

§ 3. His task at first might be conceived to be a slow and tedious one. It would consist of a gradual accumulation of individual instances, as marked out from one another by various points of distinction, and connected with one another by points of resemblance. These would have to be respectively distinguished and associated in the mind, and the consequent results would then be summed up in general propositions, from which inferences could afterwards be drawn. These inferences could, of course, contain no new facts, they would only be repetitions of what he or others had previously observed. All that we should have so far done would have been to make our classifications of things and then to appeal to them again. We should therefore be keeping well within the province of ordinary logic, the processes of which (whatever their ultimate explanation) may of course always be expressed, in accordance with Aristotle's Dictum, as ways of determining whether or not we can show that one given class is included wholly or partly within another, or excluded from it, as the case may be.

§ 4. But a very short course of observation would suggest the possibility of a wide extension of his information. Experience itself would soon detect that events were connected together in a regular way; he would ascertain that there are ‘laws of nature.’ Coming with no à priori necessity of believing in them, he would soon find that as a matter of fact they do exist, though he could not feel any certainty as to the extent of their prevalence. The discovery of this arrangement in nature would at once alter the plan of his proceedings, and set the tone to the whole range of his methods of investigation. His main work now would be to find out by what means he could best discover these laws of nature.

An illustration may assist. Suppose I were engaged in breaking up a vast piece of rock, say slate, into small pieces. I should begin by wearily working through it inch by inch. But I should soon find the process completely changed owing to the existence of cleavage. By this arrangement of things a very few blows would do the work—not, as I might possibly have at first supposed, to the extent of a few inches—but right through the whole mass. In other words, by the process itself of cutting, as shown in experience, and by nothing else, a constitution would be detected in the things that would make that process vastly more easy and extensive. Such a discovery would of course change our tactics. Our principal object would thenceforth be to ascertain the extent and direction of this cleavage.

Something resembling this is found in Induction. The discovery of laws of nature enables the mind to dart with its inferences from a few facts completely through a whole class of objects, and thus to acquire results the successive individual attainment of which would have involved long and wearisome investigation, and would indeed in multitudes of instances have been out of the question. We have no demonstrative proof that this state of things is universal; but having found it prevail extensively, we go on with the resolution at least to try for it everywhere else, and we are not disappointed. From propositions obtained in this way, or rather from the original facts on which these propositions rest, we can make new inferences, not indeed with absolute certainty, but with a degree of conviction that is of the utmost practical use. We have gained the great step of being able to make trustworthy generalizations. We conclude, for instance, not merely that John and Henry die, but that all men die.

§ 5. The above brief investigation contains, it is hoped, a tolerably correct outline of the nature of the Inductive inference, as it presents itself in Material or Scientific Logic. It involves the distinction drawn by Mill, and with which the reader of his System of Logic will be familiar, between an inference drawn according to a formula and one drawn from a formula. We do in reality make our inference from the data afforded by experience directly to the conclusion; it is a mere arrangement of convenience to do so by passing through the generalization. But it is one of such extreme convenience, and one so necessarily forced upon us when we are appealing to our own past experience or to that of others for the grounds of our conclusion, that practically we find it the best plan to divide the process of inference into two parts. The first part is concerned with establishing the generalization; the second (which contains the rules of ordinary logic) determines what conclusions can be drawn from this generalization.

§ 6. We may now see our way to ascertaining the province of Probability and its relation to kindred sciences. Inductive Logic gives rules for discovering such generalizations as those spoken of above, and for testing their correctness. If they are expressed in universal propositions it is the part of ordinary logic to determine what inferences can be made from and by them; if, on the other hand, they are expressed in proportional propositions, that is, propositions of the kind described in our first chapter, they are handed over to Probability. We find, for example, that three infants out of ten die in their first four years. It belongs to Induction to say whether we are justified in generalizing our observation into the assertion, All infants die in that proportion. When such a proposition is obtained, whatever may be the value to be assigned to it, we recognize in it a series of a familiar kind, and it is at once claimed by Probability.

In this latter case the division into two parts, the inductive and the ratiocinative, seems decidedly more than one of convenience; it is indeed imperatively necessary for clearness of thought and cogency of treatment. It is true that in almost every example that can be selected we shall find both of the above elements existing together and combining to determine the degree of our conviction, but when we come to examine them closely it appears to me that the grounds of their cogency, the kind of conviction they produce, and consequently the rules which they give rise to, are so entirely distinct that they cannot possibly be harmonized into a single consistent system.

The opinion therefore according to which certain Inductive formulæ are regarded as composing a portion of Probability, and which finds utterance in the Rule of Succession criticised in our last chapter, cannot, I think, be maintained. It would be more correct to say, as stated above, that Induction is quite distinct from Probability, yet co-operates in almost all its inferences. By Induction we determine, for example, whether, and how far, we can safely generalize the proposition that four men in ten live to be fifty-six; supposing such a proposition to be safely generalized, we hand it over to Probability to say what sort of inferences can be deduced from it.

§ 7. So much then for the opinion which tends to regard pure Induction as a subdivision of Probability. By the majority of philosophical and logical writers a widely different view has of course been entertained. They are mostly disposed to distinguish these sciences very sharply from, not to say to contrast them with, one another; the one being accepted as philosophical or logical, and the other rejected as mathematical. This may without offence be termed the popular prejudice against Probability.

A somewhat different view, however, must be noticed here, which, by a sort of reaction against the latter, seems even to go beyond the former; and which occasionally finds expression in the statement that all inductive reasoning of every kind is merely a matter of Probability. Two examples of this may be given.

Beginning with the older authority, there is an often quoted saying by Butler at the commencement of his Analogy, that ‘probability is the very guide of life’; a saying which seems frequently to be understood to signify that the rules or principles of Probability are thus all-prevalent when we are drawing conclusions in practical life. Judging by the drift of the context, indeed, this seems a fair interpretation of his meaning, in so far of course as there could be said to be any such thing as a science of Probability in those days. Prof. Jevons, in his Principles of Science (p. 197), has expressed a somewhat similar view, of course in a way more consistent with the principles of modern science, physical and mathematical. He says, “I am convinced that it is impossible to expound the methods of induction in a sound manner, without resting them on the theory of Probability. Perfect knowledge alone can give certainty, and in nature perfect knowledge would be infinite knowledge, which is clearly beyond our capacities. We have, therefore, to content ourselves with partial knowledge,—knowledge mingled with ignorance, producing doubt.”[2]

§ 8. There are two senses in which this disposition to merge the two sciences into one may be understood. Using the word Probability in its vague popular signification, nothing more may be intended than to call attention to the fact, that in every case alike our conclusions are nothing more than ‘probable,’ that is, that they are not, and cannot be, absolutely certain. This must be fully admitted, for of course no one acquainted with the complexity of physical and other evidence would seriously maintain that absolute ideal certainty can be attained in any branch of applied logic. Hypothetical certainty, in abstract science, may be possible, but not absolute certainty in the domain of the concrete. This has been already noticed in a former chapter, where, however, it was pointed out that whatever justification may exist, on the subjective view of logic, for regarding this common prevalence of absence of certainty as warranting us in fusing the sciences into one, no such justification is admitted when we take the objective view.

§ 9. What may be meant, however, is that the grounds of this absence of certainty are always of the same general character. This argument, if admitted, would have real force, and must therefore be briefly noticed. We have seen abundantly that when we say of a conclusion within the strict province of Probability, that it is not certain, all that we mean is that in some proportion of cases only will such conclusion be right, in the other cases it will be wrong. Now when we say, in reference to any inductive conclusion, that we feel uncertain about its absolute cogency, are we conscious of the same interpretation? It seems to me that we are not. It is indeed quite possible that on ultimate analysis it might be proved that experience of failure in the past employment of our methods of investigation was the main cause of our present want of perfect confidence in them. But this, as we have repeatedly insisted, does not belong to the province of logical, but to that of Psychological enquiry. It is surely not the case that we are, as a rule, consciously guided by such occasional or repeated instances of past failure. In so far as they are at all influential, they seem to do their work by infusing a vague want of confidence which cannot be referred to any statistical grounds for its justification, at least not in a quantitative way. Part of our want of confidence is derived sympathetically from those who have investigated the matter more nearly at first hand. Here again, analysis might detect that a given proportion of past failures lay at the root of the distrust, but it does not show at the surface. Moreover, one reason why we cannot feel perfectly certain about our inductions is, that the memory has to be appealed to for some of our data; and will any one assert that the only reason why we do not place absolute reliance on our memory of events long past is that we have been deceived in that way before?

In any other sense, therefore, than as a needful protest against attaching too great demonstrative force to the conclusions of Inductive Logic, it seems decidedly misleading to speak of its reasonings as resting upon Probability.

§ 10. We may now see clearly the reasons for the limits within which causation[3] is necessarily required, but beyond which it is not needed. To be able to generalize a formula so as to extend it from the observed to the unobserved, it is clearly essential that there should be a certain permanence in the order of nature; this permanence is one form of what is implied in the term causation. If the circumstances under which men live and die remaining the same, we did not feel warranted in inferring that four men out of ten would continue to live to fifty, because in the case of those whom we had observed this proportion had hitherto done so, it is clear that we should be admitting that the same antecedents need not be followed by the same consequents. This uniformity being what the Law of Causation asserts, the truth of the law is clearly necessary to enable us to obtain our generalizations: in other words, it is necessary for the Inductive part of the process. But it seems to be equally clear that causation is not necessary for that part of the process which belongs to Probability. Provided only that the truth of our generalizations is secured to us, in the way just mentioned, what does it matter to us whether or not the individual members are subject to causation? For it is not in reality about these individuals that we make inferences. As this last point has been already fully treated in Chapter VI., any further allusion to it need not be made here.

§ 11. The above description, or rather indication, of the process of obtaining these generalizations must suffice for the present. Let us now turn and consider the means by which we are practically to make use of them when they are obtained. The point which we had reached in the course of the investigations entered into in the sixth and seventh chapters was this:—Given a series of a certain kind, we could draw inferences about the members which composed it; inferences, that is, of a peculiar kind, the value and meaning of which were fully discussed in their proper place.

We must now shift our point of view a little; instead of starting, as in the former chapters, with a determinate series supposed to be given to us, let us assume that the individual only is given, and that the work is imposed upon us of finding out the appropriate series. How are we to set about the task? In the former case our data were of this kind:—Eight out of ten men, aged fifty, will live eleven years more, and we ascertained in what sense, and with what certainty, we could infer that, say, John Smith, aged fifty, would live to sixty-one.

§ 12. Let us then suppose, instead, that John Smith presents himself, how should we in this case set about obtaining a series for him? In other words, how should we collect the appropriate statistics? It should be borne in mind that when we are attempting to make real inferences about things as yet unknown, it is in this form that the problem will practically present itself.

At first sight the answer to this question may seem to be obtained by a very simple process, viz. by counting how many men of the age of John Smith, respectively do and do not live for eleven years. In reality however the process is far from being so simple as it appears. For it must be remembered that each individual thing has not one distinct and appropriate class or group, to which, and to which alone, it properly belongs. We may indeed be practically in the habit of considering it under such a single aspect, and it may therefore seem to us more familiar when it occupies a place in one series rather than in another; but such a practice is merely customary on our part, not obligatory. It is obvious that every individual thing or event has an indefinite number of properties or attributes observable in it, and might therefore be considered as belonging to an indefinite number of different classes of things. By belonging to any one class it of course becomes at the same time a member of all the higher classes, the genera, of which that class was a species. But, moreover, by virtue of each accidental attribute which it possesses, it becomes a member of a class intersecting, so to say, some of the other classes. John Smith is a consumptive man say, and a native of a northern climate. Being a man he is of course included in the class of vertebrates, also in that of animals, as well as in any higher such classes that there may be. The property of being consumptive refers him to another class, narrower than any of the above; whilst that of being born in a northern climate refers him to a new and distinct class, not conterminous with any of the rest, for there are things born in the north which are not men.

§ 13. When therefore John Smith presents himself to our notice without, so to say, any particular label attached to him informing us under which of his various aspects he is to be viewed, the process of thus referring him to a class becomes to a great extent arbitrary. If he had been indicated to us by a general name, that, of course, would have been some clue; for the name having a determinate connotation would specify at any rate a fixed group of attributes within which our selection was to be confined. But names and attributes being connected together, we are here supposed to be just as much in ignorance what name he is to be called by, as what group out of all his innumerable attributes is to be taken account of; for to tell us one of these things would be precisely the same in effect as to tell us the other. In saying that it is thus arbitrary under which class he is placed, we mean, of course, that there are no logical grounds of decision; the selection must be determined by some extraneous considerations. Mere inspection of the individual would simply show us that he could equally be referred to an indefinite number of classes, but would in itself give no inducement to prefer, for our special purpose, one of these classes to another.

This variety of classes to which the individual may be referred owing to his possession of a multiplicity of attributes, has an important bearing on the process of inference which was indicated in the earlier sections of this chapter, and which we must now examine in more special reference to our particular subject.

§ 14. It will serve to bring out more clearly the nature of some of those peculiarities of the step which we are now about to take in the case of Probability, if we first examine the form which the corresponding step assumes in the case of ordinary Logic. Suppose then that we wished to ascertain whether a certain John Smith, a man of thirty, who is amongst other things a resident in India, and distinctly affected with cancer, will continue to survive there for twenty years longer. The terms in which the man is thus introduced to us refer him to different classes in the way already indicated. Corresponding to these classes there will be a number of propositions which have been obtained by previous observations and inductions, and which we may therefore assume to be available and ready at hand when we want to make use of them. Let us conceive them to be such as these following:—Some men live to fifty; some Indian residents live to fifty; no man suffering thus from cancer lives for five years. From the first and second of these premises nothing whatever can be inferred, for they are both[4] particular propositions, and therefore lead to no conclusion in this case. The third answers our enquiry decisively.

To the logical reader it will hardly be necessary to point out that the process here under consideration is that of finding middle terms which shall serve to connect the subject and predicate of our conclusion. This subject and predicate in the case in question, are the individual before us and his death within the stated period. Regarded by themselves there is nothing in common between them, and therefore no link by which they may be connected or disconnected with each other. The various classes above referred to are a set of such middle terms, and the propositions belonging to them are a corresponding set of major premises. By the help of any one of them we are enabled, under suitable circumstances, to connect together the subject and predicate of the conclusion, that is, to infer whether the man will or will not live twenty years.

§ 15. Now in the performance of such a logical process there are two considerations to which the reader's attention must for a moment be directed. They are simple enough in this case, but will need careful explanation in the corresponding case in Probability. In the first place, it is clear that whenever we can make any inference at all, we can do so with absolute certainty. Logic, within its own domain, knows nothing of hesitation or doubt. If the middle term is appropriate it serves to connect the extremes in such a way as to preclude all uncertainty about the conclusion; if it is not, there is so far an end of the matter: no conclusion can be drawn, and we are therefore left where we were. Assuming our premises to be correct, we either know our conclusion for certain, or we know nothing whatever about it. In the second place, it should be noticed that none of the possible alternatives in the shape of such major premises as those given above can ever contradict any of the others, or be at all inconsistent with them. Regarded as isolated propositions, there is of course nothing to secure such harmony; they have very different predicates, and may seem quite out of each other's reach for either support or opposition. But by means of the other premise they are in each case brought into relation with one another, and the general interests of truth and consistency prevent them therefore from contradicting one another. As isolated propositions it might have been the case that all men live to fifty, and that no Indian residents do so, but having recognised that some men are residents in India, we see at once that these premises are inconsistent, and therefore that one or other of them must be rejected. In all applied logic this necessity of avoiding self-contradiction is so obvious and imperious that no one would think it necessary to lay down the formal postulate that all such possible major premises are to be mutually consistent. To suppose that this postulate is not complied with, would be in effect to make two or more contradictory assumptions about matters of fact.

§ 16. But now observe the difference when we attempt to take the corresponding step in Probability. For ordinary propositions, universal or particular, substitute statistical propositions of what we have been in the habit of calling the ‘proportional’ kind. In other words, instead of asking whether the man will live for twenty years, let us ask whether he will live for one year? We shall be unable to find any universal propositions which will cover the case, but we may without difficulty obtain an abundance of appropriate proportional ones. They will be of the following description:—Of men aged 30, 98 in 100 live another year; of residents in India a smaller proportion survive, let us for example say 90 in 100; of men suffering from cancer a smaller proportion still, let us say 20 in 100.

Now in both of the respects to which attention has just been drawn, propositions of this kind offer a marked contrast with those last considered. In the first place, they do not, like ordinary propositions, either assert unequivocally yes or no, or else refuse to open their lips; but they give instead a sort of qualified or hesitating answer concerning the individuals included in them. This is of course nothing more than the familiar characteristic of what may be called ‘probability propositions.’ But it leads up to, and indeed renders possible, the second and more important point; viz. that these various answers, though they cannot directly and formally contradict each other (this their nature as proportional propositions, will not as a rule permit), may yet, in a way which will now have to be pointed out, be found to be more or less in conflict with each other.

Hence it follows that in the attempt to draw a conclusion from premises of the kind in question, we may be placed in a position of some perplexity; but it is a perplexity which may present itself in two forms, a mild and an aggravated form. We will notice them in turn.

§ 17. The mild form occurs when the different classes to which the individual case may be appropriately referred are successively included one within another; for here our sets of statistics, though leading to different results, will not often be found to be very seriously at variance with one another. All that comes of it is that as we ascend in the scale by appealing to higher and higher genera, the statistics grow continually less appropriate to the particular case in point, and such information therefore as they afford becomes gradually less explicit and accurate.

The question that we originally wanted to determine, be it remembered, is whether John Smith will die within one year. But all knowledge of this fact being unattainable, owing to the absence of suitable inductions, we felt justified (with the explanation, and under the restrictions mentioned in Chap VI.), in substituting, as the only available equivalent for such individual knowledge, the answer to the following statistical enquiry, What proportion of men in his circumstances die?

§ 18. But then at once there begins to arise some doubt and ambiguity as to what exactly is to be understood by his circumstances. We may know very well what these circumstances are in themselves, and yet be in perplexity as to how many of them we ought to take into account when endeavouring to estimate his fate. We might conceivably, for a beginning, choose to confine our attention to those properties only which he has in common with all animals. If so, and statistics on the subject were attainable, they would presumably be of some such character as this, Ninety-nine animals out of a hundred die within a year. Unusual as such a reference would be, we should, logically speaking, be doing nothing more than taking a wider class than the one we were accustomed to. Similarly we might, if we pleased, take our stand at the class of vertebrates, or at that of mammalia, if zoologists were able to give us the requisite information. Of course we reject these wide classes and prefer a narrower one. If asked why we reject them, the natural answer is that they are so general, and resemble the particular case before us in so few points, that we should be exceedingly likely to go astray in trusting to them. Though accuracy cannot be insured, we may at least avoid any needless exaggeration of the relative number and magnitude of our errors.

§ 19. The above answer is quite valid; but whilst cautioning us against appealing to too wide a class, it seems to suggest that we cannot go wrong in the opposite direction, that is in taking too narrow a class. And yet we do avoid any such extremes. John Smith is not only an Englishman; he may also be a native of such a part of England, be living in such a Presidency, and so on. An indefinite number of such additional characteristics might be brought out into notice, many of which at any rate have some bearing upon the question of vitality. Why do we reject any consideration of these narrower classes? We do reject them, but it is for what may be termed a practical rather than a theoretical reason. As was explained in the first chapters, it is essential that our series should contain a considerable number of terms if they are to be of any service to us. Now many of the attributes of any individual are so rare that to take them into account would be at variance with the fundamental assumption of our science, viz. that we are properly concerned only with the averages of large numbers. The more special and minute our statistics the better, provided only that we can get enough of them, and so make up the requisite large number of instances. This is, however, impossible in many cases. We are therefore obliged to neglect one attribute after another, and so to enlarge the contents of our class; at the avowed risk of somewhat increased variety and unsuitability in the members of it, for at each step of this kind we diverge more and more from the sort of instances that we really want. We continue to do so, until we no longer gain more in quantity than we lose in quality. We finally take our stand at the point where we first obtain statistics drawn from a sufficiently large range of observation to secure the requisite degree of stability and uniformity.

§ 20. In such an example as the one just mentioned, where one of the successive classes—man—is a well-defined natural kind or species, there is such a complete break in each direction at this point, that every one is prompted to take his stand here. On the one hand, no enquirer would ever think of introducing any reference to the higher classes with fewer attributes, such as animal or organized being: and on the other hand, the inferior classes, created by our taking notice of his employment or place of residence, &c., do not as a rule differ sufficiently in their characteristics from the class man to make it worth our while to attend to them.

Now and then indeed these characteristics do rise into importance, and whenever this is the case we concentrate our attention upon the class to which they correspond, that is, the class which is marked off by their presence. Thus, for instance, the quality of consumptiveness separates any one off so widely from the majority of his fellow-men in all questions pertaining to mortality, that statistics about the lives of consumptive men differ materially from those which refer to men in general. And we see the result; if a consumptive man can effect an insurance at all, he must do it for a much higher premium, calculated upon his special circumstances. In other words, the attribute is sufficiently important to mark off a fresh class or series. So with insurance against accident. It is not indeed attempted to make a special rate of insurance for the members of each separate trade, but the differences of risk to which they are liable oblige us to take such facts to some degree into account. Hence, trades are roughly divided into two or three classes, such as the ordinary, the hazardous, and the extra-hazardous, each having to pay its own rate of premium.

§ 21. Where one or other of the classes thus corresponds to natural kinds, or involves distinctions of co-ordinate importance with those of natural kinds, the process is not difficult; there is almost always some one of these classes which is so universally recognised to be the appropriate one, that most persons are quite unaware of there being any necessity for a process of selection. Except in the cases where a man has a sickly constitution, or follows a dangerous employment, we seldom have occasion to collect statistics for him from any class but that of men in general of his age in the country.

When, however, these successive classes are not ready marked out for us by nature, and thence arranged in easily distinguishable groups, the process is more obviously arbitrary. Suppose we were considering the chance of a man's house being burnt down, with what collection of attributes should we rest content in this instance? Should we include all kinds of buildings, or only dwelling-houses, or confine ourselves to those where there is much wood, or those which have stoves? All these attributes, and a multitude of others may be present, and, if so, they are all circumstances which help to modify our judgment. We must be guided here by the statistics which we happen to be able to obtain in sufficient numbers. Here again, rough distinctions of this kind are practically drawn in Insurance Offices, by dividing risks into ordinary, hazardous, and extra-hazardous. We examine our case, refer it to one or other of these classes, and then form our judgment upon its prospects by the statistics appropriate to its class.

§ 22. So much for what may be called the mild form in which the ambiguity occurs; but there is an aggravated form in which it may show itself, and which at first sight seems to place us in far greater perplexity.

Suppose that the different classes mentioned above are not included successively one within the other. We may then be quite at a loss which of the statistical tables to employ. Let us assume, for example, that nine out of ten Englishmen are injured by residence in Madeira, but that nine out of ten consumptive persons are benefited by such a residence. These statistics, though fanciful, are conceivable and perfectly compatible. John Smith is a consumptive Englishman; are we to recommend a visit to Madeira in his case or not? In other words, what inferences are we to draw about the probability of his death? Both of the statistical tables apply to his case, but they would lead us to directly contradictory conclusions. This does not mean, of course, contradictory precisely in the logical sense of that word, for one of these propositions does not assert that an event must happen and the other deny that it must; but contradictory in the sense that one would cause us in some considerable degree to believe what the other would cause us in some considerable degree to disbelieve. This refers, of course, to the individual events; the statistics are by supposition in no degree contradictory. Without further data, therefore, we can come to no decision.

§ 23. Practically, of course, if we were forced to a decision with only these data before us, we should make our choice by the consideration that the state of a man's lungs has probably more to do with his health than the place of his birth has; that is, we should conclude that the duration of life of consumptive Englishmen corresponds much more closely with that of consumptive persons in general than with that of their healthy countrymen. But this is, of course, to import empirical considerations into the question. The data, as they are given to us, and if we confine ourselves to them, leave us in absolute uncertainty upon the point. It may be that the consumptive Englishmen almost all die when transported into the other climate; it may be that they almost all recover. If they die, this is in obvious accordance with the first set of statistics; it will be found in accordance with the second set through the fact of the foreign consumptives profiting by the change of climate in more than what might be termed their due proportion. A similar explanation will apply to the other alternative, viz. to the supposition that the consumptive Englishmen mostly recover. The problem is, therefore, left absolutely indeterminate, for we cannot here appeal to any general rule so simple and so obviously applicable as that which, in a former case, recommended us always to prefer the more special statistics, when sufficiently extensive, to those which are wider and more general. We have no means here of knowing whether one set is more special than the other.

And in this no difficulty can be found, so long as we confine ourselves to a just view of the subject. Let me again recall to the reader's mind what our present position is; we have substituted for knowledge of the individual (finding that unattainable) a knowledge of what occurs in the average of similar cases. This step had to be taken the moment the problem was handed over to Probability. But the conception of similarity in the cases introduces us to a perplexity; we manage indeed to evade it in many instances, but here it is inevitably forced upon our notice. There are here two aspects of this similarity, and they introduce us to two distinct averages. Two assertions are made as to what happens in the long run, and both of these assertions, by supposition, are verified. Of their truth there need be no doubt, for both were supposed to be obtained from experience.

§ 24. It may perhaps be supposed that such an example as this is a reductio ad absurdum of the principle upon which Life and other Insurances are founded. But a moment's consideration will show that this is quite a mistake, and that the principle of insurance is just as applicable to examples of this kind as to any other. An office need find no difficulty in the case supposed. They might (for a reason to be mentioned presently, they probably would not) insure the individual without inconsistency at a rate determined by either average. They might say to him, “You are an Englishman. Out of the multitude of English who come to us nine in ten die if they go to Madeira. We will insure you at a rate assigned by these statistics, knowing that in the long run all will come right so far as we are concerned. You are also consumptive, it is true, and we do not know what proportion of the English are consumptive, nor what proportion of English consumptives die in Madeira. But this does not really matter for our purpose. The formula, nine in ten die, is in reality calculated by taking into account these unknown proportions; for, though we do not know them in themselves, statistics tell us all that we care to know about their results. In other words, whatever unknown elements may exist, must, in regard to all the effects which they can produce, have been already taken into account, so that our ignorance about them cannot in the least degree invalidate such conclusions as we are able to draw. And this is sufficient for our purpose.” But precisely the same language might be held to him if he presented himself as a consumptive man; that is to say, the office could safely carry on its proceedings upon either alternative.