CHAPTER XXII
Highlights of Fractional Substitution
Fractional substitution requires a cipher alphabet of the “multifid” type; that is, one in which the symbols are composed of two or more units, as in the Bacon and Trithemius alphabets (Chapter II: Figs. 3 and 4), the various “checkerboards” (Chapter XI), and so on. Polygram “alphabets” are also of this type, and seriation is a forrn of fractional substitution.
Among the older fractionals, we find a system called the “Pollux,” in which the basis was the Morse telegraphic alphabet. There were three units, the dot, the dash, and a separator (made necessary by the irregular lengths of the substitutes). There
Figure 172
Delastelle's "BIFID" Substitution -(Keyword Feature Added by M. E. OHAVER) Preparation of Alphabet: Checkerboard Key: Substitutes: G E N • R A L 1 2 3 4 5 S = 43 B C D F H I K E = 15 M O P Q S T U 1 G B M V E N = 24 V W X Y Z 2 C O W N D D = 25 3 P X F Q Y 4 R H S Z A 5 I T L K U Preliminary Substitution: S E N D S U P P L I E S T O M O R L E Y S R I G H T A W A Y. 4 1 2 2 4 5 3 3 5 5 1 4 5 2 1 2 4 5 1 3 4 4 5 1 4 5 4 2 4 3 3 5 4 5 3 5 1 1 3 1 5 3 2 2 3 2 1 3 5 5 3 1 1 1 2 2 5 3 5 5 Re-Substitution: 41 22 45 33 54 53 51 35 51 45 21 31 53 22 12 45 13 43 21 35 53 R O A F K L I Y I A C P L O B A M S C Y L 45 14 54 21 11 22 53 43 55 Transmitted: A V K C G O L S U. R O A F K L I Y I A C P L O B, etc. |
was a first substitution in which the letters of the text were replaced with their Morse symbols, including the space. The resulting cryptogram, composed entirely of the units dot, dash, space (. — x), was then subjected to a second substitution, using a small cipher alphabet (either digits or letters) in which each one of the three units might have any one of several different substitutes, chosen at will. For instance, a dot might be replaced with any one of digits 1, 8, 5, 6, a space with any one of digits 3, 9, 0, and a dash with any one of digits 2, 4, 7.
We find also a number of systems called “Collon” in which the basis is some one of the “checkerboards.” The text is subjected to a simple substitution in the agreed alphabet, and the resulting cryptogram is then subjected to a transposition, usually seriation, this being the final operation.
A similar system called the “Mirabeau” uses an alphabet of the same type as that of the Polybius square, in which only the digits 1-2-3-4-5 are significant. The remaining digits are all null, and numbers like 67 or 88 may be inserted at will. Numbers are written vertically (tens below units); then, in the taking off of the cryptograms, the whole series of units is taken first, and the second half of the cryptogram includes all of the tens-digits. In all of these forms, the undesirable features are self-evident. The later devices have added another operation: the regrouping of the scattered units, and their reconversion into letters.
Classic examples are those described by Delastelle as “bifid” and “trifid” (terms, incidentally, which some of our own writers find objectionable, as they do also the term “multifid”). Delastelle’s “bifid” cipher was of the kind shown in Fig. 172. A two-unit alphabet must be used, and all possible two-unit combinations must be convertible into letters. Any desired seriation-length may be agreed upon,
Figure 173
A Fractional Substitution Based on Morse Symbols - M.E.OHAVER The Alphabet, Arranged by Group-Lengths: E . S ... H .... B -... T - U ..- V ...- X -..- R .-. F ..-. C -.-. W .-- ü ..-- Y -.-- I .. D -.. L .-.. Z --.. A .- K -.- ä .-.- Q --.- N -. G --. P .--. ö ---. M -- O --- J .--- ch ---- S E N D S U P ... . -. -.. ... ..- .--. Reverse digits, and re-group: 3 1 2 3 3 3 4 4 3 3 3 2 1 3 .... -.- ... ... .- . --. H K S S A E G |
though it should not be divisible by 2. In the figure, the key-word GENERAL, 7 letters, governs the seriation-length as well as the mixing of the key-square, a feature suggested by Ohaver. The substitution is identical with that of the Polybius square, except that the two units of the substitute are written vertically below the original. Digits are then grouped horizontally in pairs, treating one seven-letter group at a time (if the seriation index is 7), and these pairs are replaced with letters from the same key-square. It will be noticed that we have here a form of polygram substitution, in which one seven-letter group has been replaced with another. Also, that possible errors have been confined by the seriation feature to their own seven-letter group.
Delastelle’s “trifid” cipher was of the same kind, except that a three-unit alphabet was required, resulting in three rows of units. It would have been the same as that of Fig. 4, Chapter II, but with the French accented E replacing the character &. All combinations of three units must be re-convertible into letters.
Fig. 173 shows a form of “mutilation” cipher once published by Ohaver. Beyond stating that its only key is the group-length (7 in the example), we leave the student to figure it out for himself.
As an example of recent use (1918), we are told on excellent authority that the Germans, for quite a long time during the World War, used a field cipher of the following description: There was a preliminary substitution using a key-square of the Nihilist type, except that the external co-ordinates were letters, and not digits, and were chosen in such a way that the five or six letters used were letters having very distinctive Morse symbols; this was for the avoidance of telegraphic errors. In some cases a 5 x 5 square was used, containing only a 25-letter mixed alphabet, and in others a 6 x 6 square containing a 26-letter mixed alphabet and all of the digits. The preliminary cryptogram obtained from this first encipherment was then written into a transposition block and taken off by columns, using key-word columnar transposition. The cryptograms were not afterward shortened by resubstitution, but were always twice as long as their messages, and never contained any other letters than the five (or six) originally used as co-ordinates. This German Field Cipher proved very effective until finally broken by the great French analytical genius, Georges Painvin.
We shall make no attempt, here, to go into the decryptment of these ciphers. The Delastelle “bifid” is, perhaps, a practical cipher, and the student may try his own hand at analyzing the example. The other examples should give no trouble.
Q I N H P R M L M G R N B M A H G T O L O O E L O A O D R I N H W R O
A A B M M I M M W I B M D A B T H D I L T H T H I N T L A Q M C A M F
I V N K Y N O F H B I I T R F Q L A D K V Q I N H P R M R B H S L L U
A B M E T S O A A B M M I M M I B P I V R Q F T K H I R D F G N I E M
A B E N I L M M P A S I F I O P L Y C C R C I T W I V W M F G I O O S
O E R O I K Q I E F O V N V M Q T D R S I O E R I B U Q C D O A L L A
P L A A O O C A Q O M E I D C N T I U L O L Z D G.
The mixed alphabet here was placed in the square by straight horizontals. History:
Message intercepted following a report that on the tenth of August an attenpt had
been made to enter the American embassy in a country where Royalists are opposed to
a group of radicals.
155. By PICCOLA. (Fractional. - Not so hard).
3 3 3 2 3 1 1 1 2 3 2 2 1 3 1 1 1 1 3 1 3 3 1 1 3 2 2 1 2 2 1 1 2 3 1
2 3 3 2 1 2 3 3 1 1 3 2 1 1 2 1 2 2 2 3 1 2 2 2 3 1 1 2 2 1 2 3 2 3 2
2 1 2 3 1 3 3 2 3 1 1 2 2 1 3 2 1 2 2 3 2 1 3 1 2 2 2 2 3 2 3 2 2 2 2
3 1 1 1 3 1 2 3 2 1 1 2 2 2 3 2 3 1 3 2 2 2 2 1 2 3 1 2 2 1 2 1 2 2 1
1 2 2 3 2 3 2 2 3 2 2 2 3 2 2 3 3 1 2 2 3 1 2 1 3 1 1 1 1 2 1 3 3 3 3
1 2 3 3 3 2 1 3 3 1 1 1 1 2 2 3 1 1 3 1 1 1 1 1 1 1 1 3 2 2 1 2 3 2 2
2 1 2 1 2 1 2 2 2 3 3 2 2 1 3.
E D C Y B A Z C B Z A V W X C X B A E Y D C B V A E D W B X A E Y Z D
A E Z V W D C A E D X C B Y D Y Z V C B W B A Z V E W X B X A E Y D C
B V A E W D C X A E Y Z D C E Z V D W C B E D X C B Y A Z D C B V W A
A E D C B A E E W D C B X Y D C Y Z B V A B A Z V E W X A E W D C X Y
E D Y Z C B V E D C W B A X E D Z V C B A C B V W A X Y X B Y A E D Z
E Y Z D V W C W E D X C B A D Y Z V C B A D C W B A X E E D C B A V E
D C X Y B Z A E D C B A E W D Y Z C B V A B A Z V E W X E D V W C X Y
X D C Y B A Z C Z B A V E W B A E W D X Y E D X Y C Z V V E D W C B A.
CHAPTER XXIII
Investigating the Unknown Cipher
When the type of encipherment is unknown, the decryptor’s first problem may concern the probable language used in the plaintext, and this he is usually able to determine from the source and history of the cryptogram.
His second problem is the major classification, and this, too, is usually simple, since transposition, as a rule, can be recognized by its appearance. It must, however, respond to a group-test, and for cases in which this is needed, the approximate percentages for English can be taken as follows:
| Vowels, with or without Y, | about 40% | (Variation limits: 35% to 45%) | ||
| Consonants L N R S T | about 30% | (Variation limits: 25% to 35%) | ||
| Consonants J K Q X Z | about 2% | (May be influenced by nulls). |
The 5% variation is suggested in the Parker Hitt Manual. In this connection, it should be pointed out that an apparent transposition with exactly 40% of vowels and 100% evenness in their distribution is suspicious. Many of the checkerboard systems result in this way, and also some of the codes based on pronounceable five-letter groups. Then, too, it is easily possible to construct a simple substitution cipher alphabet in such a way that the resulting cryptograms will resemble transposition, and even respond satisfactorily to a group-test. It should be carefully ascertained that a supposed transposition cryptogram does not contain the many repeated sequences which belong to simple substitution. As to those transpositions which do show an appreciable number of repeated digrams, they will probably have undergone one of the route transpositions, especially one in which columns were taken off in alternating directions.
Concerning the characteristics of simple substitution, these have been seen throughout the text; we have normal frequencies attached to the wrong letters, and we have those numerous repetitions of various lengths, occurring at all kinds of intervals, which are never found in a transposition. Here, too, we may apply a group-test, based only on the relative frequencies of letters. The five most frequent are supposed to represent the letters E T A O N or their equivalents, and should total about 45% of the text. The nine most frequent should total about 70%; the eleven most frequent well over 75%; the five of lowest frequency (which would include all of those totally absent) should correspond to the normal behavior of the group J K Q X Z.
If the simple substitution frequency count is present without the repeated sequences, then we probably have a combination of simple substitution with transposition. It becomes necessary to rewrite the cryptogram into various new arrangements until one is found which will bring back the repeated sequences. Ordinarily, the simplest kinds of transposition will have been used; sometimes the transposition will have taken place in a complete-unit block, and there will be a clue in the total number of letters present in the cryptogram.
When all letters are present in the frequency count (or all but one or two in the possible cases of 25-letter and 24-letter alphabets), a period-investigation is usually indicated. The case of periodics has been seen at considerable length, though a final hint might be added for the detection of a possible Porta encipherment. One of our many collaborators, F. R. Carter, suggests that any Porta cryptogram, periodic or otherwise, ought to show from 52% to 53% of letters N to Z — the opposite of normal.
The characteristics of digram-encipherment have been mentioned. Other polygram ciphers show corresponding characteristics, according to the polygram length, though the trail grows fainter as polygrams grow longer. A trigram-system, for instance, might be present when the cryptogram is evenly divisible into three-letter groups; it might suggest period 3, and might even show repeated sequences whose length is a multiple of 3 and which begin at serial positions such as 1, 4, 7, 10, which are the beginnings of trigrams. A great many of the trigram systems will show only repeated digrams beginning at these serial positions, or separated by intervals which are divisible by 3.
A 5 x 5 square is often suggested in the fact of a missing letter; but the fact of 26 letters does not deny one, since the careful encipherer may make use of his missing J instead of using I exclusively. Great evenness in frequencies may suggest one of the key-lengthening devices, such as autokey and progressing key; and the practical absence of repeated sequences will usually mean that a transposition has been added to a substitution. It is never a bad idea, in a puzzling example, to make the various digram-counts (in chart form): An actual digram count, in which every letter is considered the beginning of a digram; a pair-count on separated pairs, as in Playfair; the two counts which could be made with the cryptogram marked off into three-letter groups; and the kind of pair-count which could be made in Playfair if the first cryptogram-letter were omitted. Many devices, as mentioned, may be uncovered simply by “running down the alphabet.” And if the cryptogram has come from an amateur “inventor,” it may be a case of digging into one’s memory for previous “inventions.” With this last case, however, the “inventor” very often fails to submit material in proportion to the amount of complication he has introduced.
Of the examples to follow, there is none in which the system may not be learned through analysis, unless perhaps the final unnumbered cryptogram, and the material, in every case, should be suffcient for solution.
No. 163 follows Mr. Berkley’s encipherment plan, illustrated just above it.
No. 164 is said to have been taken from a German spy serving in the American army in France. This applies, however, to the first fifty groups only; the remainder was added to increase the length and to emphasize the plan followed by the spy.
No. 166 was accompanied by a plot:
“Supposed to have been found on the body of a man floating in San Diego Bay. Autopsy shows death by drowning. Victim was a local banker who had disappeared a few days earlier. Wife says no financial worries. No money missing. Banker had prospered during depression. Was yachting enthusiast. Our hero solved the cipher with the unconscious assistance of a radio crooner. Tragedy occurred in August, 1932.” The date was doubly underscored, but those who have read the message have found no reason for this and no explanation for the “crooner.”
C S R Z V Y P Q Z J K H K V Q U U C V M R T W Z N G H Q S A K O X P M
H D R W A J D F Q D F S R Z Z C G X P A J J T Z U L H T G S A H X J J
L T R N N Z P B Z G R E B N F Y G E J N M T N J J Q H P J X M O B J A
L X I A I C P F J O O F R N H.
158. By PICCOLA.
O C E E A T T I T K S N D T D S T H O O Y E A O E E P E B O T Y T A O
A D S E O E T F T T T H R V W C T H O Y L T O O H L R B T T U H R R V
R A W O B R U A O Y E H H L A B N E R L R K V C R I O N S E I D R U E
R I P.
159. By TITOGI.
A H Y N U H C E S T I T N D O R F E H R W E A T F N R F P A O T M A T
L H R E I O T N R L O D R H E E A T E S C T D I N W T S T O E A T S I
T E C D U T M S O T R L D O N G N I I S O F A E T L I T A S.
160. By PICCOLA. (Veiled reference to crypt No. 166?)
(a) H Z M Q L D N N D Z S P R F S K L L L L. (b) L I L V M S T Z U G
D H Z U Q X L L L L. (c) T V I U M F R U O Y U Q Y P S F W X L L.
(d) L I L V M F P O E Y Z K F D V U E L L L. (e) G K P V D Z T A Y T
B F Y Y C F I U L L. (f) Q B F P W Y C L U D V P Z Z O S W Y N C.
(g) Q B R T F F G V T U E N S Z H B Q E R L.
161. By PICCOLA.
E G W G W G E G T U C L C U O X G K Z T E G O B G B Y L W M I Q N K Q
Y E N F S C L H M N Y B X S E T N I W O C E G C B F C T C S Z T V G B
E A E G T U R K F K B E G K X B C T G Z Y L X C H Y E G C U O X Y T Q
F A D Q T T C U N B O G C O H X C E W E C U V E G C O C X Y X G B E A
Y T K X F Q C O T B X N E G T U C O N T O P E L E K U V U N T O C N G
N G B K W C E E C S Z K W N H E I K C C R E G C T E G T U R K F K T B
R G M W X C F G Q N I C E B P E E W E N B K I Y F K F D O F E G N U C
B G M T Z T F X C E W E C F V D T T U Z T E N E G W G L F M C T O V L.
162. By PICCOLA.
S P P A S T A S E F U N M T E H S O O A E S L E I C T R C H V U G S E
L Y R E M E N E E R O S N E H I R A E T O R N S H M O D R O P E A O R
P O S R Y P D O I N O C K G T.
Key: M E T H O D I C M E T H O D I C M E T...
6 3 8 4 7 2 5 1 6 3 8 4 7 2 5 1 6 3 8...
Cryptogram:
T H I S I S E N A N S P O S E D E T C...
C I P H E R E D U S I N G T H E PFGVT VUHDG LMRIV
I V V I G E N E S A M E K E Y F ZOPUH MMVNB FOUDQ WSURF
R E A N D A F T O R B O T H O P BIOTP FGHRU VWHKR RWEVV
E R W A R D T R E R A T I O N S WULVA MPGWV MGEAQ CUYHW LBFUT.
163. By PICCOLA.
T Y D Q V W P A Z O M B W B I R K F I O O G W C O G E F L T Q M S R F
X T C J C M A W P P Q M E X V O Q C O C Z F S F W V F E V E R S A B E
C V J J W S I P P H M M K O X V Y I D B D B C I S Y N L J C Y F K C W
E N Z E I T J V L Z M I L I I R W K R O O S Z A W E K J V J G F M Q K
G F N C K H P B R D L V I A P E S L V M D J Z Z V F Z F F R D B A D P
Q W E N L A L O E K M F M F W X O K D W D G C K K K C Q R V.
164. By VULPUS.
P E N A R C P F T I Q E V A T E N B L A T K Q F O A R E N E U I P E P
F U K X I L C N F Q E P C V B T A W A O B N C O E T I N D W B N A R D
Q F O F N B V C P E P G V G P A V A P B P F O A O B S C L B V B T F W
A N E W B T C S D N F M A N A O E V A R A R C T K Q E N B M B Q F V E
V B X K O A P E T B U I P F O F Q E L E O B R D R B Q F U A W A S C U
K L F P E W B O C O D N A M E L G V F V A N C N D M F N B V D T D L E
P F V I T I Q E Q F O C O A U C L F L A O B M E P E N A S D L B T K L
H N E P D. ..... U I L A L B O B M A V K M G U K R F P F U B U D M
F W E T A T I Q E V B R C M B W A N F Z I L E N A Q F W B T C R D T B
T K O E P E U A V A O F N B S C Z K V B W C U B O A L F O B M E X I T
D Q C Q D W A P F Q E N A L A.
165. By PICCOLA. (Again that No. 166?)
R O V L L A B T L D L B C Q M P X L B A F B T C T A T C O R L T O L C
R H P D T X L Y O A E L B X P H L X B T X X Q L D R G L T K X R L G D
B K L D P P L O H L Y O A E L K O M X B L H O E L V C R R C R J L T K
D T L R C I N X P L L L T K X L R C I N X P L V D B L V O R L P O R J
L D J O L F Y L I O P O R X P L M D E N X E L K C T T L V K O L O H H
X E X G L T O L I O Q M E O Q C B X L H O E L T V O L I X R T B L B C
R I X L K X L V D B L D F P X L T O L B X R G L T K X L B O P A T C O
R L F Y L E X T A E R L Q D C P L L B T C P P L C T L V O A P G L F X
L V O E T K L D R O T K X E L R C I N X P L T O L H C R G L O A T L T
K X L N X Y L L T K C B L Q A B T L F X L T K X L X W M P D R D T C O
R L O H L T K X L E X H X E X R I X L T O L D L I E O O R X E L D R G
L T K X L X Q M K D B C B L O R L D L G D T X L L M L B L T K X L T V
O L I X R T B L K D B L R O T L Y X T L F X X R L M D C G L.
03 65 12 45 58 28 06 41 72 14 22 03 02 17 36 88 25 20 55 77 74 51 23 45 41 42 30 24
36 61 96 09 07 78 05 44 08 06 55 92 16 93 02 15 36 37 40 87 41 01 33 77 06 36 27 54
48 29 16 78 92 66 03 10 38 17 45 23 72 96 73 01 49 25 72 38 92 72 24 55 48 08 40 92
28 01 72 96 02 04 74 61 06 99 30 45 72 69 74 93 77 23 55 36 24 93 47 84 76 35 32 89
87 76 77 64 51 96 58 43 76 02 81 38 87 69 89 55 99 23 79 55 51 06 99 71 74 69 89 84
27 25 22 39 42 53 19 93 41 66 09 75 87 37 91 87 90 91 43 19 40 30 38 16 96 22 69 38
78 02 74 92 47 25 77 91 15 40 24 45 07 07 96 48 44 15 12 06 99 44 93 19 25 23 55 30
45 87 96 18 01 78 44 29 45 86 47 69 48 30 66 44 03 41 66 37 38 22 06 42 41.
59.
Here is one which nobody has ever been able to decrypt:
V Q B U P P V S P G G F P N U E D O K D X H E W T I Y C L K X R Z A P
V U F S A W E M U X G P N I V Q J M N J J N I Z Y K B P N F R R H T B
W W N U Q J A J G J F H A D Q L Q M F L X R G G W U G W V Z G K F B C
M P X K E K Q C Q Q L B O D O Q J V E L.
APPENDIX
ENGLISH FREQUENCY AND SEQUENCE DATA
(Compiled from the MEAKER Digram Chart) Order and Frequency of Order and Frequency of Single Letters Leading DIGRAMS E 1231 L 403 B 162 TH 315 TO 111 SA 75 MA 56 T 959 D 365 G 161 HE 251 NT 110 HI 72 TA 56 A 805 C 320 V 93 AN 172 ED 107 LE 72 CE 55 O 794 U 310 K 52 IN 169 IS 106 SO 71 IC 55 N 719 P 229 Q 20 ER 154 AR 101 AS 67 LL 55 I 718 F 228 X 20 RE 148 OU 96 NO 65 NA 54 S 659 M 225 J 10 ES 145 TE 94 NE 64 RO 54 R 603 W 203 Z 9 ON 145 OF 94 EC 64 OT 53 H 514 Y 188 EA 131 IT 88 IO 63 TT 53 TI 128 HA 84 RT 63 VE 53 AT 124 SE 84 CO 59 NS 51 Group Percentages: ST 121 ET 80 BE 58 UR 49 EN 120 AL 77 DI 57 ME 48 A E I O U 38.58% ND 118 RI 77 LI 57 WH 48 OR 113 NG 75 RA 57 LY 47 L N R S T 33.43% List of Common REVERSALS: J K Q X Z 1.11% ER RE ON NO TE ET ST TS E T A 0 N 45.08% ES SE IN NI OR RO IS SI AN NA EN NE TO OT ED DE E T A O N I S R H 70.02% TI IT AT TA AR RA OF FO Order of the Leading TRIGRAMS In 10,000 Letters of Semi-Military Text - PARKER HITT THE ENT FOR NCE OFT AND ION NDE EDT STH THA TIO HAS TIS MEN INITIAL LETTERS OF WORDS: Order, as found by M. E. OHAVER ... T A O S H I W C B P F D M R, etc. Order, as found by H. O. YARDLEY .. T O A W B C D S F M R H I Y, etc. FINAL LETTERS OF WORDS: Order, as found by M. E. OHAVER ... E S T D N R O Y, etc. Order, as found by H. O. YARDLEY .. E T D N S R Y, etc. NOTE: Lists of terminals (letters, digrams, trigrams); of common affixes, short words, and common pattern-words, can be found in the booklet "CRYPTOGRAM SOLVING", obtainable from the author, M.E.Ohaver, at Columbus, Ohio. X J M M T V O Z B N Q M F B T F S F N J U G P S U I J T B E ? |
COMPARATIVE TABLE OF SINGLE-LETTER FREQUENCIES (Per 100)
ENGLISH GERMAN FRENCH ITALIAN SPANISH PORTUGUESE A 7.81 A 5. A 9.42 A 11.74 A 12.69 A 13.5 B 1.28 B 2.5 B 1.02 B .92 B 1.41 B .5 C 2.93 C 1.5 C 2.64 C 4.50 C 3.93 C 3.5 D 4.11 D 5. D 3.38 D 3.73 D 5.58 D 5. E 13.05 E 18.5 E 15.87 E 11.79 E 13.15 E 13. F 2.88 F 1.5 F .95 F .95 F .46 F 1. G 1.39 G 4. G 1.04 G 1.64 G 1.12 G 1. H 5.85 H 4. H .77 H 1.54 H 1.24 H 1. I 6.77 I 8. I 8.41 I 11.28 I 6.25 I 6. J .23 J ... J .89 J ... J .56 J .5 K .42 K 1. K ... K ... K ... K ... L 3.60 L 3. L 5.34 L 6.51 L 5.94 L 3.5 M 2.62 M 2.5 M 3.24 M 2.51 M 2.65 M 4.5 N 7.28 N 11.5 N 7.15 N 6.88 N 6.95 N 5.5 O 8.21 O 3.5 O 5.14 O 9.83 O 9.49 O 11.5 P 2.15 P .5 P 2.86 P 3.05 P 2.43 P 3. Q .14 Q ... Q 1.06 Q .61 Q 1.16 Q 1.5 R 6.64 R 7. R 6.46 R 6.37 R 6.25 R 7.5 S 6.46 S 7. S 7.90 S 4.98 S 7.60 S 7.5 T 9.02 T 5. T 7.26 T 5.62 T 3.91 T 4.5 U 2.77 U 5. U 6.24 U 3.01 U 4.63 U 4. V 1.00 V 1. V 2.15 V 2.10 V 1.07 V 1.5 W 1.49 W 1.5 W ... W ... W ... W ... X .30 X ... X .30 X ... X .13 X .2 Y 1.51 Y ... Y .24 Y ... Y 1.06 Y ... Z .09 Z 1.5 Z .32 Z .49 Z .35 Z .3 Vowel Percentages: English German French Italian Spanish Portuguese 40% 40% 45% 48% 47% 48% Percentages for L N R S T: 33% 34% 34% 30% 31% 29% NOTES: ENGLISH frequencies, which may be compared with those of Mr. Meaker, (A, 8.05; B, 1.62; C, 3.20; etc.), were taken from M.E.OHAVER. FRENCH, ITALIAN, and SPANISH frequencies were taken from a count made by the author. All four counts are based on 10,000 letters of literary text, and the dropping of the decimal point gives the actual count. The frequencies given for GERMAN and PORTUGUESE are approximations, reduced from other texts, probably military. |
(Based on a Digram Chart by M.E.OHAVER)
All figures indicate PERCENTAGES. - Taking any one letter, as A: On the left, it was contacted
14% of the time by H, 11% by E, etc., and 81% of its total contacts on that side were consonants. On the right, it was contacted 21% of the time by N, and 94% of the time by consonants. | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||