WeRead Powered by ReaderPub
Elementary cryptanalysis cover

Elementary cryptanalysis

Chapter 25: CHAPTER XXIII Investigating the Unknown Cipher
Open in WeRead

Explore more books like this:

About This Book

A practical instructional treatment of classical cryptanalysis that concentrates on techniques for analyzing and solving encrypted messages rather than codes or chemical and mechanical devices. It surveys concealment methods, transposition and geometrical systems, irregular and columnar types, single- and polyalphabetic substitution (including Vigenère, Gronsfeld, Beaufort, and Playfair), periodic-cipher analysis such as the Kasiski method, auto-encipherment, and approaches for attacking unknown ciphers. Each topic is presented with explanatory discussion, worked examples, and practice cryptograms, while appendices provide frequency tables, digram/trigram data, common-word lists, charts, and a bibliography to support statistical and linguistic decryptment methods.

CHAPTER XXII
Highlights of Fractional Substitution

Fractional substitution requires a cipher alphabet of the “multifid” type; that is, one in which the symbols are composed of two or more units, as in the Bacon and Trithemius alphabets (Chapter II: Figs. 3 and 4), the various “checkerboards” (Chapter XI), and so on. Polygram “alphabets” are also of this type, and seriation is a forrn of fractional substitution.

Among the older fractionals, we find a system called the “Pollux,” in which the basis was the Morse telegraphic alphabet. There were three units, the dot, the dash, and a separator (made necessary by the irregular lengths of the substitutes). There

                                 Figure 172
Delastelle's "BIFID" Substitution -(Keyword Feature Added by M. E. OHAVER)
  Preparation of Alphabet:         Checkerboard Key:         Substitutes:

       G E N • R A L                     1 2 3 4 5              S = 43
       B C D F H I K                                            E = 15
       M O P Q S T U                 1   G B M V E              N = 24
       V W X Y Z                     2   C O W N D              D = 25
                                     3   P X F Q Y
                                     4   R H S Z A
                                     5   I T L K U
Preliminary Substitution:

  S E N D S U P   P L I E S T O   M O R L E Y S   R I G H T A W   A Y.
  4 1 2 2 4 5 3   3 5 5 1 4 5 2   1 2 4 5 1 3 4   4 5 1 4 5 4 2   4 3
  3 5 4 5 3 5 1   1 3 1 5 3 2 2   3 2 1 3 5 5 3   1 1 1 2 2 5 3   5 5

Re-Substitution:

41 22 45 33 54 53 51   35 51 45 21 31 53 22   12 45 13 43 21 35 53
R  O  A  F  K  L  I    Y  I  A  C  P  L  O    B  A  M  S  C  Y  L

      45 14 54 21 11 22 53   43 55      Transmitted:
       A  V  K  C  G  O  L    S  U.
                                     R O A F K   L I Y I A   C P L O B, etc.

was a first substitution in which the letters of the text were replaced with their Morse symbols, including the space. The resulting cryptogram, composed entirely of the units dot, dash, space (. — x), was then subjected to a second substitution, using a small cipher alphabet (either digits or letters) in which each one of the three units might have any one of several different substitutes, chosen at will. For instance, a dot might be replaced with any one of digits 1, 8, 5, 6, a space with any one of digits 3, 9, 0, and a dash with any one of digits 2, 4, 7.

We find also a number of systems called “Collon” in which the basis is some one of the “checkerboards.” The text is subjected to a simple substitution in the agreed alphabet, and the resulting cryptogram is then subjected to a transposition, usually seriation, this being the final operation.

A similar system called the “Mirabeau” uses an alphabet of the same type as that of the Polybius square, in which only the digits 1-2-3-4-5 are significant. The remaining digits are all null, and numbers like 67 or 88 may be inserted at will. Numbers are written vertically (tens below units); then, in the taking off of the cryptograms, the whole series of units is taken first, and the second half of the cryptogram includes all of the tens-digits. In all of these forms, the undesirable features are self-evident. The later devices have added another operation: the regrouping of the scattered units, and their reconversion into letters.

Classic examples are those described by Delastelle as “bifid” and “trifid” (terms, incidentally, which some of our own writers find objectionable, as they do also the term “multifid”). Delastelle’s “bifid” cipher was of the kind shown in Fig. 172. A two-unit alphabet must be used, and all possible two-unit combinations must be convertible into letters. Any desired seriation-length may be agreed upon,

                                 Figure 173
        A Fractional Substitution Based on Morse Symbols - M.E.OHAVER
                    The Alphabet, Arranged by Group-Lengths:

                 E  .       S  ...         H  ....        B  -...
                 T  -       U  ..-         V  ...-        X  -..-
                            R  .-.         F  ..-.        C  -.-.
                            W  .--       ü    ..--        Y  -.--
                 I  ..      D  -..         L  .-..        Z  --..
                 A  .-      K  -.-       ä    .-.-        Q  --.-
                 N -.       G  --.         P  .--.      ö    ---.
                 M --       O  ---         J  .---      ch   ----

S    E    N    D    S    U    P
...  .    -.   -..  ...  ..-  .--.          Reverse digits, and re-group:
3    1    2    3    3    3    4             4    3    3    3    2    1    3
                                           ....  -.-  ...  ...  .-   .   --.
                                            H    K    S    S    A    E    G

though it should not be divisible by 2. In the figure, the key-word GENERAL, 7 letters, governs the seriation-length as well as the mixing of the key-square, a feature suggested by Ohaver. The substitution is identical with that of the Polybius square, except that the two units of the substitute are written vertically below the original. Digits are then grouped horizontally in pairs, treating one seven-letter group at a time (if the seriation index is 7), and these pairs are replaced with letters from the same key-square. It will be noticed that we have here a form of polygram substitution, in which one seven-letter group has been replaced with another. Also, that possible errors have been confined by the seriation feature to their own seven-letter group.

Delastelle’s “trifid” cipher was of the same kind, except that a three-unit alphabet was required, resulting in three rows of units. It would have been the same as that of Fig. 4, Chapter II, but with the French accented E replacing the character &. All combinations of three units must be re-convertible into letters.

Fig. 173 shows a form of “mutilation” cipher once published by Ohaver. Beyond stating that its only key is the group-length (7 in the example), we leave the student to figure it out for himself.

As an example of recent use (1918), we are told on excellent authority that the Germans, for quite a long time during the World War, used a field cipher of the following description: There was a preliminary substitution using a key-square of the Nihilist type, except that the external co-ordinates were letters, and not digits, and were chosen in such a way that the five or six letters used were letters having very distinctive Morse symbols; this was for the avoidance of telegraphic errors. In some cases a 5 x 5 square was used, containing only a 25-letter mixed alphabet, and in others a 6 x 6 square containing a 26-letter mixed alphabet and all of the digits. The preliminary cryptogram obtained from this first encipherment was then written into a transposition block and taken off by columns, using key-word columnar transposition. The cryptograms were not afterward shortened by resubstitution, but were always twice as long as their messages, and never contained any other letters than the five (or six) originally used as co-ordinates. This German Field Cipher proved very effective until finally broken by the great French analytical genius, Georges Painvin.

We shall make no attempt, here, to go into the decryptment of these ciphers. The Delastelle “bifid” is, perhaps, a practical cipher, and the student may try his own hand at analyzing the example. The other examples should give no trouble.

154.  By PICCOLA.  (Delastelle's "Bifid." - Repeated words:  AMERICA(N), ATTEMPT,                   REPORT, THAT, THE, OF, TO.  Other short words:  FROM, WITH,                   BEEN, HAVE.  Likely words:  REPORT, AGENT, CONFIRM, CABLE, etc).
Q I N H P   R M L M G   R N B M A   H G T O L   O O E L O   A O D R I   N H W R O
A A B M M   I M M W I   B M D A B   T H D I L   T H T H I   N T L A Q   M C A M F
I V N K Y   N O F H B   I I T R F   Q L A D K   V Q I N H   P R M R B   H S L L U
A B M E T   S O A A B   M M I M M   I B P I V   R Q F T K   H I R D F   G N I E M
A B E N I   L M M P A   S I F I O   P L Y C C   R C I T W   I V W M F   G I O O S
O E R O I   K Q I E F   O V N V M   Q T D R S   I O E R I   B U Q C D   O A L L A
P L A A O   O C A Q O   M E I D C   N T I U L   O L Z D G.

The mixed alphabet here was placed in the square by straight horizontals. History:
Message intercepted following a report that on the tenth of August an attenpt had
been made to enter the American embassy in a country where Royalists are opposed to
a group of radicals.

155.  By PICCOLA.  (Fractional. - Not so hard).

3 3 3 2 3   1 1 1 2 3   2 2 1 3 1   1 1 1 3 1   3 3 1 1 3   2 2 1 2 2   1 1 2 3 1
2 3 3 2 1   2 3 3 1 1   3 2 1 1 2   1 2 2 2 3   1 2 2 2 3   1 1 2 2 1   2 3 2 3 2
2 1 2 3 1   3 3 2 3 1   1 2 2 1 3   2 1 2 2 3   2 1 3 1 2   2 2 2 3 2   3 2 2 2 2
3 1 1 1 3   1 2 3 2 1   1 2 2 2 3   2 3 1 3 2   2 2 2 1 2   3 1 2 2 1   2 1 2 2 1
1 2 2 3 2   3 2 2 3 2   2 2 3 2 2   3 3 1 2 2   3 1 2 1 3   1 1 1 1 2   1 3 3 3 3
1 2 3 3 3   2 1 3 3 1   1 1 1 2 2   3 1 1 3 1   1 1 1 1 1   1 1 3 2 2   1 2 3 2 2
2 1 2 1 2   1 2 2 2 3   3 2 2 1 3.

156.  By PICCOLA.  (Fractional. - Nor is this very hard).

E D C Y B   A Z C B Z   A V W X C   X B A E Y   D C B V A   E D W B X   A E Y Z D
A E Z V W   D C A E D   X C B Y D   Y Z V C B   W B A Z V   E W X B X   A E Y D C
B V A E W   D C X A E   Y Z D C E   Z V D W C   B E D X C   B Y A Z D   C B V W A
A E D C B   A E E W D   C B X Y D   C Y Z B V   A B A Z V   E W X A E   W D C X Y
E D Y Z C   B V E D C   W B A X E   D Z V C B   A C B V W   A X Y X B   Y A E D Z
E Y Z D V   W C W E D   X C B A D   Y Z V C B   A D C W B   A X E E D   C B A V E
D C X Y B   Z A E D C   B A E W D   Y Z C B V   A B A Z V   E W X E D   V W C X Y
X D C Y B   A Z C Z B   A V E W B   A E W D X   Y E D X Y   C Z V V E   D W C B A.

CHAPTER XXIII
Investigating the Unknown Cipher

When the type of encipherment is unknown, the decryptor’s first problem may concern the probable language used in the plaintext, and this he is usually able to determine from the source and history of the cryptogram.

His second problem is the major classification, and this, too, is usually simple, since transposition, as a rule, can be recognized by its appearance. It must, however, respond to a group-test, and for cases in which this is needed, the approximate percentages for English can be taken as follows:

Vowels, with or without Y,  about 40%  (Variation limits: 35% to 45%)
Consonants L N R S Tabout 30%(Variation limits: 25% to 35%)
Consonants J K Q X Zabout 2%(May be influenced by nulls).

The 5% variation is suggested in the Parker Hitt Manual. In this connection, it should be pointed out that an apparent transposition with exactly 40% of vowels and 100% evenness in their distribution is suspicious. Many of the checkerboard systems result in this way, and also some of the codes based on pronounceable five-letter groups. Then, too, it is easily possible to construct a simple substitution cipher alphabet in such a way that the resulting cryptograms will resemble transposition, and even respond satisfactorily to a group-test. It should be carefully ascertained that a supposed transposition cryptogram does not contain the many repeated sequences which belong to simple substitution. As to those transpositions which do show an appreciable number of repeated digrams, they will probably have undergone one of the route transpositions, especially one in which columns were taken off in alternating directions.

 

Concerning the characteristics of simple substitution, these have been seen throughout the text; we have normal frequencies attached to the wrong letters, and we have those numerous repetitions of various lengths, occurring at all kinds of intervals, which are never found in a transposition. Here, too, we may apply a group-test, based only on the relative frequencies of letters. The five most frequent are supposed to represent the letters E T A O N or their equivalents, and should total about 45% of the text. The nine most frequent should total about 70%; the eleven most frequent well over 75%; the five of lowest frequency (which would include all of those totally absent) should correspond to the normal behavior of the group J K Q X Z.

 

If the simple substitution frequency count is present without the repeated sequences, then we probably have a combination of simple substitution with transposition. It becomes necessary to rewrite the cryptogram into various new arrangements until one is found which will bring back the repeated sequences. Ordinarily, the simplest kinds of transposition will have been used; sometimes the transposition will have taken place in a complete-unit block, and there will be a clue in the total number of letters present in the cryptogram.

 

When all letters are present in the frequency count (or all but one or two in the possible cases of 25-letter and 24-letter alphabets), a period-investigation is usually indicated. The case of periodics has been seen at considerable length, though a final hint might be added for the detection of a possible Porta encipherment. One of our many collaborators, F. R. Carter, suggests that any Porta cryptogram, periodic or otherwise, ought to show from 52% to 53% of letters N to Z — the opposite of normal.

The characteristics of digram-encipherment have been mentioned. Other polygram ciphers show corresponding characteristics, according to the polygram length, though the trail grows fainter as polygrams grow longer. A trigram-system, for instance, might be present when the cryptogram is evenly divisible into three-letter groups; it might suggest period 3, and might even show repeated sequences whose length is a multiple of 3 and which begin at serial positions such as 1, 4, 7, 10, which are the beginnings of trigrams. A great many of the trigram systems will show only repeated digrams beginning at these serial positions, or separated by intervals which are divisible by 3.

A 5 x 5 square is often suggested in the fact of a missing letter; but the fact of 26 letters does not deny one, since the careful encipherer may make use of his missing J instead of using I exclusively. Great evenness in frequencies may suggest one of the key-lengthening devices, such as autokey and progressing key; and the practical absence of repeated sequences will usually mean that a transposition has been added to a substitution. It is never a bad idea, in a puzzling example, to make the various digram-counts (in chart form): An actual digram count, in which every letter is considered the beginning of a digram; a pair-count on separated pairs, as in Playfair; the two counts which could be made with the cryptogram marked off into three-letter groups; and the kind of pair-count which could be made in Playfair if the first cryptogram-letter were omitted. Many devices, as mentioned, may be uncovered simply by “running down the alphabet.” And if the cryptogram has come from an amateur “inventor,” it may be a case of digging into one’s memory for previous “inventions.” With this last case, however, the “inventor” very often fails to submit material in proportion to the amount of complication he has introduced.

 

Of the examples to follow, there is none in which the system may not be learned through analysis, unless perhaps the final unnumbered cryptogram, and the material, in every case, should be suffcient for solution.

No. 163 follows Mr. Berkley’s encipherment plan, illustrated just above it.

No. 164 is said to have been taken from a German spy serving in the American army in France. This applies, however, to the first fifty groups only; the remainder was added to increase the length and to emphasize the plan followed by the spy.

No. 166 was accompanied by a plot:

“Supposed to have been found on the body of a man floating in San Diego Bay. Autopsy shows death by drowning. Victim was a local banker who had disappeared a few days earlier. Wife says no financial worries. No money missing. Banker had prospered during depression. Was yachting enthusiast. Our hero solved the cipher with the unconscious assistance of a radio crooner. Tragedy occurred in August, 1932.” The date was doubly underscored, but those who have read the message have found no reason for this and no explanation for the “crooner.”

157.  By PICCOLA.

C S R Z V   Y P Q Z J   K H K V Q   U U C V M   R T W Z N   G H Q S A   K O X P M
H D R W A   J D F Q D   F S R Z Z   C G X P A   J J T Z U   L H T G S   A H X J J
L T R N N   Z P B Z G   R E B N F   Y G E J N   M T N J J   Q H P J X   M O B J A
L X I A I   C P F J O   O F R N H.

158.  By PICCOLA.

O C E E A   T T I T K   S N D T D   S T H O O   Y E A O E   E P E B O   T Y T A O
A D S E O   E T F T T   T H R V W   C T H O Y   L T O O H   L R B T T   U H R R V
R A W O B   R U A O Y   E H H L A   B N E R L   R K V C R   I O N S E   I D R U E
R I P.

159.  By TITOGI.

A H Y N U   H C E S T   I T N D O   R F E H R   W E A T F   N R F P A   O T M A T
L H R E I   O T N R L   O D R H E   E A T E S   C T D I N   W T S T O   E A T S I
T E C D U   T M S O T   R L D O N   G N I I S   O F A E T   L I T A S.

160.  By PICCOLA. (Veiled reference to crypt No. 166?)
(a)  H Z M Q L  D N N D Z  S P R F S  K L L L L.       (b)  L I L V M  S T Z U G
D H Z U Q  X L L L L.       (c)  T V I U M  F R U O Y  U Q Y P S  F W X L L.
(d)  L I L V M  F P O E Y  Z K F D V  U E L L L.       (e)  G K P V D  Z T A Y T
B F Y Y C  F I U L L.       (f)  Q B F P W  Y C L U D  V P Z Z O  S W Y N C.
(g)  Q B R T F  F G V T U  E N S Z H  B Q E R L.

161.  By PICCOLA.

E G W G W   G E G T U   C L C U O   X G K Z T   E G O B G   B Y L W M   I Q N K Q
Y E N F S   C L H M N   Y B X S E   T N I W O   C E G C B   F C T C S   Z T V G B
E A E G T   U R K F K   B E G K X   B C T G Z   Y L X C H   Y E G C U   O X Y T Q
F A D Q T   T C U N B   O G C O H   X C E W E   C U V E G   C O C X Y   X G B E A
Y T K X F   Q C O T B   X N E G T   U C O N T   O P E L E   K U V U N   T O C N G
N G B K W   C E E C S   Z K W N H   E I K C C   R E G C T   E G T U R   K F K T B
R G M W X   C F G Q N   I C E B P   E E W E N   B K I Y F   K F D O F   E G N U C
B G M T Z   T F X C E   W E C F V   D T T U Z   T E N E G   W G L F M   C T O V L.

162.  By PICCOLA.

S P P A S   T A S E F   U N M T E   H S O O A   E S L E I   C T R C H   V U G S E
L Y R E M   E N E E R   O S N E H   I R A E T   O R N S H   M O D R O   P E A O R
P O S R Y   P D O I N   O C K G T.

         The "NICODEMUS" Cipher (Harold Berkley) SPECIMEN ENCIPHERMENT
Key:  M E T H O D I C     M E T H O D I C     M E T...
      6 3 8 4 7 2 5 1     6 3 8 4 7 2 5 1     6 3 8...
                                                                     Cryptogram:
      T H I S I S E N     A N S P O S E D     E T C...
      C I P H E R E D     U S I N G T H E                  PFGVT   VUHDG   LMRIV
      I V V I G E N E     S A M E K E Y F          ZOPUH   MMVNB   FOUDQ   WSURF
      R E A N D A F T     O R B O T H O P          BIOTP   FGHRU   VWHKR   RWEVV
      E R W A R D T R     E R A T I O N S  WULVA   MPGWV   MGEAQ   CUYHW   LBFUT.


163.  By PICCOLA.

T Y D Q V   W P A Z O   M B W B I   R K F I O   O G W C O   G E F L T   Q M S R F
X T C J C   M A W P P   Q M E X V   O Q C O C   Z F S F W   V F E V E   R S A B E
C V J J W   S I P P H   M M K O X   V Y I D B   D B C I S   Y N L J C   Y F K C W
E N Z E I   T J V L Z   M I L I I   R W K R O   O S Z A W   E K J V J   G F M Q K
G F N C K   H P B R D   L V I A P   E S L V M   D J Z Z V   F Z F F R   D B A D P
Q W E N L   A L O E K   M F M F W   X O K D W   D G C K K   K C Q R V.

164.  By VULPUS.

P E N A R   C P F T I   Q E V A T   E N B L A   T K Q F O   A R E N E   U I P E P
F U K X I   L C N F Q   E P C V B   T A W A O   B N C O E   T I N D W   B N A R D
Q F O F N   B V C P E   P G V G P   A V A P B   P F O A O   B S C L B   V B T F W
A N E W B   T C S D N   F M A N A   O E V A R   A R C T K   Q E N B M   B Q F V E
V B X K O   A P E T B   U I P F O   F Q E L E   O B R D R   B Q F U A   W A S C U
K L F P E   W B O C O   D N A M E   L G V F V   A N C N D   M F N B V   D T D L E
P F V I T   I Q E Q F   O C O A U   C L F L A   O B M E P   E N A S D   L B T K L
H N E P D.    .....     U I L A L   B O B M A   V K M G U   K R F P F   U B U D M
F W E T A   T I Q E V   B R C M B   W A N F Z   I L E N A   Q F W B T   C R D T B
T K O E P   E U A V A   O F N B S   C Z K V B   W C U B O   A L F O B   M E X I T
D Q C Q D   W A P F Q   E N A L A.

165.  By PICCOLA.  (Again that No. 166?)

R O V L L   A B T L D   L B C Q M   P X L B A   F B T C T   A T C O R   L T O L C
R H P D T   X L Y O A   E L B X P   H L X B T   X X Q L D   R G L T K   X R L G D
B K L D P   P L O H L   Y O A E L   K O M X B   L H O E L   V C R R C   R J L T K
D T L R C   I N X P L   L L T K X   L R C I N   X P L V D   B L V O R   L P O R J
L D J O L   F Y L I O   P O R X P   L M D E N   X E L K C   T T L V K   O L O H H
X E X G L   T O L I O   Q M E O Q   C B X L H   O E L T V   O L I X R   T B L B C
R I X L K   X L V D B   L D F P X   L T O L B   X R G L T   K X L B O   P A T C O
R L F Y L   E X T A E   R L Q D C   P L L B T   C P P L C   T L V O A   P G L F X
L V O E T   K L D R O   T K X E L   R C I N X   P L T O L   H C R G L   O A T L T
K X L N X   Y L L T K   C B L Q A   B T L F X   L T K X L   X W M P D   R D T C O
R L O H L   T K X L E   X H X E X   R I X L T   O L D L I   E O O R X   E L D R G
L T K X L   X Q M K D   B C B L O   R L D L G   D T X L L   M L B L T   K X L T V
O L I X R   T B L K D   B L R O T   L Y X T L   F X X R L   M D C G L.

166.  By CACHE. (Contributor, C. H. Price, died without explaining his key).
03 65 12 45 58 28 06 41 72 14 22 03 02 17 36 88 25 20 55 77 74 51 23 45 41 42 30 24
36 61 96 09 07 78 05 44 08 06 55 92 16 93 02 15 36 37 40 87 41 01 33 77 06 36 27 54
48 29 16 78 92 66 03 10 38 17 45 23 72 96 73 01 49 25 72 38 92 72 24 55 48 08 40 92
28 01 72 96 02 04 74 61 06 99 30 45 72 69 74 93 77 23 55 36 24 93 47 84 76 35 32 89
87 76 77 64 51 96 58 43 76 02 81 38 87 69 89 55 99 23 79 55 51 06 99 71 74 69 89 84
27 25 22 39 42 53 19 93 41 66 09 75 87 37 91 87 90 91 43 19 40 30 38 16 96 22 69 38
78 02 74 92 47 25 77 91 15 40 24 45 07 07 96 48 44 15 12 06 99 44 93 19 25 23 55 30
45 87 96 18 01 78 44 29 45 86 47 69 48 30 66 44 03 41 66 37 38 22 06 42 41.
                                                                             59.

Here is one which nobody has ever been able to decrypt:

V Q B U P   P V S P G   G F P N U   E D O K D   X H E W T   I Y C L K   X R Z A P
V U F S A   W E M U X   G P N I V   Q J M N J   J N I Z Y   K B P N F   R R H T B
W W N U Q   J A J G J   F H A D Q   L Q M F L   X R G G W   U G W V Z   G K F B C
M P X K E   K Q C Q Q   L B O D O   Q J V E L.

APPENDIX

                   ENGLISH FREQUENCY AND SEQUENCE DATA
                 (Compiled from the MEAKER Digram Chart)
Order and Frequency of                              Order and Frequency of    Single Letters                                     Leading DIGRAMS
E 1231   L  403   B  162                TH  315    TO  111    SA  75    MA  56
T  959   D  365   G  161                HE  251    NT  110    HI  72    TA  56
A  805   C  320   V   93                AN  172    ED  107    LE  72    CE  55
O  794   U  310   K   52                IN  169    IS  106    SO  71    IC  55
N  719   P  229   Q   20                ER  154    AR  101    AS  67    LL  55
I  718   F  228   X   20                RE  148    OU   96    NO  65    NA  54
S  659   M  225   J   10                ES  145    TE   94    NE  64    RO  54
R  603   W  203   Z    9                ON  145    OF   94    EC  64    OT  53
H  514   Y  188                         EA  131    IT   88    IO  63    TT  53
                                        TI  128    HA   84    RT  63    VE  53
                                        AT  124    SE   84    CO  59    NS  51
   Group Percentages:                   ST  121    ET   80    BE  58    UR  49
                                        EN  120    AL   77    DI  57    ME  48
A E I O U           38.58%              ND  118    RI   77    LI  57    WH  48
                                        OR  113    NG   75    RA  57    LY  47
L N R S T           33.43%
                                             List of Common REVERSALS:
J K Q X Z            1.11%
                                           ER RE   ON NO   TE ET   ST TS
E T A 0 N           45.08%                 ES SE   IN NI   OR RO   IS SI
                                           AN NA   EN NE   TO OT   ED DE
E T A O N I S R H   70.02%                 TI IT   AT TA   AR RA   OF FO

                    Order of the Leading TRIGRAMS          In 10,000 Letters of Semi-Military Text - PARKER HITT
                THE      ENT      FOR      NCE      OFT
                AND      ION      NDE      EDT      STH
                THA      TIO      HAS      TIS      MEN

INITIAL LETTERS OF WORDS:

        Order, as found by M. E. OHAVER ... T A O   S H I W C   B P F D M R, etc.

        Order, as found by H. O. YARDLEY .. T O A   W B C D S   F M R H I Y, etc.

FINAL LETTERS OF WORDS:

        Order, as found by M. E. OHAVER ... E  S  T  D  N  R  O  Y, etc.

        Order, as found by H. O. YARDLEY .. E  T  D  N  S  R  Y, etc.

NOTE:  Lists of terminals (letters, digrams, trigrams); of common affixes,
       short words, and common pattern-words, can be found in the booklet
       "CRYPTOGRAM SOLVING", obtainable from the author, M.E.Ohaver, at
       Columbus, Ohio.

X J M M T   V O Z B N   Q M F B T   F S F N J   U G P S U   I J T B E  ?

             COMPARATIVE TABLE OF SINGLE-LETTER FREQUENCIES  (Per 100)
ENGLISH     GERMAN      FRENCH      ITALIAN     SPANISH     PORTUGUESE
A  7.81     A   5.      A  9.42     A 11.74     A 12.69     A  13.5
B  1.28     B   2.5     B  1.02     B   .92     B  1.41     B    .5
C  2.93     C   1.5     C  2.64     C  4.50     C  3.93     C   3.5
D  4.11     D   5.      D  3.38     D  3.73     D  5.58     D   5.
E 13.05     E  18.5     E 15.87     E 11.79     E 13.15     E  13.
F  2.88     F   1.5     F   .95     F   .95     F   .46     F   1.
G  1.39     G   4.      G  1.04     G  1.64     G  1.12     G   1.
H  5.85     H   4.      H   .77     H  1.54     H  1.24     H   1.
I  6.77     I   8.      I  8.41     I 11.28     I  6.25     I   6.
J   .23     J     ...   J   .89     J    ...    J   .56     J    .5
K   .42     K   1.      K    ...    K    ...    K    ...    K     ...
L  3.60     L   3.      L  5.34     L  6.51     L  5.94     L   3.5
M  2.62     M   2.5     M  3.24     M  2.51     M  2.65     M   4.5
N  7.28     N  11.5     N  7.15     N  6.88     N  6.95     N   5.5
O  8.21     O   3.5     O  5.14     O  9.83     O  9.49     O  11.5
P  2.15     P    .5     P  2.86     P  3.05     P  2.43     P   3.
Q   .14     Q     ...   Q  1.06     Q   .61     Q  1.16     Q   1.5
R  6.64     R   7.      R  6.46     R  6.37     R  6.25     R   7.5
S  6.46     S   7.      S  7.90     S  4.98     S  7.60     S   7.5
T  9.02     T   5.      T  7.26     T  5.62     T  3.91     T   4.5
U  2.77     U   5.      U  6.24     U  3.01     U  4.63     U   4.
V  1.00     V   1.      V  2.15     V  2.10     V  1.07     V   1.5
W  1.49     W   1.5     W    ...    W    ...    W    ...    W     ...
X   .30     X     ...   X   .30     X    ...    X   .13     X    .2
Y  1.51     Y     ...   Y   .24     Y    ...    Y  1.06     Y     ...
Z   .09     Z   1.5     Z   .32     Z   .49     Z   .35     Z    .3


                          Vowel Percentages:

English     German      French      Italian     Spanish     Portuguese
   40%          40%        45%         48%         47%          48%

                      Percentages for L N R S T:

   33%          34%        34%         30%         31%          29%


NOTESENGLISH frequencies, which may be compared with those of Mr. Meaker,
        (A, 8.05;  B, 1.62;  C, 3.20; etc.), were taken from M.E.OHAVER.
        FRENCH, ITALIAN, and SPANISH frequencies were taken from a count
        made by the author. All four counts are based on 10,000 letters
        of literary text, and the dropping of the decimal point gives
        the actual count. The frequencies given for GERMAN and PORTUGUESE        are approximations, reduced from other texts, probably military.

 

Chart Showing Normal CONTACT PERCENTAGES - Compiled by F. R. CARTER

(Based on a Digram Chart by M.E.OHAVER)
%%
V.C.V.C.
1981P4 L4 C5 D5 M5 N6 S6 W7 T8 R8 E11H14   A   N21T17S12R10L8 D5 C4 M4 694
55 45       Y4 B4 N5 T5 U8 D9 O9 S10A16E16   B   E34L17U11O9 A7 Y5 B4 R4 70 30
61 39                   U4 O5 S8 N13A13I18E20   C   H19O19E17A13I7 T6 R4 L4 K4 59 41
52 48                R4 I5 L6 A10N29E39   D   E16I14T14A10O8 S6 U5 54 46
8 92          C4 B4 E5 M5 V5 D5 S5 L5 N6 T6 R11H24   E   R15D10S9 N8 A7 T6 M5 E4 C4 O4 W4 21 79
69 31             S4 N5 F5 D5 A6 I7 E12O41   F   T22O21E10I9 A7 R5 F5 U4 52 48
36 64                   O4 D4 U5 R5 I9 E9 A10N48   G   E14H14O12R10A8 T6 F5 W4 I4 S4 42 58
7 93       G4 E5 W5 S7 C9 T62   H   E50A23I12O7 90 10
13 87          F4 M4 W5 E6 N6 L8 D8 S8 R9 H11T14   I   N25T13S10O8 C7 R4 E4 M4 A4 L4 17 83
28 72 Y7 W7 T7 S7 N7 E7 C7 B7 A14M29   J   U35O29A12E12M6 W6 88 12
53 47    Y5 U5 I5 N7 A11R13E13O15C18   K   E34I21N10A9 T7 S6 68 32
52 48       N4 P4 T6 I7 B7 U7 O10E11L11A17   L   E19I15Y12L12O9 A8 D7 U4 65 35
69 31       S4 D4 M5 R5 I12A13O16E24   M   E26A17O12I11P5 M5 71 29
89 11                         U7 E14O22A23I24   N   D16T14G12E10A7 S7 O7 I6 C5 32 68
21 79 M4 O4 D4 L4 P4 H5 N6 E6 C7 F7 S8 I8 R9 T11   O   N20F14R11U10T6 M5 L5 S4 W4 O4 18 82
47 53       R4 L4 T4 N4 I4 P6 M6 A7 O8 U10E16S17   P   O17E16A15R15L8 U6 P6 T5 I5 S4 59 41
20 80 O10N10L10E10D10R20S30   Q   U100    100 --
70 30             P5 I5 U5 T7 A13O16E30   R   E23O12A11T11I10S7 Y4 61 39
48 52          D4 T4 O6 U6 R7 N8 S9 I11A16E18   S   T19E11O10I9 S9 A8 H6 P5 U4 41 59
43 57 U4 O5 D6 T6 F7 R7 E8 I10N10S13A14   T   H39I10O10E8 A7 T6 R4 38 62
35 65             P5 F5 T5 L5 B6 D8 S9 O30   U   N18S13T13R12L10P7 B4 C4 8 92
88 12       R6 U10O16A16I16E30   V   E65I14O9 A8 99 1
48 52       G4 D4 Y5 N9 S10T11O16E23   W   A27H16I16E15O11N4 80 20
95 5                         U4 N5 I16E74   X   P29T19I14A14U10C5 K5 O5 38 62
24 76                         B4 N8 A8 T13E14R15L25   Y   A15O12S12T9 W7 H5 I5 E5 D4 M4 B4 38 62
88 12           O12N12A25I50   Z   E43I43W14 86 14
    All figures indicate PERCENTAGES. - Taking any one letter, as A:  On the left, it was contacted
    14% of the time by H, 11% by E, etc., and 81% of its total contacts on that side were consonants.
    On the right, it was contacted 21% of the time by N, and 94%  of the time by consonants.