Cryptology - Unofficial St. Mary's College of California Web Site
Cryptology - Unofficial St. Mary's College of California Web Site
Cryptology - Unofficial St. Mary's College of California Web Site
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Chapter 6<br />
Decrypting Monoalphabetic<br />
Ciphers<br />
We begin with the unsolved cipher from Section 5.5:<br />
KNHHXKK QS PXTDQSB YQFJ NSISCYS HQEJXUK QK LXTKNUXP AO FJXKX RCNU<br />
FJQSBK QS FJX CUPXU STLXP EXUKXVXUTSHX HTUXRND LXFJCPK CR TSTDOKQK<br />
QSFNQFQCS DNHI FJX TAQDQFO TF DXTKF FC UXTP FJX DTSBNTBX CR FJX<br />
CUQBQSTD QK VXUO PXKQUTADX ANF SCF XKKXSFQTD<br />
The ciphertext has as its frequency chart<br />
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z<br />
4 5 10 10 2 17 0 6 2 9 17 3 0 9 4 7 18 4 16 16 12 2 0 27 2 0<br />
We have previously determined that this is not a Caesar cipher (as none <strong>of</strong><br />
the usual hill and valley patterns fit) so we are treating it as a monoalphabetic<br />
cipher <strong>of</strong> unknown type. The most common letters are C, D, F, J, K, Q, S,<br />
T and X, so these are probably the substitutes for etaoinshr. But which is<br />
which and how can we decide<br />
Just as ciphers with word spacing are generally easier to decrypt than those<br />
without, long ciphers are generally easier to decrypt than short ones. And for<br />
a simple reason: the longer the cipher is, the more accurate the information<br />
given by the frequency count is likely to be. If we are given a 100, 000 letter<br />
message enciphered with a monoalphabetic substitution (without other trick!),<br />
we can be quite confident that exactly one <strong>of</strong> the ciphertext letters will occur<br />
almost 13% <strong>of</strong> the time, and this letter stands for e, that the next most common<br />
letter will appear very near 9% <strong>of</strong> the time and will stand for t, etc. Very little<br />
thought will be needed to decrypt the cipher. It is the medium length ciphers <strong>of</strong><br />
25 to 100 letters, usually without proper word division, that give the beginning<br />
codebreaker some difficulty. We will concentrate on those in this chapter.<br />
89