06.01.2015 Views

Cryptology - Unofficial St. Mary's College of California Web Site

Cryptology - Unofficial St. Mary's College of California Web Site

Cryptology - Unofficial St. Mary's College of California Web Site

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

174 CHAPTER 9. DIGRAPHIC CIPHERS<br />

9.3 Recognizing and Breaking Polygraphic Ciphers<br />

What makes a digraphic cipher digraphic It encipher pairs <strong>of</strong> letters, rather<br />

than single letters. For example, the word the will be broken up either as<br />

*t-he or as th-e*. In a monoalphabetic cipher each plaintext letter is always<br />

replaced by the same ciphertext letter. Don’t let the “di” in digraphic fool you –<br />

digraphic ciphers also have this one-to-one replacement. Each pair <strong>of</strong> plaintext<br />

letters is always replaced by the same pair <strong>of</strong> ciphertext letters. So there should<br />

be many copies <strong>of</strong> the ciphertext versions <strong>of</strong> th and he in our ciphertext.<br />

Further, and here is the key point, these repetitions will all occur at even<br />

distances. If there are an odd number <strong>of</strong> letters between two occurrences <strong>of</strong><br />

the, then these two occurrences will be broken differently, and so will not lead<br />

to a repetition. It is only when the is broken the same way that we will get<br />

a repetition, and this only happens when there are an even number <strong>of</strong> letters<br />

in between, that is, when the distance is even. So a cipher that is not monoalphabetic<br />

but has many many digraph repetitions occurring at even distances is<br />

almost certainly a digraphic cipher.<br />

Let’s demonstrate with an example.<br />

Example: Decrypt XCCNQ FUARE LXELM XSBUM WLKVO KTWJU EELXA NZUQJ<br />

KCWFM KSKYN QOEYR QLXFK ELRKQ FYCSK OXELZ GZHQN UPIUM NYNVQ OXBRA VQAAN<br />

IPRYJ YKOQO WXUMK JELOZ YSCII EJLXI MQAGJ FNIKO BJJMH EIURL RKQFR SLXSR<br />

KJKOW FRQLX FKELR KEZ<br />

Of course, this is a Hill cipher. But how might we determine this if we didn’t<br />

know it As always, we start with a frequency count:<br />

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z total<br />

6 3 5 0 12 8 2 2 7 9 16 13 7 8 9 2 12 11 6 1 8 3 5 11 7 5 178<br />

This is not a Caesar cipher. Nor does it really look like a monoalphabetic<br />

cipher, since the rare letters are not that rare. Is it a polyalphabetic cipher<br />

Computing, Φ = 0.04621 suggests a keylength <strong>of</strong> about 4. In particular, this is<br />

not a monoalphabetic cipher.<br />

If it is polyalphabetic, we should be able to find some repetitions. Figure<br />

9.4 contains all repetitions longer than length 2. The repetition RQLXFKELRK is<br />

too long to be ignored: if this is a polyalphabetic cipher the keylength must<br />

divide 112. But if we pick 7, 14, 28 or 56 as the keylength, we must ignore three<br />

nice repetitions (LRKQF, RLFER, and QXKLK), all <strong>of</strong> whose distance is divisible by<br />

22. This seems strange. Further, there are 57 repetitions <strong>of</strong> length 2 (which<br />

we didn’t list). This is a huge number, far more than we’ve ever seen in a<br />

polyalphabetic cipher. And almost all <strong>of</strong> them occur in lengths divisible by 2.<br />

We have ruled out any <strong>of</strong> the ciphers we have ever seen, except for digraphic

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!