Frequency analysis is the bedrock of decryption. All you do is count how many times each letter appears and compare that letter distribution with that of the target language. If the message has simply been reaaranged but not enciphered then the frequency distribution of the message will be the same as that of a standard passage of English.
If the message has been encoded with symbols or letter groups then a similar process can be applied.
Frequency analysis can also be done, quite effectively, with a pencil and paper.
Standard English and Transposition Ciphers. | ||
Alphabetical Order
| ||
Rank Order
| ||
3000 Letter Magazine Article.
| ||
A transposition cipher takes the plaintext and jumbles it up. The frequency distribution of the letters remains the same as that of standard English. Note that the frequency distribution for the 3000 letter text is not exact. In fact several letters do not appear in the same place in the order. To get anything approaching the ideal frequency distribution you need huge numbers of letters. |
|
|
Mono-Alphabetic Substitution Ciphers.This is the 3000 letter text enciphered using the codeword wrong with a mono-alphabetic substitution cipher.
Note that whilst the peaks don't have exactly the same heights the basic shape of the graph remains the same. The values tail down smoothly with a step in the R/D region. | ||
2 Letter ViginereThis is the 3000 letter text enciphered using a 2 letter Viginere cipher.
This is a very different graph to tthat for a nono-alphabetic cipher. The high scoring letters such as ETAION are depressed and low scoring letters are pushed higher. The graph has been flattened out but still tails from left to right. | ||
3 Letter ViginereThis is the 3000 letter text enciphered using a 3 letter Viginere cipher.
The distribution is now even flatter. | ||
5 Letter ViginereThis is the 3000 letter text enciphered using a 5 letter Viginere cipher.
The distribution is now even flatter. Most of the peaks are between 4.5% and 2.5%. Note that even the lowest scoring letters now appear. | ||
26 Letter ViginereThis is the 3000 letter text enciphered using a 26 letter Viginere cipher. The alphabet was used as the keyword.
The distribution is now almost flat across all 26 letters of the alphabet. | ||
Playfair CipherThis is the 3000 letter text enciphered using a Playfair cipher.
The distribution is similar to that of a 3 or 5 letter Viginere but note that J is missing from the ciphertext. Playfair or double Playfair ciphers always tend to have one letter of the alphabet missing completely. | ||
Digraph Ciphers11000 letters. Punctuation and spaces stripped out. Text put into pairs and counted with first letter across and second letter down. AE occurs twice. EA occurs 54 times.
| ||
last updated 20th November 2007