Historical Language Models in Cryptanalysis: Case Studies on English and German.

Beáta Megyesi, Justyna Sikora, Filip Fornmark,Michelle Waldispühl,Nils Kopal, Vasily Mikhalev

HistoCrypt(2023)

引用 0|浏览6
暂无评分
摘要
In this paper, we study the impact of language models (LM) on decipherment of historical homophonic substitution ciphers. In particular, we investigate if decipherment by using hill-climbing and simulated annealing can benefit from LMs generated from historical texts in general and century-specific texts in particular. We carry out experiments on homophonic substitution ciphers with English and German as plaintext languages. We take into account ciphertext length as well as n-gram size of the LMs. We compare the results on decipherment based on historical LMs with large LMs generated from modern texts. The results show that using historical LMs in decipherment of homophonic substitution ciphers leads to significantly better performance on ciphertext produced in the 17th century or earlier, and century-specific language models yield better results on longer and older ciphertexts.
更多
查看译文
关键词
cryptanalysis,language,english
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要