A University of Bristol academic has succeeded where countless cryptographers, linguistics scholars and computer programs have failed—by cracking the code of the ‘world’s most mysterious text’, the Voynich manuscript.

Although the purpose and meaning of the manuscript had eluded scholars for over a century, it took Research Associate Dr. Gerard Cheshire two weeks, using a combination of lateral thinking and ingenuity, to identify the language and writing system of the famously inscrutable document.

In his peer-reviewed paper, The Language and Writing System of MS408 (Voynich) Explained, published in the journal Romance Studies, Cheshire describes how he successfully deciphered the manuscript and, at the same time, revealed the only known example of a proto-Romance language.

“I experienced a series of ‘eureka’ moments whilst deciphering the code, followed by a sense of disbelief and excitement when I realised the magnitude of the achievement, both in terms of its linguistic importance and the revelations about the origin and content of the manuscript.

“What it reveals is even more amazing than the myths and fantasies it has generated. For example, the manuscript was compiled by Dominican nuns as a source of reference for Maria of Castile, Queen of Aragon, who happens to have been great-aunt to Catherine of Aragon.

“It is also no exaggeration to say this work represents one of the most important developments to date in Romance linguistics. The manuscript is written in proto-Romance—ancestral to today’s Romance languages including Portuguese, Spanish, French, Italian, Romanian, Catalan and Galician. The language used was ubiquitous in the Mediterranean during the Medieval period, but it was seldom written in official or important documents because Latin was the language of royalty, church and government. As a result, proto-Romance was lost from the record, until now.”

Cheshire explains in linguistic terms what makes the manuscript so unusual:

“It uses an extinct language. Its alphabet is a combination of unfamiliar and more familiar symbols. It includes no dedicated punctuation marks, although some letters have symbol variants to indicate punctuation or phonetic accents. All of the letters are in lower case and there are no double consonants. It includes diphthongs, triphthongs, quadriphthongs and even quintiphthongs for the abbreviation of phonetic components. It also includes some words and abbreviations in Latin.”

The next step is to use this knowledge to translate the entire manuscript and compile a lexicon, which Cheshire acknowledges will take some time and funding, as it comprises more than 200 pages.

“Now the language and writing system have been explained, the pages of the manuscript have been laid open for scholars to explore and reveal, for the first time, its true linguistic and informative content.”


Provided by: University of Bristol

More information: Gerard Cheshire. The Language and Writing System of MS408 (Voynich) Explained, Romance Studies (2019). DOI: 10.1080/02639904.2019.1599566

Image: This shows two women dealing with five children in a bath. The words describe different temperaments: tozosr (buzzing: too noisy), orla la (on the edge: losing patience), tolora (silly/foolish), noror (cloudy: dull/sad), or aus (golden bird: well behaved), oleios (oiled: slippery). These words survive in Catalan [tozos], Portuguese [orla], Portuguese [tolos], Romanian [noros], Catalan [or aus] and Portuguese [oleio]. The words orla la describe the mood of the woman on the left and may well be the root of the French phrase ‘oh là là’, which has a very similar sentiment.
Credit: Voynich manuscript