Conventions for Diplomatic Transcriptions
Our practice is to preserve most of the typographical, orthographical, and compositorial features of the original text. To do this, our transcribers, editors, and encoders follow these conventions:
We preserve the long ſ.
We preserve the capitalization of characters in the source, including the second upper-case letter after a woodblock dropped capital.
We preserve the italicization of words by tagging them with a
We retain the interchangeable u/v and i/j and the use of vv for w.
We retain the vowel digraphs using the appropriate Unicode characters (e.g., æ). We silently expand typographical ligatures (e.g., ﬂ).
We retain the nasal tilde over vowels (e.g., õ) using the appropriate Unicode characters.
|Spacing Within Lines||
We close up extra spaces between words and punctuation marks. However, we retain the spacing in authorial initials, such as A. M. (for Anthony Munday). We add a single space after a comma when the comma has been used to separate two words.
We preserve the line breaks in verse sections. We also preserve the line wrapping in the prose sections of some works in our library, principally the mayoral pageant books. Prose line breaks are encoded with a self-closing
We preserve the hyphenation of words, both within and at the end of lines.
We retain all quotation marks in the text using the appropriate Unicode characters. We do not use the
MoEML Guide to Editorial Stylecalls for curly apostrophes and straight double quotation marks in both transcriptions and born-digital texts.
- For definitions of
Encode a Primary Source Transcription.
Last modification: 2016-06-16 10:37:03 -0700 (Thu, 16 Jun 2016) (mholmes)