Prepare an EEBO Transcription
The following document outlines the successive steps, websites, programs, and xml codes used to prepare a diplomatic transcription of a text retrieved from Early English Books Online (EEBO). Additionally, the document provides useful tips, shortcuts, and reminders to help encoders efficiently produce effective and truthful transcriptions.
- Search for the title or STC number
of the source text in EEBO–TCP. From the results list,
select the version which has the
full–textview (see red arrow in the left figure below). Once open, select the
printable versionview (see red arrow in the right figure below). Select all the text (Ctrl+A) and copy (Ctrl+C) it.
- Open a blank Libre Office Writer document and select the
Paste specialfunction (Ctrl+Shift+V) from the
HTML (HyperText Markup Language)in the dialogue box that appears and press
OK.This process should paste the TCP transcription into your document in a way that retains its HTML formatting.
- Record information about which copy you are transcribing. Sometimes more than one copy has been filmed. Every early modern book is unique, which means we need to be clear about the documentary witness that we are transcribing in our diplomatic transcription.
- Save document using MoEML xml:id: for example, METR1.odt for Metropolis Coronata. The encoded file will be named using the same MoEML xml:id: for example, METR1.xml. (Tip! Be sure to check the MoEML website for existing xml:ids before assigning a new one.)
- The next step is to transform special
characters (curly apostrophes, en dash, em dash, long ſ, etc.)
by using the Global Find & Replace
(Ctrl + H) operation.
- Using the Ctrl + H operation, find all straight apostrophes and replace with curly apostrophes (Unicode character: U+2019).
- Find each double hyphen (--) and replace with an em dash (—).
- Find each single hyphen (-) and replace with an en dash (–).
each lower–case short s and replace with lower–case
long ſ (Unicode character: U + 017f).
For this step, it is vital that you tick the
Match casebox before you begin, because this operation is case sensitive! Do NOT find and replace the upper–case letter S!
- Find each long ſ before a space and replace with a short s before a space.
- Find each long ſ before a period and replace with a short s before a period.
- Find each long ſ before a comma and replace with a short s before a comma.
- Find each long ſ before a semi-colon and replace with a short s before a semi-colon.
- Find each long ſ before a colon and replace with a short s before a colon.
- Find each long ſ before a question mark and replace with a short s before a question mark.
- Find each long ſ before an apostrophe and replace with a short s before an apostrophe.
- Find each long ſ before a right parenthesis and replace with a short s before a right parenthesis.
- Find each pipe character (|) before an en dash (–) and replace with a self-closing
<lb>element after an en dash. For example,–<lb/>
- Find each remaining pipe character (|) and replace with a self-closing
<lb>element with a
"hyphenInWord". For example,<lb type="hyphenInWord"/>
- Find all italicized text (leave the
Find whatfield blank but select Format>Font>Font Style>Italic) and replace with blue italicized text (leave the
Replace withfield blank but select Format>Font>Font Style>Italic>Font Color>Light Blue), so that the encoder will be able see at a glance which portions of text should be italicized.
- Use the facsimile view of the EEBO–TCP text as a reference and add in
all page breaks using a self-closing page break tag (
<pb>). This will alert the encoder to the page breaks and remind them to add the forme-works (catch words, signatures, running titles, etc.).
- Proofread the entire prepared text:
- Correct any long ſ errors introduced by your Find & Replace transformation.
- Correct any errors in EEBO–TCP’s transcription.
- Check punctuation carefully by comparing your prepared text to the facsimile image of the EEBO text. (Usually, punctuation is NOT italicized after italicized text. So you should change each piece of punctuation to normal text, unless it is clearly italicized.)
- Infer and supply gaps in the transcription if you can (but don’t make it up if you can’t). If you make an inference, give the encoder the necessary information so they can encode your transcription responsibility in a note. Put the supplied text in [square brackets]. Identify who supplied the missing text. If you supply the missing transcription, provide your name. If you obtained the supplied text from another edition of the text, provide details of the edition and name of the editor. Provide the reason that the text needed to be supplied. (e.g., [i] supplied by Janelle Jenstad because there was a gap in inking).
Last modification: 2016-05-27 14:37:29 -0700 (Fri, 27 May 2016) (tlandels)