Texts for TAG experiments
textual phenomena
Issues that are difficult to model in XML (and that should ideally be present in the example text):
- Overlap
- Discontinuity
- Hierarchy, containment, dominance
- Artifactual hierarchy
- Multiple layers of annotations
Goal of this exercise: to propose queries for text, markup, annotations, or a combination of those components.
possible texts
- Raymond Queneau, Exercises de style
- Herman Melville, Moby Dick
- Virginia Woolf, Time Passes
- Henry Thoreau, Walden
Herman Melville
Particular to Moby Dick:
- lexical variety, vocabulary richness
- first person narrator, using direct address
- textual variation between the first British edition and the first American edition
MEL = a critical archive of the works of Herman Melville
- Contains transcriptions of the texts in TEI/XML and kept in TextLab (build by Nick Laiacona), an online editing environment that uses Juxta Commons to collate, annotate, and publish the editions. You need a login to access TextLab.
At the moment the Billy Budd project is the most developed project in TextLab
Interesting for collation: chapter 1 and chapter 4 of Moby Dick - revisions, variants, expurgations
- Revision narrative to provide editorial explanation of revisions
Contextual information that is potentially interesting for querying:
- Melville's personal library
- Melville's travels (Itinerary Projects) - the use of place names in MOD
- Moby Dick adaptations
- Images and metadata related to Melville's use of art in "Moby Dick"
When using text or information from the MEL project please attribute as follows: "Moby-Dick as Fluid Text," in "Versions of Moby-Dick," The Melville Electronic Library, an NEH-funded scholarly edition (http://mel.hofstra.edu/)."
Experiment LMNL encoding Moby Dick (MD) chapter 15
- textual perspective (sentences (s), paragraphs (p))
- documentary perspective (lines, l)
- quotes and speakers
- direct address
- nantucket dialect