Framework in XML

<?xml version="1.0" encoding="UTF-8"?>
<corpus>
 <head>
  <author></author>
  <year></year>
  <title></title>
  <place></place>
  <publisher></publisher>
 </head>
 <body>
  <heading></heading> <-- recursive -->
  <para> <-- recursive -->
   <s page="">text</s> <-- recursive -->
  </para>
 </body>
</corpus>

Syntactic Annotation (Parsing)

Elements

nc [noun clause]
adjc [adjective clause]
advc [adverb clause]
np [noun phrase]
adjp [adjective phrase]
advp [adverb phrase]
vp [verb phrase]
prepp [prepositional phrase]

Attributes and Values

role [syntactic role]
  subj [subject]
  obj [object]
valency [valency]

Morphosyntactic Annotation (Part-of-Speech Tagging)

Elements

n [noun]
art [article]
adj [adjective]
card [cardinal number]
ord [ordinal number]
v [verb]
adv [adverb]
quant [quantifier]
pron [pronoun]
proadj [proadjective]
proadv [proadverb]
prep [preposition]
prep-art [preposition+article]
exist [existential marker]
q [question marker]
neg [negative particle]
comp [complementizer]
rel [relativizer]
conj [conjunction]
interj [interjection]
punct [punctuation]

Lexical Annotation (Lemmatization)

Attribute

lemma

Morphological Annotation (Inflection Tagging)

Attributes and Values

bound [boundness]
  pre [prefixal]
  suf [suffixal]
person
  1 [1st person]
  2 [2nd person]
  3 [3rd person]
number
  sg [singular]
  pl [plural]
gender
  m [masculine]
  f [feminine]
state
  c [construct]
tense
  past [past]
  pres [present]
  fut [future]
mood
  imp [imperative]
  inf [infinitive]

RELAX NG Schema for the Annotated Corpus (Version 2003.08.15)

Primary Sources