Framework in XML
<?xml version="1.0" encoding="UTF-8"?>
<corpus>
<head>
<author></author>
<year></year>
<title></title>
<place></place>
<publisher></publisher>
</head>
<body>
<heading></heading> <-- recursive -->
<para> <-- recursive -->
<s page="">text</s> <-- recursive -->
</para>
</body>
</corpus>
Syntactic Annotation (Parsing)
Elements
nc [noun clause]
adjc [adjective clause]
advc [adverb clause]
np [noun phrase]
adjp [adjective phrase]
advp [adverb phrase]
vp [verb phrase]
prepp [prepositional phrase]
Attributes and Values
role [syntactic role]
subj [subject]
obj [object]
valency [valency]
Morphosyntactic Annotation (Part-of-Speech Tagging)
Elements
n [noun]
art [article]
adj [adjective]
card [cardinal number]
ord [ordinal number]
v [verb]
adv [adverb]
quant [quantifier]
pron [pronoun]
proadj [proadjective]
proadv [proadverb]
prep [preposition]
prep-art [preposition+article]
exist [existential marker]
q [question marker]
neg [negative particle]
comp [complementizer]
rel [relativizer]
conj [conjunction]
interj [interjection]
punct [punctuation]
Lexical Annotation (Lemmatization)
Attribute
lemma
Morphological Annotation (Inflection Tagging)
Attributes and Values
bound [boundness]
pre [prefixal]
suf [suffixal]
person
1 [1st person]
2 [2nd person]
3 [3rd person]
number
sg [singular]
pl [plural]
gender
m [masculine]
f [feminine]
state
c [construct]
tense
past [past]
pres [present]
fut [future]
mood
imp [imperative]
inf [infinitive]
Primary Sources