CT
Specifications Guide
Positional and Structural Attributes
| POSIT. | word | the token | mangia | ... ... ... |
| lemma | the lemma to which the token has been brought back | mangiare | cf. separate list | |
| pos | the Part of Speech with its Hierarchy-Defining Features (HDF) | v.m.f.ind.pr | cf. FD 1 hereunder | |
| kat | the
Morphosyntactic Features (MSF) codes
the Hierarchy-Collapsed Features (HCF) codes | 3,0,6,0,0 111 | cf. FD 2
hereunder cf. FD 1 hereunder | |
| typ | the structure of a (portion of a) text: {prose; verse; rubrica}. Crossreferenced with type | /P | /P /V /R | |
| genre | the literary genre of a text: {documentary, didactic, historical, narrative, lyric} | nar | doc; did; stor; nar; lir | |
| msform | the unaltered token really appearing in manuscript | magia | ... ... ... | |
| philform | the philological emendation, with the usual diacritics (round & square brackets, italics) | ma(n)gia; ma[n]gia; ma¦n¦gia | cf. list hereunder | |
| STRUCT. | author | the author of a text | Anonimo | cf. CT texts hereunder |
| title | title of a text | Novellino | cf. CT texts hereunder | |
| chapter | chapter number | n | ... ... ... | |
| par | paragraph number | n | ... ... ... | |
| s | sentence number | n | ... ... ... | |
| line | line number (of the page) | n | ... ... ... | |
| page | page number (of the printed edition) | n | ... ... ... | |
| type | crossreference with typ | /P | /P /V /R |
Feature Declaration (FD) 1: POS & HDF
| kat (HCF) | pos (HDF) | |||
| Tagset | noun | 20 | n.c | noun.common |
| 21 | n.p | noun.proper | ||
| adjective | 26 | adj | adjective | |
| pro-det | 30 | pd.dem.s | pro-det.demonstrative.strong | |
| 31 | pd.dem.w | pro-det.demonstrative.weak | ||
| 32 | pd.idf | pro-det.indefinit. | ||
| 33 | pd.pos.s | pro-det.possessive.strong | ||
| 34 | pd.pos.w | pro-det.possessive.weak | ||
| 35 | pd.int | pro-det.interrogative | ||
| 36 | pd.rel | pro-det.relative | ||
| 37 | pd.per.s.no | pro-det.personal.strong.nominative | ||
| 38 | pd.per.s.ob | pro-det.personal.strong.oblique | ||
| 39 | pd.per.w.ob | pro-det.personal.weak.oblique | ||
| 40 | pd.exc | pro-det.exclamative | ||
| adverb | 45 | adv.gn | adverb.general | |
| 46 | adv.pc | adverb.particle | ||
| conjunction | 50 | conj.co | conjunction.coordinative | |
| 51 | conj.sb | conjunction.subordinative | ||
| adposition | 56 | adp.pre | adposition.preposition | |
| 57 | adp.post | adposition.postposition | ||
| article | 60 | art.d | article.determinative | |
| 61 | art.i | article.indeterminative | ||
| numeral | 64 | num.car | numeral.cardinal | |
| 65 | num.ord | numeral.ordinal | ||
| interjection | 68 | intj | interjection | |
| punctuation | 70 | punct.fi | punctuation.final | |
| 71 | punct.nfi | punctuation.non-final | ||
| residuals | 75 | r.frg | residual.foreign | |
| 76 | r.abb | residual.abbreviation | ||
| 77 | r.for | residual.formulae | ||
| 78 | r.epe | residual.epenthesis | ||
| verb (main) | 111 | v.m.f.ind.pr | verb.main.finite.indicative.present | |
| 112 | v.m.f.ind.ipf | verb.main.finite.indicative.imperfect | ||
| 113 | v.m.f.ind.pt | verb.main.finite.indicative.past | ||
| 114 | v.m.f.ind.ft | verb.main.finite.indicative.future | ||
| 115 | v.m.f.sub.pr | verb.main.finite.subjunctive.present | ||
| 116 | v.m.f.sub.ipf | verb.main.finite.subjunctive.imperfect | ||
| 117 | v.m.f.cnd.pr | verb.main.finite.conditional.present | ||
| 118 | v.m.f.imp.pr | verb.main.finite.imperative.present | ||
| 121 | v.m.nf.inf.pr | verb.main.non-finite.infinitive.present | ||
| 122 | v.m.nf.par.pr | verb.main.non-finite.participle.present | ||
| 123 | v.m.nf.par.pt | verb.main.non-finite.participle.past | ||
| 124 | v.m.nf.ger.pr | verb.main.non-finite.gerunde.present | ||
| verb (auxiliar) | 211 | v.a.f.ind.pr | verb.auxiliar.finite.indicative.present | |
| 212 | v.a.f.ind.ipf | verb.auxiliar.finite.indicative.imperfect | ||
| 213 | v.a.f.ind.pt | verb.auxiliar.finite.indicative.past | ||
| 214 | v.a.f.ind.ft | verb.auxiliar.finite.indicative.future | ||
| 215 | v.a.f.sub.pr | verb.auxiliar.finite.subjunctive.present | ||
| 216 | v.a.f.sub.ipf | verb.auxiliar.finite.subjunctive.imperfect | ||
| 217 | v.a.f.cnd.pr | verb.auxiliar.finite.conditional.present | ||
| 218 | v.a.f.imp.pr | verb.auxiliar.finite.imperative.present | ||
| 221 | v.a.nf.inf.pr | verb.auxiliar.non-finite.infinitive.present | ||
| 222 | v.a.nf.par.pr | verb.auxiliar.non-finite.participle.present | ||
| 223 | v.a.nf.par.pt | verb.auxiliar.non-finite.participle.past | ||
| 224 | v.a.nf.ger.pr | verb.auxiliar.non-finite.gerunde.present | ||
| verb (modal) | 311 | v.md.f.ind.pr | verb.modal.finite.indicative.present | |
| 312 | v.md.f.ind.ipf | verb.modal.finite.indicative.imperfect | ||
| 313 | v.md.f.ind.pt | verb.modal.finite.indicative.past | ||
| 314 | v.md.f.ind.ft | verb.modal.finite.indicative.future | ||
| 315 | v.md.f.sub.pr | verb.modal.finite.subjunctive.present | ||
| 316 | v.md.f.sub.ipf | verb.modal.finite.subjunctive.imperfect | ||
| 317 | v.md.f.cnd.pr | verb.modal.finite.conditional.present | ||
| 318 | v.md.f.imp.pr | verb.modal.finite.imperative.present | ||
| 321 | v.md.nf.inf.pr | verb.modal.non-finite.infinitive.present | ||
| 322 | v.md.nf.par.pr | verb.modal.non-finite.participle.present | ||
| 323 | v.md.nf.par.pt | verb.modal.non-finite.participle.past | ||
| 324 | v.md.nf.ind.pr | verb.modal.non-finite.gerunde.present |
Feature Declaration (FD) 2: MSF
| kat (MSF) | ||||
| MSF | person | 1 | pers=1 | position 1 |
| 2 | pers=2 | |||
| 3 | pers=3 | |||
| gender | 4 | gend=masc | position 2 | |
| 5 | gend=fem | |||
| 4;5 | gend=c | |||
| number | 6 | numb=sg | position 3 | |
| 7 | numb=pl | |||
| 6;7 | numb=n | |||
| degree | 8 | degr=pos | position 4 | |
| 9 | degr=comp | |||
| 10 | degr=sup | |||
| multiword | 11-19 | loc=11-19 | position 5 |
CT texts
| author | title | genre | type | |
| CT Texts | MaestroRinuccino | Sonetti | lir | /V |
| BonoGiamboni | LibroViziVirtù | did | /P | |
| BonoGiamboni | TrattatoViziVirtù | did | /P | |
| BrunettoLatini | Favolello | did | /V | |
| BrunettoLatini | Tesoretto | did | /V | |
| BrunettoLatini | Rettorica | did | /P | |
| Anonimi | CapitoliCompagniaSanGilio(Statuti84) | doc | /P | |
| DanteAlighieri | VitaNuova | lir | /P /V | |
| Anonimi | CapitoliCompagniaMadonnaOrsanmichele(Statuti94/97) | doc | /P | |
| ConsiglioDe'Cerchi | Lettera | doc | /P | |
| Consiglio&LapoDe'Cerchi | Lettera | doc | /P | |
| CastraGualfredi&c | LibroDareEdAvere | doc | /P | |
| LapoRiccomanni | LibroDareEdAvere | doc | /P | |
| Anonimo | FioreDiFilosafi | nar | /P | |
| Anonimi | LibroOrdinamentiCompagniaSMariaCarmine(Statuti80) | doc | /P | |
| Anonimo | CronicaFiorentina | stor | /P | |
| Anonimo | VolgarizzamentoDisciplinaClericalis | nar | /P | |
| Anonimo | Novellino | nar | /P /V | |
| GuidoCavalcanti(?) | DueBallate(I'Vidi/SolPerPietà) | lir | /P | |
| GuidoCavalcanti | Rime | lir | /V | |
| JacopoCavalcanti | TreSonetti | lir | /V |
Special conventions
| Symbols | ¬ | logicalnot | compounds | Mei¬di¬donna |
| n\d-phonosyntax | nonn¬ è, foglia¬d è | |||
| ¦ | brokenbar | phylological italics | or¦o¦ | |
| · | periodcenter | proclisis (with assimilation) | de· regno, Be· ll' ho | |
| ÷ | divide | graphoclisis | porta ÷l ÷te ÷ne | |
| ~ | tilde | compendia | ka~ agosto | |
| ^ | caret | ellipsis | ^^^ (corresponding to usual "...") | |
| lacuna | molto [^^^^] francamente | |||
| × | multiply | deperdita | ××osta (unreadable characters) | |
| * | asterisk | vacua | die *** d' aprile (blank in ms.) | |
| Ø | Oslash | zero morphemes | a ÷Ø demonî 'ai demonii' | |
| ©...® | copyright®ister | typographic italics | le credenze del © Credo in Deo ® | |
| (...) | round brackets | solved abbreviation | mante(n)gna | |
| [...] | square brackets | integration | rag[g]io | |
| {...} | brace brackets | graphical symbols | {SN} 'signum notarii' |
Nota Bene
| + | Graphoclitics are treated as individual tokens. They are marked by the divide (ASCII 246 = ANSI 247) "÷" |
| + | Lemmas are introduced by the formula "lemma=" |
| + | Cassationes and expunctiones were not included in the CT texts. |
| + | The template of annotation in CT is "token_lemma=lemma,HDF/HCF,MSF1,MSF2 ,MSF3,MSF4,MSF5" |
Manuel Barbera, 29 August 2000.