CZECH ACADEMIC CORPUS 1.0 GUIDE

B. Description of lemmas

In the CAC 2.0, lemma has a form of string lemma_:P1_;P2_,P3_^(K) where lemma is the lemma proper and P1, P2, P3, K stand for the optional additional info; lemma has a form of string LemmaProper-[0-9]* where the optional string “-[0-9]*” helps to distinguish several senses of a homonymous base form.

Table B.1. Additional information of the lemmas

Labelling Separator Description Notes
P1 : morpho-syntactic flag part of speech or its detailed specification
P2 ; semantic flag common semantic clasification
P3 , style flag stylistical classification
K ^ comment explanatory note, derivational comments, other comments

Table B.2. Morpho-syntactic flags of the lemmas

Value Description
B abbreviation
T imperfect verb
W perfect verb

Table B.3. Semantic flags of the lemmas

Value Description
E member of a particular nation, inhabitant of a particular territory
G geographical name
H chemistry
K company, organization, institution
L natural sciences
R product
S surname (family name)
U medicine
Y given name
b economy, finances
c computers and electronics
g technology in general
j justice
m other proper name
o color indication
p politcs, government, military
u culture, education, arts, other sciences
w sports
y hobby, leisure, travelling
z ecology, environment

Table B.4. Style flags of the lemmas

Value Description
a archaic
e expressive
h colloquial
l slang, argot
n dialect
s bookish
t foreign word
v vulgar
x outdated spellimg or misspeling

Table B.5. Examples of lemmas

Lemma Additional info Description
Abchaz (Abkhazian) _;E member of a particular nation
Agned _;Y_,t given name|foreign word
dobromysl (oregano) _;L natural sciences
dementi _,t foreign word
FFUK (Faculty of Arts, Charles University) _:B_;K_; u-^(Filozof._fakulta_Univerzity_Karlovy) abbreviation|institution|culture, education|abbreviation description
líně (lazy) _^(*1ý) derivation: remove one character from the end (i.e. “ě”), add character “ý”: “líný”