OVERVIEW OF TABLE DIFFERENCES: Version 3.0 vs. Version 3.1 A. Entries per table (counting lines that do not begin with ";") table_name v3_0 v3_1 unchanged ----------------------------------------- dictPrefixes 1296 1328 1113 dictSuffixes 945 945 945 dictStems 79216 79318 78870 tableAB 2445 2497 2439 tableAC 1152 1180 1152 tableBC 1632 1632 1632 B. Distinct lemmas in dictStems (counting lines beginning with ";; ") v3_0 v3_1 unchanged 40561 40654 40539 C., D., E.: NO CHANGES IN POS TAG INVENTORY F. Open-class stems built from dictStems (ADJ*,ADV,CV,IV,PV,NOUN*): pos v3_0 v3_1 --------------------- ADJ* 10360 10416 ADV 84 84 CV 96 101 IV 13479 13482 PV 17350 17339 IV_PASS 2812 2824 PV_PASS 381 423 NOUN* 42778 42767 --------------------- total: 87340 87436 Explanation: ADJ* and NOUN* refer to all ADJ and NOUN tags, including subcategories such as ADJ_COMP, ADJ_NUM, NOUN_PROP, NOUN_QUANT, etc. These numbers are based on the Berkeley DB file "dictStems.db" built from the original dictStems text table (using sama-dbm2txt to extract the DB file contents). The values above represent the number of distinct combinations of lookup stem, diacritized stem, POS tag, gloss and lemma_id, looking only at stems having the POS tags listed. The decrease in ADV entries is due to changing the POS labels of some stems from ADV to something else.