
This page still pertains to UD version 1.

NUM: numeral

Definition

In Ancient Greek grammar, “numeral” is the part of speech reserved for cardinal and ordinal adjectives, as well as adverbs such as ἅπαξ ‘once’. A list of them can be found in Smyth 1920: 102-106.

In accordance with the UD guidelines, only cardinal numbers are tagged as NUM, whether they are used as adjectives or as substantivized adjectives. Ordinal numbers are tagged as ADJ, again following the UD guidelines, while adverbial numerals receive the tag ADV.
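The tagging policy above can be summarised as a small lookup. The following is only a minimal sketch: the class labels "cardinal", "ordinal" and "adverbial" and the function name upos_for_numeral are assumptions introduced for illustration, not part of the treebank or of any UD tool.

```python
# Hypothetical illustration of the tagging policy described above.
# The class labels are assumptions made for this sketch; they are not
# features or values found in the treebank itself.
UPOS_BY_NUMERAL_CLASS = {
    "cardinal": "NUM",   # cardinal numbers, adjectival or substantivized
    "ordinal": "ADJ",    # ordinal numbers are tagged as adjectives
    "adverbial": "ADV",  # adverbial numerals such as ἅπαξ 'once'
}


def upos_for_numeral(numeral_class: str) -> str:
    """Return the UD tag assigned to a traditional numeral class."""
    return UPOS_BY_NUMERAL_CLASS[numeral_class]


# Hand-picked forms; the class of each one is supplied manually here.
examples = [("δύο", "cardinal"), ("δέκατος", "ordinal"), ("ἅπαξ", "adverbial")]
tags = {form: upos_for_numeral(cls) for form, cls in examples}
print(tags)
```

In practice the class of a numeral has to come from a lexicon or from manual annotation; the lookup only records which tag each class ends up with.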

Examples

References

Smyth, Herbert Weir. 1920. A Greek Grammar for Colleges. New York: American Book Company. Available from the Perseus Digital Library and the Internet Archive.


Treebank Statistics (UD_Ancient_Greek)

There are 25 NUM lemmas (0%), 43 NUM types (0%) and 226 NUM tokens (0%). Out of 14 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: δύο, εἴκοσι, τρεῖς, τεσσαράκοντα, πέντε, ἐννέα, ἑκατόν, τριάκοντα, δέκα, πεντήκοντα

The 10 most frequent NUM types: δύω, δύο, εἴκοσι, τεσσαράκοντα, ἐννέα, ἑκατὸν, δέκα, πέντε, πεντήκοντα, τρεῖς

The 10 most frequent ambiguous lemmas: δύο (NUM 53, ADJ 19, NOUN 1), εἴκοσι (NUM 20, ADJ 14), τρεῖς (NUM 16, ADJ 9), τεσσαράκοντα (NUM 14, ADJ 6), πέντε (NUM 12, ADJ 6), ἐννέα (NUM 12, ADJ 6), ἑκατόν (NUM 12, ADJ 9), τριάκοντα (ADJ 16, NUM 11), δέκα (ADJ 15, NUM 10), πεντήκοντα (NUM 10, ADJ 9)

The 10 most frequent ambiguous types: δύω (NUM 22, VERB 3), δύο (NUM 17, ADJ 12), εἴκοσι (ADJ 14, NUM 12), τεσσαράκοντα (NUM 12, ADJ 3), ἐννέα (NUM 12, ADJ 4), ἑκατὸν (NUM 12, ADJ 8, PRON 1), δέκα (ADJ 15, NUM 10), πέντε (NUM 10, ADJ 6), πεντήκοντα (NUM 10, ADJ 9), τρεῖς (NUM 10, ADJ 4)

Morphology

The form / lemma ratio of NUM is 1.720000 (the average of all parts of speech is 3.038201).
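The reported ratio agrees with dividing the number of NUM types (43) by the number of NUM lemmas (25) given at the top of this section. A minimal sketch of that arithmetic follows; the function name is an assumption made for illustration.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts taken from the treebank statistics above: 43 NUM types, 25 NUM lemmas.
ratio = form_lemma_ratio(43, 25)
print(f"{ratio:.6f}")  # 1.720000
```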

The 1st highest number of forms (4) was observed with the lemma “δύο”: δυοῖν, δύ̓, δύο, δύω.

The 2nd highest number of forms (4) was observed with the lemma “εἴκοσι”: εἴκοσί, εἴκοσι, ἐείκοσι, ἐείκοσιν.

The 3rd highest number of forms (4) was observed with the lemma “τρεῖς”: τρία, τρεῖς, τρισὶ, τριῶν.

NUM occurs with 3 features: grc-feat/Gender (5; 2% instances), grc-feat/Number (5; 2% instances), grc-feat/Case (4; 2% instances)

NUM occurs with 5 feature-value pairs: Case=Dat, Case=Nom, Gender=Masc, Gender=Neut, Number=Plur

NUM occurs with 4 feature combinations. The most frequent feature combination is _ (221 tokens). Examples: δύω, δύο, εἴκοσι, τεσσαράκοντα, ἐννέα, ἑκατὸν, δέκα, πέντε, πεντήκοντα, τρεῖς

Relations

NUM nodes are attached to their parents using 9 different relations: grc-dep/nummod (199; 88% instances), grc-dep/conj (11; 5% instances), grc-dep/advcl (3; 1% instances), grc-dep/nsubj (3; 1% instances), grc-dep/obj (3; 1% instances), grc-dep/advmod (2; 1% instances), grc-dep/root (2; 1% instances), grc-dep/xcomp (2; 1% instances), grc-dep/obl (1; 0% instances)
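A distribution of this kind can be recomputed from a CoNLL-U file by counting the DEPREL column (the eighth of the ten tab-separated columns) for every token whose UPOS column is NUM. The sketch below is an assumption about how such a count could be done, not the script that generated this page; the three-token sample sentence is invented for illustration and is not taken from the treebank.

```python
from collections import Counter

# Invented toy sentence in CoNLL-U layout (ID, FORM, LEMMA, UPOS, XPOS,
# FEATS, HEAD, DEPREL, DEPS, MISC). Not an excerpt from the treebank.
SAMPLE = (
    "# text = δύο ἄνδρες ἦλθον\n"
    "1\tδύο\tδύο\tNUM\t_\t_\t2\tnummod\t_\t_\n"
    "2\tἄνδρες\tἀνήρ\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Plur\t3\tnsubj\t_\t_\n"
    "3\tἦλθον\tἔρχομαι\tVERB\t_\t_\t0\troot\t_\t_\n"
)


def relations_of(conllu_text: str, upos: str = "NUM") -> Counter:
    """Count the relations by which tokens with the given UPOS attach to their parents."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            counts[cols[7]] += 1
    return counts


counts = relations_of(SAMPLE)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

To run it on a whole treebank, pass the full text of the .conllu file in place of SAMPLE.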

Parents of NUM nodes belong to 9 different parts of speech: NOUN (181; 80% instances), VERB (13; 6% instances), ADJ (11; 5% instances), NUM (10; 4% instances), PRON (4; 2% instances), DET (3; 1% instances), ROOT (2; 1% instances), ADP (1; 0% instances), X (1; 0% instances)

198 (88%) NUM nodes are leaves.

7 (3%) NUM nodes have one child.

12 (5%) NUM nodes have two children.

9 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 10.
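The four child-degree counts above add up to the 226 NUM tokens of the treebank (198 + 7 + 12 + 9). As a minimal sketch of how such a histogram could be derived, the function below counts the children of each NUM node in one sentence. The tuple layout (token id, UPOS, head id) and the toy tree are assumptions made for illustration only.

```python
from collections import Counter


def child_degree_histogram(sentence, upos="NUM"):
    """Map number of children -> number of nodes with the given UPOS.

    `sentence` is a list of (token_id, upos, head_id) tuples for one tree.
    """
    children_per_head = Counter(head for _tid, _tag, head in sentence)
    return Counter(
        children_per_head[tid] for tid, tag, _head in sentence if tag == upos
    )


# Invented toy tree: token 1 (NUM) has two children, tokens 3 and 5 are leaves.
TOY = [(1, "NUM", 4), (2, "CCONJ", 1), (3, "NUM", 1), (4, "NOUN", 0), (5, "NUM", 4)]
histogram = child_degree_histogram(TOY)
print(histogram)

# Consistency check on the figures reported above.
reported = {"leaves": 198, "one child": 7, "two children": 12, "three or more": 9}
assert sum(reported.values()) == 226
```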

Children of NUM nodes are attached using 13 different relations: grc-dep/conj (15; 20% instances), grc-dep/cc (14; 19% instances), grc-dep/advmod (12; 16% instances), grc-dep/cop (9; 12% instances), grc-dep/punct (9; 12% instances), grc-dep/nmod (4; 5% instances), grc-dep/nsubj (3; 4% instances), grc-dep/appos (2; 3% instances), grc-dep/case (2; 3% instances), grc-dep/det (2; 3% instances), grc-dep/amod (1; 1% instances), grc-dep/mark (1; 1% instances), grc-dep/nummod (1; 1% instances)

Children of NUM nodes belong to 12 different parts of speech: VERB (14; 19% instances), CCONJ (12; 16% instances), ADJ (11; 15% instances), NUM (10; 13% instances), PUNCT (9; 12% instances), ADV (6; 8% instances), PART (5; 7% instances), ADP (2; 3% instances), DET (2; 3% instances), NOUN (2; 3% instances), PRON (1; 1% instances), SCONJ (1; 1% instances)


Treebank Statistics (UD_Ancient_Greek-PROIEL)

There are 70 NUM lemmas (1%), 164 NUM types (1%) and 1516 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 6 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: εἷς, δύο, τρεῖς, ἑπτά, δώδεκα, τέσσαρες, δέκα, πέντε, ἑκατόν, εἴκοσι

The 10 most frequent NUM types: δύο, εἷς, ἑπτὰ, δώδεκα, δέκα, ἓν, τρεῖς, πέντε, ἕνα, μίαν

The most frequent ambiguous lemmas (8 listed): εἷς (NUM 388, ADJ 1), διακόσιοι (NUM 22, ADJ 2), καὶ (NUM 15, ADJ 1), τετρακόσιοι (NUM 15, ADJ 2), δέκατος (ADJ 8, NUM 2), τε (CCONJ 1245, ADV 35, NUM 2), μυρίος (ADJ 7, NUM 1), χίλιος (ADJ 1, NUM 1)

The most frequent ambiguous types (7 listed): εἷς (NUM 99, ADP 2), ἑνὶ (NUM 25, ADJ 1), καὶ (CCONJ 9688, ADV 1119, NUM 15, ADJ 1), τε (CCONJ 1233, ADV 33, NUM 2), τετρακόσιοι (NUM 2, ADJ 1), εἶς (AUX 6, NUM 1), τετρακοσίας (NUM 1, ADJ 1)

Morphology

The form / lemma ratio of NUM is 2.342857 (the average of all parts of speech is 3.387371).

The 1st highest number of forms (16) was observed with the lemma “εἷς”: εἶς, εἷς, μία, μίαν, μιᾶς, μιᾷ, μιῆς, μιῇ, ἐνὶ, ἑνί, ἑνός, ἑνὶ, ἑνὸς, ἓν, ἕν, ἕνα.

The 2nd highest number of forms (12) was observed with the lemma “διακόσιοι”: διακοσίας, διακοσίους, διακοσίων, διακόσιαι, διηκοσίας, διηκοσίων, διηκοσιέων, διηκόσια, διηκόσιαί, διηκόσιαι, διηκόσιοί, διηκόσιοι.

The 3rd highest number of forms (11) was observed with the lemma “τέσσαρες”: τέσσαρα, τέσσαρας, τέσσαρες, τέσσαρσιν, τέσσερα, τέσσερας, τέσσερες, τέσσερσι, τέτορες, τεσσάρων, τεσσέρων.

NUM occurs with 3 features: grc-feat/Case (768; 51% instances), grc-feat/Number (768; 51% instances), grc-feat/Gender (736; 49% instances)

NUM occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing

NUM occurs with 33 feature combinations. The most frequent feature combination is _ (748 tokens). Examples: δύο, ἑπτὰ, δώδεκα, δέκα, πέντε, εἴκοσι, ἑκατὸν, τριήκοντα, τεσσεράκοντα, ὀκτὼ

Relations

NUM nodes are attached to their parents using 18 different relations: grc-dep/nummod (904; 60% instances), grc-dep/nsubj (107; 7% instances), grc-dep/conj (105; 7% instances), grc-dep/obj:dir (61; 4% instances), grc-dep/flat (59; 4% instances), grc-dep/obl (57; 4% instances), grc-dep/orphan (46; 3% instances), grc-dep/root (39; 3% instances), grc-dep/iobj (32; 2% instances), grc-dep/appos (30; 2% instances), grc-dep/nmod (19; 1% instances), grc-dep/nsubj:pass (17; 1% instances), grc-dep/xcomp (15; 1% instances), grc-dep/advcl (13; 1% instances), grc-dep/ccomp (4; 0% instances), grc-dep/obl:agent (4; 0% instances), grc-dep/advmod (3; 0% instances), grc-dep/csubj:pass (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (892; 59% instances), VERB (286; 19% instances), NUM (176; 12% instances), ADJ (66; 4% instances), ROOT (39; 3% instances), PROPN (18; 1% instances), PRON (16; 1% instances), ADV (12; 1% instances), ADP (6; 0% instances), AUX (4; 0% instances), SCONJ (1; 0% instances)

993 (66%) NUM nodes are leaves.

262 (17%) NUM nodes have one child.

151 (10%) NUM nodes have two children.

110 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 11.

Children of NUM nodes are attached using 18 different relations: grc-dep/nmod (162; 16% instances), grc-dep/det (134; 13% instances), grc-dep/cc (130; 13% instances), grc-dep/conj (117; 12% instances), grc-dep/case (76; 7% instances), grc-dep/flat (59; 6% instances), grc-dep/orphan (56; 6% instances), grc-dep/cop (53; 5% instances), grc-dep/nsubj (48; 5% instances), grc-dep/advmod (46; 5% instances), grc-dep/discourse (44; 4% instances), grc-dep/appos (27; 3% instances), grc-dep/acl (18; 2% instances), grc-dep/obl (17; 2% instances), grc-dep/amod (10; 1% instances), grc-dep/mark (9; 1% instances), grc-dep/advcl (7; 1% instances), grc-dep/nummod (2; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: NUM (176; 17% instances), CCONJ (139; 14% instances), NOUN (139; 14% instances), DET (134; 13% instances), ADV (92; 9% instances), ADP (77; 8% instances), ADJ (70; 7% instances), AUX (54; 5% instances), PRON (54; 5% instances), VERB (42; 4% instances), PROPN (29; 3% instances), SCONJ (9; 1% instances)

