NUM: numeral
Definition
In Ancient Greek grammar, “numeral” is the PoS reserved for cardinal and ordinal adjectives, as well as adverbs such as ἅπαξ ‘once’. A list of them can be found in Smyth 1920: 102-106.
In accordance with the UD guidelines, only cardinal numbers are tagged as NUM, whether they are used as adjectives or as substantivized adjectives. Ordinal numbers are tagged as ADJ, and adverbial numerals receive the PoS ADV.
Examples
- τρεῖς, τρία “three”
- πεντεκαίδεκα “fifteen”
- ὀκτακόσιοι “eight hundred”
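The tagging rule in the definition can be sketched as a small lookup. This is only an illustration: the class labels (`cardinal`, `ordinal`, `adverbial`) and the helper `upos_for_numeral` are hypothetical names, not part of the UD guidelines or of any treebank tool, and τρίτος ‘third’ is added here as an ordinal example.

```python
# Hypothetical mapping from traditional numeral class to UD part of speech.
# The class labels are assumptions made for this sketch.
UPOS_BY_NUMERAL_CLASS = {
    "cardinal": "NUM",   # e.g. τρεῖς "three"
    "ordinal": "ADJ",    # e.g. τρίτος "third"
    "adverbial": "ADV",  # e.g. ἅπαξ "once"
}


def upos_for_numeral(numeral_class: str) -> str:
    """Return the UPOS tag for a traditional numeral class."""
    return UPOS_BY_NUMERAL_CLASS[numeral_class]


for form, numeral_class in [("τρεῖς", "cardinal"), ("τρίτος", "ordinal"), ("ἅπαξ", "adverbial")]:
    print(form, upos_for_numeral(numeral_class))
```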
References
Smyth, Herbert Weir. 1920. A Greek Grammar for Colleges. New York: American Book Company (Perseus Digital Library; Internet Archive).
Treebank Statistics (UD_Ancient_Greek)
There are 25 NUM lemmas (0%), 43 NUM types (0%) and 226 NUM tokens (0%).
Out of 14 observed tags, the rank of NUM is: 10 in number of lemmas, 11 in number of types and 13 in number of tokens.
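Counts of this kind could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, assuming that "types" means distinct word forms and "lemmas" distinct lemma strings. The function name `num_counts` and the toy rows are invented for illustration and are not taken from the treebank.

```python
def num_counts(conllu_text: str, upos: str = "NUM") -> tuple[int, int, int]:
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            tokens += 1
            lemmas.add(lemma)
            types.add(form)
    return len(lemmas), len(types), tokens


# Invented toy rows: ID, FORM, LEMMA, UPOS, then placeholder columns.
rows = [
    ("1", "δύο", "δύο", "NUM"),
    ("2", "ἡμέραι", "ἡμέρα", "NOUN"),
    ("3", "δύω", "δύο", "NUM"),
    ("4", "τρία", "τρεῖς", "NUM"),
    ("5", "δύο", "δύο", "NUM"),
]
sample = "\n".join("\t".join(row + ("_",) * 6) for row in rows)

print(num_counts(sample))  # (2, 3, 4)
```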
The 10 most frequent NUM lemmas: δύο, εἴκοσι, τρεῖς, τεσσαράκοντα, πέντε, ἐννέα, ἑκατόν, τριάκοντα, δέκα, πεντήκοντα
The 10 most frequent NUM types: δύω, δύο, εἴκοσι, τεσσαράκοντα, ἐννέα, ἑκατὸν, δέκα, πέντε, πεντήκοντα, τρεῖς
The 10 most frequent ambiguous lemmas: δύο (NUM 53, ADJ 19, NOUN 1), εἴκοσι (NUM 20, ADJ 14), τρεῖς (NUM 16, ADJ 9), τεσσαράκοντα (NUM 14, ADJ 6), πέντε (NUM 12, ADJ 6), ἐννέα (NUM 12, ADJ 6), ἑκατόν (NUM 12, ADJ 9), τριάκοντα (ADJ 16, NUM 11), δέκα (ADJ 15, NUM 10), πεντήκοντα (NUM 10, ADJ 9)
The 10 most frequent ambiguous types: δύω (NUM 22, VERB 3), δύο (NUM 17, ADJ 12), εἴκοσι (ADJ 14, NUM 12), τεσσαράκοντα (NUM 12, ADJ 3), ἐννέα (NUM 12, ADJ 4), ἑκατὸν (NUM 12, ADJ 8, PRON 1), δέκα (ADJ 15, NUM 10), πέντε (NUM 10, ADJ 6), πεντήκοντα (NUM 10, ADJ 9), τρεῖς (NUM 10, ADJ 4)
Morphology
The form / lemma ratio of NUM is 1.720000 (the average of all parts of speech is 3.038201).
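The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas. The sketch below checks that arithmetic against the figures on this page (43 types and 25 lemmas here, 164 types and 70 lemmas in the PROIEL statistics further down). That the ratio is computed this way is an assumption inferred from the numbers, and `form_lemma_ratio` is a hypothetical helper name.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma."""
    return n_types / n_lemmas


print(f"{form_lemma_ratio(43, 25):.6f}")   # 1.720000
print(f"{form_lemma_ratio(164, 70):.6f}")  # 2.342857
```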
The 1st highest number of forms (4) was observed with the lemma “δύο”: δυοῖν, δύ̓, δύο, δύω.
The 2nd highest number of forms (4) was observed with the lemma “εἴκοσι”: εἴκοσί, εἴκοσι, ἐείκοσι, ἐείκοσιν.
The 3rd highest number of forms (4) was observed with the lemma “τρεῖς”: τρία, τρεῖς, τρισὶ, τριῶν.
NUM occurs with 3 features: grc-feat/Gender (5; 2% instances), grc-feat/Number (5; 2% instances), grc-feat/Case (4; 2% instances)
NUM occurs with 5 feature-value pairs: Case=Dat, Case=Nom, Gender=Masc, Gender=Neut, Number=Plur
NUM occurs with 4 feature combinations.
The most frequent feature combination is _ (221 tokens).
Examples: δύω, δύο, εἴκοσι, τεσσαράκοντα, ἐννέα, ἑκατὸν, δέκα, πέντε, πεντήκοντα, τρεῖς
Relations
NUM nodes are attached to their parents using 9 different relations: grc-dep/nummod (199; 88% instances), grc-dep/conj (11; 5% instances), grc-dep/advcl (3; 1% instances), grc-dep/nsubj (3; 1% instances), grc-dep/obj (3; 1% instances), grc-dep/advmod (2; 1% instances), grc-dep/root (2; 1% instances), grc-dep/xcomp (2; 1% instances), grc-dep/obl (1; 0% instances)
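The percentages above can be recomputed from the raw counts. The sketch below assumes that each share is the relation count divided by the total number of NUM tokens, rounded to the nearest whole percent. The counts are copied from the list above, and `share` is a hypothetical helper name.

```python
# Relation counts for NUM nodes, copied from the statistics above.
relation_counts = {
    "nummod": 199, "conj": 11, "advcl": 3, "nsubj": 3, "obj": 3,
    "advmod": 2, "root": 2, "xcomp": 2, "obl": 1,
}

total = sum(relation_counts.values())  # equals the 226 NUM tokens


def share(count: int, total: int) -> int:
    """Percentage of instances, rounded to a whole number."""
    return round(100 * count / total)


for relation, count in relation_counts.items():
    print(f"{relation}: {count}; {share(count, total)}% instances")
```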
Parents of NUM nodes belong to 9 different parts of speech: NOUN (181; 80% instances), VERB (13; 6% instances), ADJ (11; 5% instances), NUM (10; 4% instances), PRON (4; 2% instances), DET (3; 1% instances), ROOT (2; 1% instances), ADP (1; 0% instances), X (1; 0% instances)
198 (88%) NUM nodes are leaves.
7 (3%) NUM nodes have one child.
12 (5%) NUM nodes have two children.
9 (4%) NUM nodes have three or more children.
The highest child degree of a NUM node is 10.
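A child-degree breakdown like the one above can be derived from the HEAD column of a dependency tree. The following is a minimal sketch over one sentence, with invented toy rows of the form `(id, upos, head)`. The function name `child_degree_histogram` and the bucket labels are assumptions made for illustration.

```python
from collections import Counter


def child_degree_histogram(rows, upos="NUM"):
    """Bucket nodes with the given UPOS by their number of children.

    rows: list of (node_id, upos, head_id) for a single sentence.
    """
    n_children = Counter(head for _node_id, _upos, head in rows)
    histogram = Counter()
    for node_id, node_upos, _head in rows:
        if node_upos != upos:
            continue
        degree = n_children[node_id]  # Counter returns 0 for leaves
        if degree == 0:
            histogram["leaf"] += 1
        elif degree == 1:
            histogram["one"] += 1
        elif degree == 2:
            histogram["two"] += 1
        else:
            histogram["three_or_more"] += 1
    return histogram


# Invented toy tree: node 1 is the root and has children 3, 4 and 5.
rows = [(1, "NUM", 0), (2, "CCONJ", 3), (3, "NUM", 1), (4, "PUNCT", 1), (5, "NUM", 1)]
print(dict(child_degree_histogram(rows)))

# The four buckets reported above add up to the 226 NUM tokens.
assert 198 + 7 + 12 + 9 == 226
```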
Children of NUM nodes are attached using 13 different relations: grc-dep/conj (15; 20% instances), grc-dep/cc (14; 19% instances), grc-dep/advmod (12; 16% instances), grc-dep/cop (9; 12% instances), grc-dep/punct (9; 12% instances), grc-dep/nmod (4; 5% instances), grc-dep/nsubj (3; 4% instances), grc-dep/appos (2; 3% instances), grc-dep/case (2; 3% instances), grc-dep/det (2; 3% instances), grc-dep/amod (1; 1% instances), grc-dep/mark (1; 1% instances), grc-dep/nummod (1; 1% instances)
Children of NUM nodes belong to 12 different parts of speech: VERB (14; 19% instances), CCONJ (12; 16% instances), ADJ (11; 15% instances), NUM (10; 13% instances), PUNCT (9; 12% instances), ADV (6; 8% instances), PART (5; 7% instances), ADP (2; 3% instances), DET (2; 3% instances), NOUN (2; 3% instances), PRON (1; 1% instances), SCONJ (1; 1% instances)
Treebank Statistics (UD_Ancient_Greek-PROIEL)
There are 70 NUM lemmas (1%), 164 NUM types (1%) and 1516 NUM tokens (1%).
Out of 14 observed tags, the rank of NUM is: 6 in number of lemmas, 8 in number of types and 12 in number of tokens.
The 10 most frequent NUM lemmas: εἷς, δύο, τρεῖς, ἑπτά, δώδεκα, τέσσαρες, δέκα, πέντε, ἑκατόν, εἴκοσι
The 10 most frequent NUM types: δύο, εἷς, ἑπτὰ, δώδεκα, δέκα, ἓν, τρεῖς, πέντε, ἕνα, μίαν
The 10 most frequent ambiguous lemmas: εἷς (NUM 388, ADJ 1), διακόσιοι (NUM 22, ADJ 2), καὶ (NUM 15, ADJ 1), τετρακόσιοι (NUM 15, ADJ 2), δέκατος (ADJ 8, NUM 2), τε (CCONJ 1245, ADV 35, NUM 2), μυρίος (ADJ 7, NUM 1), χίλιος (ADJ 1, NUM 1)
The 10 most frequent ambiguous types: εἷς (NUM 99, ADP 2), ἑνὶ (NUM 25, ADJ 1), καὶ (CCONJ 9688, ADV 1119, NUM 15, ADJ 1), τε (CCONJ 1233, ADV 33, NUM 2), τετρακόσιοι (NUM 2, ADJ 1), εἶς (AUX 6, NUM 1), τετρακοσίας (NUM 1, ADJ 1)
- εἷς
- ἑνὶ
  - NUM 25: ἑνὶ δὲ ἑκάστῳ αὐτῶν οὔνομα οὐδὲν κέεται
  - ADJ 1: Αἰγινῆταί τε δὴ ἐδηίουν τῆς Ἀττικῆς τὰ παραθαλάσσια καὶ Ἀθηναίοισι ὁρμημένοισι ἐπ’ Αἰγινήτας στρατεύεσθαι ἦλθε μαντήιον ἐκ Δελφῶν ἐπισχόντας ἀπὸ τοῦ Αἰγινητέων ἀδικίου τριήκοντα ἔτεα τῷ ἑνὶ καὶ τριηκοστῷ Αἰακῷ τέμενος ἀποδέξαντας ἄρχεσθαι τοῦ πρὸς Αἰγινήτας πολέμου καί σφι χωρήσειν τὰ βούλονται
- καὶ
  - CCONJ 9688: καὶ μετὰ ταῦτα αὐτίκα παρῆν καὶ ἡ γυνή
  - ADV 1119: καὶ μετὰ ταῦτα αὐτίκα παρῆν καὶ ἡ γυνή
  - NUM 15: οὗτοι οἱ πάντες σταθμοί εἰσι ἕνδεκα καὶ ἑκατόν
  - ADJ 1: Αἰγινῆταί τε δὴ ἐδηίουν τῆς Ἀττικῆς τὰ παραθαλάσσια καὶ Ἀθηναίοισι ὁρμημένοισι ἐπ’ Αἰγινήτας στρατεύεσθαι ἦλθε μαντήιον ἐκ Δελφῶν ἐπισχόντας ἀπὸ τοῦ Αἰγινητέων ἀδικίου τριήκοντα ἔτεα τῷ ἑνὶ καὶ τριηκοστῷ Αἰακῷ τέμενος ἀποδέξαντας ἄρχεσθαι τοῦ πρὸς Αἰγινήτας πολέμου καί σφι χωρήσειν τὰ βούλονται
- τε
- τετρακόσιοι
- εἶς
- τετρακοσίας
  - NUM 1: ξεῖνόν τέ σε ποιεῦμαι ἐμὸν καὶ τὰς τετρακοσίας μυριάδας τοι τῶν στατήρων ἀποπλήσω παρ’ ἐμεωυτοῦ δοὺς τὰς ἑπτὰ χιλιάδας ἵνα μή τοι ἐπιδεέες ἔωσι αἱ τετρακόσιαι μυριάδες ἑπτὰ χιλιάδων ἀλλὰ ᾖ τοι ἀπαρτιλογίη ὑπ’ ἐμέο πεπληρωμένη
  - ADJ 1: ἐπείτε γὰρ τάχιστά σε ἐπυθόμην ἐπὶ θάλασσαν καταβαίνοντα τὴν Ἑλληνίδα βουλόμενός τοι δοῦναι ἐς τὸν πόλεμον χρήματα ἐξεμάνθανον καὶ εὗρον λογιζόμενος ἀργυρίου μὲν δύο χιλιάδας ἐούσας μοι ταλάντων χρυσίου δὲ τετρακοσίας μυριάδας στατήρων Δαρεικῶν ἐπιδεούσας ἑπτὰ χιλιάδων
Morphology
The form / lemma ratio of NUM is 2.342857 (the average of all parts of speech is 3.387371).
The 1st highest number of forms (16) was observed with the lemma “εἷς”: εἶς, εἷς, μία, μίαν, μιᾶς, μιᾷ, μιῆς, μιῇ, ἐνὶ, ἑνί, ἑνός, ἑνὶ, ἑνὸς, ἓν, ἕν, ἕνα.
The 2nd highest number of forms (12) was observed with the lemma “διακόσιοι”: διακοσίας, διακοσίους, διακοσίων, διακόσιαι, διηκοσίας, διηκοσίων, διηκοσιέων, διηκόσια, διηκόσιαί, διηκόσιαι, διηκόσιοί, διηκόσιοι.
The 3rd highest number of forms (11) was observed with the lemma “τέσσαρες”: τέσσαρα, τέσσαρας, τέσσαρες, τέσσαρσιν, τέσσερα, τέσσερας, τέσσερες, τέσσερσι, τέτορες, τεσσάρων, τεσσέρων.
NUM occurs with 3 features: grc-feat/Case (768; 51% instances), grc-feat/Number (768; 51% instances), grc-feat/Gender (736; 49% instances)
NUM occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing
NUM occurs with 33 feature combinations.
The most frequent feature combination is _ (748 tokens).
Examples: δύο, ἑπτὰ, δώδεκα, δέκα, πέντε, εἴκοσι, ἑκατὸν, τριήκοντα, τεσσεράκοντα, ὀκτὼ
Relations
NUM nodes are attached to their parents using 18 different relations: grc-dep/nummod (904; 60% instances), grc-dep/nsubj (107; 7% instances), grc-dep/conj (105; 7% instances), grc-dep/obj:dir (61; 4% instances), grc-dep/flat (59; 4% instances), grc-dep/obl (57; 4% instances), grc-dep/orphan (46; 3% instances), grc-dep/root (39; 3% instances), grc-dep/iobj (32; 2% instances), grc-dep/appos (30; 2% instances), grc-dep/nmod (19; 1% instances), grc-dep/nsubj:pass (17; 1% instances), grc-dep/xcomp (15; 1% instances), grc-dep/advcl (13; 1% instances), grc-dep/ccomp (4; 0% instances), grc-dep/obl:agent (4; 0% instances), grc-dep/advmod (3; 0% instances), grc-dep/csubj:pass (1; 0% instances)
Parents of NUM nodes belong to 11 different parts of speech: NOUN (892; 59% instances), VERB (286; 19% instances), NUM (176; 12% instances), ADJ (66; 4% instances), ROOT (39; 3% instances), PROPN (18; 1% instances), PRON (16; 1% instances), ADV (12; 1% instances), ADP (6; 0% instances), AUX (4; 0% instances), SCONJ (1; 0% instances)
993 (66%) NUM nodes are leaves.
262 (17%) NUM nodes have one child.
151 (10%) NUM nodes have two children.
110 (7%) NUM nodes have three or more children.
The highest child degree of a NUM node is 11.
Children of NUM nodes are attached using 18 different relations: grc-dep/nmod (162; 16% instances), grc-dep/det (134; 13% instances), grc-dep/cc (130; 13% instances), grc-dep/conj (117; 12% instances), grc-dep/case (76; 7% instances), grc-dep/flat (59; 6% instances), grc-dep/orphan (56; 6% instances), grc-dep/cop (53; 5% instances), grc-dep/nsubj (48; 5% instances), grc-dep/advmod (46; 5% instances), grc-dep/discourse (44; 4% instances), grc-dep/appos (27; 3% instances), grc-dep/acl (18; 2% instances), grc-dep/obl (17; 2% instances), grc-dep/amod (10; 1% instances), grc-dep/mark (9; 1% instances), grc-dep/advcl (7; 1% instances), grc-dep/nummod (2; 0% instances)
Children of NUM nodes belong to 12 different parts of speech: NUM (176; 17% instances), CCONJ (139; 14% instances), NOUN (139; 14% instances), DET (134; 13% instances), ADV (92; 9% instances), ADP (77; 8% instances), ADJ (70; 7% instances), AUX (54; 5% instances), PRON (54; 5% instances), VERB (42; 4% instances), PROPN (29; 3% instances), SCONJ (9; 1% instances)