NOUN: noun
Description
Nouns are a part of speech typically denoting a person, place, thing, animal or idea.
The NOUN tag is intended for common nouns only. See PROPN for proper nouns and PRON for pronouns.
Common nouns
Irish nouns are either masculine or feminine, and inflect for case and number. In Old Irish, nouns inflected for nominative, accusative and dative cases. All of these are now represented in Modern Irish by what is referred to as the ‘common’ case. Pronoun forms are an exception, since their subject and object forms differ: sé “he” / é “him”.
The genitive case and vocative case are marked by inflection.
Each noun falls into one of five declensions.
Verbal nouns
Verbal nouns are marked as NOUN in the UD scheme. Verbal noun forms are used widely in Irish for the infinitive form (a dhéanamh “to do”) and progressive aspectual phrases (ag déanamh “doing/making”).
Examples
Common nouns
- fear “man”
- hata an fhir “the man’s hat”
- a mhúinteoir “teacher!”
Verbal nouns
- ag déanamh iarracht “making an attempt”
- beidh orthu gnó a dhéanamh leis “they will have to do business with him”
- scannán a dhéanamh “to make a movie”
- tá obair le déanamh “there is work to be done”
Treebank Statistics (UD_Irish)
There are 1590 NOUN lemmas (55%), 2254 NOUN types (56%) and 3730 NOUN tokens (27%).
Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
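The three counts above can be reproduced from any list of annotated tokens. The following is a minimal sketch, assuming that a "type" is a distinct word form (no case folding) and using a small hand-made sample rather than the UD_Irish data; the name `pos_counts` is hypothetical.

```python
# Toy (form, lemma, upos) triples for illustration only; NOT the UD_Irish treebank.
TOKENS = [
    ("bliain", "bliain", "NOUN"),
    ("bliana", "bliain", "NOUN"),
    ("bliain", "bliain", "NOUN"),
    ("alt", "alt", "NOUN"),
    ("ailt", "alt", "NOUN"),
    ("go", "go", "PART"),
    ("go", "go", "ADP"),
]


def pos_counts(tokens, upos):
    """Return (lemma count, type count, token count) for one UPOS tag.

    Assumption: a type is a distinct surface form, compared as written.
    """
    selected = [(form, lemma) for form, lemma, tag in tokens if tag == upos]
    lemmas = {lemma for _, lemma in selected}
    types = {form for form, _ in selected}
    return len(lemmas), len(types), len(selected)


noun_lemmas, noun_types, noun_tokens = pos_counts(TOKENS, "NOUN")
print(noun_lemmas, noun_types, noun_tokens)
```

On a real treebank the triples would come from the FORM, LEMMA and UPOS columns of the CoNLL-U file.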
The 10 most frequent NOUN lemmas: duine, cur, bliain, cuid, alt, rud, bheith, ceann, déanamh, lá
The 10 most frequent NOUN types: bheith, chur, rud, duine, dhéanamh, daoine, leith, chuid, fáil, lá
The 10 most frequent ambiguous lemmas: bheith (NOUN 28, ADV 1), ceann (NOUN 27, ADP 6), áit (NOUN 20, SCONJ 1), dóigh (NOUN 11, VERB 1), aghaidh (NOUN 9, ADP 5), ceart (NOUN 9, ADJ 7), measc (NOUN 9, VERB 2), mac (NOUN 7, PART 4), meas (VERB 6, NOUN 6), mí (NOUN 6, PROPN 2)
The 10 most frequent ambiguous types: bheith (NOUN 28, ADV 1), áit (NOUN 13, SCONJ 1), ceann (NOUN 13, ADP 6), aghaidh (NOUN 7, ADP 5), ceart (NOUN 5, ADJ 2), go (PART 142, ADP 42, SCONJ 12, NOUN 5, ADV 2, ADJ 1), leor (NOUN 5, ADV 2, ADJ 1), nós (NOUN 5, ADP 4), réir (NOUN 5, ADP 5, SCONJ 2), cionn (NOUN 4, ADP 3)
- bheith
  - NOUN 28: ’ Ach caithfidh mise a bheith ar shiúl chun an tsiopa .
  - ADV 1: Tá sé ráite ag tráchtairí áirithe , agus go minic acu siúd ar mhian leo caitheamh anuas ar Chonradh na Gaeilge agus díspeagadh a dhéanamh ar ghluaiseacht an náisiúnachais chultúrtha i gcoitinne in Éirinn , gur gluaiseacht neo-thraidisiúnta thar a bheith coimeádach amach is amach a bhí ann .
- áit
- ceann
- aghaidh
- ceart
- go
  - PART 142: Níor chúis iontais í go hiomlán gur ghlaoigh sé ar Melissa .
  - ADP 42: Agus as go brách leo .
  - SCONJ 12: Bhí ráfla ann go mb’ fhéidir go dtiocfadh Pat An Cóp Ó Gallchóir ar ais ón Eoraip le polaitíocht na Dála a imirt arís .
  - NOUN 5: Bhí scaifte maith sa tábhairne seo anois , an ceann amháin i mBaile an Chaolais ach ar ndóigh go leor óstáin .
  - ADV 2: Tá an éide scoile s’ againne maith go leor anseo i Scoil Naomh Pól .
  - ADJ 1: Ar ndóigh bhí an tír suaite go leor i rith an am seo go léir .
- leor
- nós
- réir
  - NOUN 5: (2) Más rud é , áfach , go ndéanfar an t-aistriú gan chomaoin luachmhar , beidh sé , chomh fada is a bhaineann leis an aistrí agus le daoine a bheidh ag éileamh faoi ar dhóigh seachas ar chomaoin luachmhar , faoi réir gach cirt neamhchláraithe a raibh an talamh a aistreofar ar teachtadh faoina réir ag an aistreoir .
  - ADP 5: Éileofar air nó uirthi , áfach , taisteal chuig an Ionad de réir mar is gá .
  - SCONJ 2: (4) D’ fhonn amhras a sheachaint , aon achtachán a ndéantar leasú air le halt den Acht seo a scoireann de bheith i ngníomh amhail ar an agus ón lá dá dtagraítear i bhfo-alt (1) nó , de réir mar a bheidh , amhail ar agus ó dháta éagtha na tréimhse ar lena linn a choimeádfar i ngníomh é faoi fho-alt (2) ( ‘ an t-éag ‘ ) , beidh feidhm aige agus beidh éifeacht leis amhail ar an agus ón lá sin nó , de réir mar a bheidh , amhail ar an agus ón éag , mar a bhí feidhm aige agus éifeacht leis díreach roimh dháta an Achta seo a rith ach sin faoi réir aon leasuithe a dhéanfar le haon Acht eile den Oireachtas tar éis an dáta rite sin .
- cionn
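An ambiguous type is a word form that occurs with more than one tag. The following is a minimal sketch of how such a list could be derived, using a hand-made sample rather than the treebank counts; the function name `ambiguous_types` is hypothetical.

```python
from collections import Counter, defaultdict

# Toy (form, upos) pairs for illustration only; the counts are NOT those of UD_Irish.
TAGGED = [
    ("go", "PART"),
    ("go", "PART"),
    ("go", "ADP"),
    ("réir", "NOUN"),
    ("réir", "ADP"),
    ("réir", "NOUN"),
    ("lá", "NOUN"),
]


def ambiguous_types(tagged):
    """Map each form seen with more than one tag to its (tag, count) list,
    most frequent tag first."""
    tags_by_form = defaultdict(Counter)
    for form, upos in tagged:
        tags_by_form[form][upos] += 1
    return {
        form: tags.most_common()
        for form, tags in tags_by_form.items()
        if len(tags) > 1
    }


for form, tags in ambiguous_types(TAGGED).items():
    print(form, ", ".join(f"{tag} {count}" for tag, count in tags))
```

Grouping by lemma instead of form would give the ambiguous lemmas in the same way.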
Morphology
The form / lemma ratio of NOUN is 1.417610 (the average of all parts of speech is 1.393750).
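The ratio reported above appears to be the number of NOUN types divided by the number of NOUN lemmas given under Treebank Statistics. A short check of that arithmetic, with the helper name `form_lemma_ratio` chosen here for illustration:

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_forms / n_lemmas


# 2254 NOUN types and 1590 NOUN lemmas, as reported under Treebank Statistics.
ratio = form_lemma_ratio(2254, 1590)
print(f"{ratio:.6f}")
```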
The 1st highest number of forms (7) was observed with the lemma “alt”: ailt, alt, bhfo-alt, fho-ailt, fho-alt, fo-alt, halt.
The 2nd highest number of forms (7) was observed with the lemma “bliain”: bhliain, bliain, bliana, blianta, mblian, mbliana, mblianta.
The 3rd highest number of forms (7) was observed with the lemma “ceann”: ceann, cheann, chinn, chionn, cinn, cionn, gcionn.
NOUN occurs with 11 features: ga-feat/Number (3270; 88% instances), ga-feat/Case (3187; 85% instances), ga-feat/Gender (3173; 85% instances), ga-feat/Form (926; 25% instances), ga-feat/Definite (601; 16% instances), ga-feat/VerbForm (442; 12% instances), ga-feat/NounType (118; 3% instances), ga-feat/PartType (14; 0% instances), ga-feat/Foreign (3; 0% instances), ga-feat/Degree (1; 0% instances), ga-feat/Reflex (1; 0% instances)
NOUN occurs with 22 feature-value pairs: Case=Dat, Case=Gen, Case=NomAcc, Case=Voc, Definite=Def, Definite=Ind, Degree=Pos, Foreign=Yes, Form=Ecl, Form=Emp, Form=HPref, Form=Len, Gender=Fem, Gender=Masc, NounType=Strong, NounType=Weak, Number=Plur, Number=Sing, PartType=Comp, Reflex=Yes, VerbForm=Inf, VerbForm=Vnoun
NOUN occurs with 75 feature combinations.
The most frequent feature combination is Case=NomAcc|Gender=Masc|Number=Sing (738 tokens).
Examples: rud, duine, ceann, lá, lucht, measc, alt, am, deireadh, fad
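Feature statistics of this kind can be derived from the FEATS column, where pairs are joined with `|`. The following is a minimal sketch on hand-made FEATS strings (not treebank data). Leaving the empty value `_` out of the combination count is an assumption, and `parse_feats` is a hypothetical helper.

```python
from collections import Counter

# Toy FEATS values for NOUN tokens; illustrative only, NOT drawn from UD_Irish.
FEATS = [
    "Case=NomAcc|Gender=Masc|Number=Sing",
    "Case=NomAcc|Gender=Masc|Number=Sing",
    "Case=Gen|Gender=Fem|Number=Sing",
    "VerbForm=Vnoun",
    "_",
]


def parse_feats(feats):
    """Split a FEATS string into a {feature: value} dict ('_' means no features)."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Assumption: tokens without features do not count as a combination.
combination_counts = Counter(f for f in FEATS if f != "_")
feature_counts = Counter(name for f in FEATS for name in parse_feats(f))

print(combination_counts.most_common(1))
print(feature_counts)
```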
Relations
NOUN nodes are attached to their parents using 23 different relations: ga-dep/compound (667; 18% instances), ga-dep/nmod (645; 17% instances), ga-dep/obl (591; 16% instances), ga-dep/nsubj (488; 13% instances), ga-dep/obj (338; 9% instances), ga-dep/xcomp (320; 9% instances), ga-dep/conj (251; 7% instances), ga-dep/xcomp:pred (151; 4% instances), ga-dep/root (89; 2% instances), ga-dep/appos (41; 1% instances), ga-dep/ccomp (30; 1% instances), ga-dep/advmod (26; 1% instances), ga-dep/obl:tmod (20; 1% instances), ga-dep/mark:prt (18; 0% instances), ga-dep/advcl (15; 0% instances), ga-dep/acl:relcl (8; 0% instances), ga-dep/fixed (8; 0% instances), ga-dep/mark (8; 0% instances), ga-dep/nummod (4; 0% instances), ga-dep/vocative (4; 0% instances), ga-dep/amod (3; 0% instances), ga-dep/csubj:cop (3; 0% instances), ga-dep/parataxis (2; 0% instances)
Parents of NOUN nodes belong to 16 different parts of speech: NOUN (1716; 46% instances), VERB (1525; 41% instances), ADJ (150; 4% instances), ROOT (89; 2% instances), ADP (58; 2% instances), PRON (47; 1% instances), PROPN (45; 1% instances), ADV (30; 1% instances), X (21; 1% instances), NUM (16; 0% instances), SCONJ (10; 0% instances), AUX (9; 0% instances), CCONJ (7; 0% instances), PART (4; 0% instances), PUNCT (2; 0% instances), DET (1; 0% instances)
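The relation and parent counts above come from the HEAD and DEPREL annotation of each NOUN node. The following is a minimal sketch on the phrase hata an fhir, with a dependency analysis assumed here for illustration (not copied from the treebank). A head of 0 is reported as ROOT, matching the ROOT entry in the parent list; `noun_attachments` is a hypothetical name.

```python
from collections import Counter

# (id, form, upos, head, deprel); assumed analysis of "hata an fhir", illustration only.
SENTENCE = [
    (1, "hata", "NOUN", 0, "root"),
    (2, "an", "DET", 3, "det"),
    (3, "fhir", "NOUN", 1, "nmod"),
]


def noun_attachments(sentence):
    """Count incoming relations and parent tags of NOUN nodes.

    A head of 0 has no token, so its tag is reported as "ROOT".
    """
    upos_by_id = {tok_id: upos for tok_id, _, upos, _, _ in sentence}
    relations, parents = Counter(), Counter()
    for _, _, upos, head, deprel in sentence:
        if upos != "NOUN":
            continue
        relations[deprel] += 1
        parents[upos_by_id.get(head, "ROOT")] += 1
    return relations, parents


relations, parents = noun_attachments(SENTENCE)
print(relations)
print(parents)
```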
508 (14%) NOUN nodes are leaves.
1124 (30%) NOUN nodes have one child.
1189 (32%) NOUN nodes have two children.
909 (24%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 18.
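The child degree of a node is the number of tokens whose HEAD points to it. The following is a minimal sketch of the bucketing used above, again on an assumed analysis of hata an fhir rather than on treebank data; `noun_child_degrees` and `bucket` are hypothetical names.

```python
from collections import Counter

# (id, form, upos, head); assumed analysis of "hata an fhir", illustration only.
SENTENCE = [
    (1, "hata", "NOUN", 0),
    (2, "an", "DET", 3),
    (3, "fhir", "NOUN", 1),
]


def noun_child_degrees(sentence):
    """Map the id of each NOUN node to its number of children."""
    children = Counter(head for _, _, _, head in sentence)
    return {
        tok_id: children[tok_id]
        for tok_id, _, upos, _ in sentence
        if upos == "NOUN"
    }


def bucket(degree):
    """Name the bucket a child degree falls into."""
    names = {0: "leaf", 1: "one child", 2: "two children"}
    return names.get(degree, "three or more children")


degrees = noun_child_degrees(SENTENCE)
print(Counter(bucket(d) for d in degrees.values()))
print(max(degrees.values()))
```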
Children of NOUN nodes are attached using 34 different relations: ga-dep/case (1551; 23% instances), ga-dep/det (1084; 16% instances), ga-dep/compound (783; 11% instances), ga-dep/nmod (663; 10% instances), ga-dep/punct (392; 6% instances), ga-dep/amod (367; 5% instances), ga-dep/acl:relcl (246; 4% instances), ga-dep/conj (234; 3% instances), ga-dep/cc (221; 3% instances), ga-dep/mark (208; 3% instances), ga-dep/obj (160; 2% instances), ga-dep/nummod (153; 2% instances), ga-dep/nmod:poss (117; 2% instances), ga-dep/advmod (104; 2% instances), ga-dep/xcomp (92; 1% instances), ga-dep/obl:prep (82; 1% instances), ga-dep/cop (76; 1% instances), ga-dep/nsubj (58; 1% instances), ga-dep/ccomp (48; 1% instances), ga-dep/appos (43; 1% instances), ga-dep/xcomp:pred (25; 0% instances), ga-dep/advcl (20; 0% instances), ga-dep/csubj:cleft (20; 0% instances), ga-dep/csubj:cop (12; 0% instances), ga-dep/mark:prt (12; 0% instances), ga-dep/fixed (8; 0% instances), ga-dep/flat:name (8; 0% instances), ga-dep/case:voc (6; 0% instances), ga-dep/list (6; 0% instances), ga-dep/obl:tmod (6; 0% instances), ga-dep/vocative (4; 0% instances), ga-dep/compound:prt (2; 0% instances), ga-dep/flat (2; 0% instances), ga-dep/parataxis (2; 0% instances)
Children of NOUN nodes belong to 16 different parts of speech: NOUN (1716; 25% instances), ADP (1671; 25% instances), DET (1173; 17% instances), ADJ (419; 6% instances), PUNCT (394; 6% instances), VERB (324; 5% instances), CCONJ (244; 4% instances), PROPN (188; 3% instances), PART (180; 3% instances), NUM (133; 2% instances), PRON (106; 2% instances), X (87; 1% instances), AUX (77; 1% instances), ADV (69; 1% instances), SCONJ (33; 0% instances), SYM (1; 0% instances)