home sv/pos edit page issue tracker

This page still pertains to UD version 1.

PROPN: proper noun

Definition

A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object.

In Swedish proper nouns differ from common nouns in inflecting only for case, not for definiteness or number.

Examples


Treebank Statistics (UD_Swedish)

There are 458 PROPN lemmas (5%), 490 PROPN types (4%) and 1118 PROPN tokens (1%). Out of 16 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: Sverige, EEC, Stockholm, Gud, Göteborg, Horn, USA, Danmark, Malmö, Kristus

The 10 most frequent PROPN types: Sverige, EEC, Stockholm, Sveriges, EEC:s, Göteborg, Horn, Gud, USA, Danmark

The 10 most frequent ambiguous lemmas: Björn (PROPN 6, NOUN 1), de (PRON 381, DET 71, PROPN 2), vi (PRON 324, DET 111, PROPN 2, NOUN 1), väckarklocka (NOUN 2, PROPN 2), Children (NOUN 1, PROPN 1), SKB (PROPN 1, NOUN 1), backa (VERB 1, PROPN 1), hand (NOUN 41, PROPN 1), väst (NOUN 12, PROPN 1), ännu (ADV 38, PROPN 1)

The 10 most frequent ambiguous types: Björn (PROPN 6, NOUN 1), Vi (PRON 61, PROPN 2), de (DET 396, PRON 227, PROPN 2), Children (PROPN 1, NOUN 1), Handelsbanken (PROPN 1, NOUN 1), I (ADP 209, NOUN 1, ADJ 1, NUM 1, PROPN 1), SKB (NOUN 1, PROPN 1), Ännu (ADV 4, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.069869 (the average of all parts of speech is 1.407742).

The 1st highest number of forms (3) was observed with the lemma “Göteborg”: Göteborg, Göteborgs, Göteborgs-.

The 2nd highest number of forms (2) was observed with the lemma “Bales”: Bales, Bales’.

The 3rd highest number of forms (2) was observed with the lemma “Belgien”: Belgien, Belgiens.

PROPN occurs with 1 features: sv-feat/Case (1095; 98% instances)

PROPN occurs with 2 feature-value pairs: Case=Gen, Case=Nom

PROPN occurs with 3 feature combinations. The most frequent feature combination is Case=Nom (960 tokens). Examples: Sverige, EEC, Stockholm, Göteborg, Horn, Gud, USA, Danmark, Malmö, Kristus

Relations

PROPN nodes are attached to their parents using 22 different relations: sv-dep/conj (223; 20% instances), sv-dep/nsubj (186; 17% instances), sv-dep/obl (173; 15% instances), sv-dep/nmod (153; 14% instances), sv-dep/flat:name (130; 12% instances), sv-dep/nmod:poss (119; 11% instances), sv-dep/appos (35; 3% instances), sv-dep/root (30; 3% instances), sv-dep/obj (23; 2% instances), sv-dep/obl:agent (16; 1% instances), sv-dep/nsubj:pass (5; 0% instances), sv-dep/parataxis (5; 0% instances), sv-dep/advcl (4; 0% instances), sv-dep/acl (3; 0% instances), sv-dep/dislocated (3; 0% instances), sv-dep/iobj (2; 0% instances), sv-dep/orphan (2; 0% instances), sv-dep/xcomp (2; 0% instances), sv-dep/amod (1; 0% instances), sv-dep/case (1; 0% instances), sv-dep/compound (1; 0% instances), sv-dep/csubj:pass (1; 0% instances)

Parents of PROPN nodes belong to 10 different parts of speech: PROPN (358; 32% instances), VERB (344; 31% instances), NOUN (330; 30% instances), ADJ (39; 3% instances), ROOT (30; 3% instances), ADV (10; 1% instances), ADP (3; 0% instances), PRON (2; 0% instances), NUM (1; 0% instances), PUNCT (1; 0% instances)

364 (33%) PROPN nodes are leaves.

471 (42%) PROPN nodes have one child.

137 (12%) PROPN nodes have two children.

146 (13%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 33.

Children of PROPN nodes are attached using 22 different relations: sv-dep/case (319; 23% instances), sv-dep/punct (234; 17% instances), sv-dep/conj (219; 16% instances), sv-dep/nmod (170; 12% instances), sv-dep/flat:name (131; 10% instances), sv-dep/cc (100; 7% instances), sv-dep/advmod (33; 2% instances), sv-dep/amod (27; 2% instances), sv-dep/nummod (23; 2% instances), sv-dep/obl (21; 2% instances), sv-dep/appos (20; 1% instances), sv-dep/acl:relcl (19; 1% instances), sv-dep/mark (11; 1% instances), sv-dep/orphan (10; 1% instances), sv-dep/det (9; 1% instances), sv-dep/parataxis (6; 0% instances), sv-dep/acl (3; 0% instances), sv-dep/cop (2; 0% instances), sv-dep/fixed (2; 0% instances), sv-dep/nsubj (2; 0% instances), sv-dep/xcomp (2; 0% instances), sv-dep/nmod:poss (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: PROPN (358; 26% instances), ADP (321; 24% instances), PUNCT (234; 17% instances), NOUN (196; 14% instances), CCONJ (100; 7% instances), NUM (39; 3% instances), ADV (38; 3% instances), ADJ (32; 2% instances), VERB (19; 1% instances), SYM (10; 1% instances), DET (9; 1% instances), PRON (4; 0% instances), AUX (2; 0% instances), SCONJ (2; 0% instances)


Treebank Statistics (UD_Swedish-LinES)

There are 1 PROPN lemmas (6%), 622 PROPN types (5%) and 2158 PROPN tokens (3%). Out of 17 observed tags, the rank of PROPN is: 12 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: _

The 10 most frequent PROPN types: Harry, Quinn, Stillman, Access, Bray, Auster, Microsoft, Ron, Weasley, Mweta

The 10 most frequent ambiguous lemmas: _ (NOUN 11462, VERB 8134, ADP 7148, PUNCT 6980, PRON 6631, ADV 4925, ADJ 4444, DET 3508, AUX 2803, CCONJ 2463, SCONJ 2183, PROPN 2158, PART 1442, NUM 339, INTJ 143, X 15, SYM 9)

The 10 most frequent ambiguous types: Le (PROPN 3, DET 2), SA (PROPN 3, X 1), Web (NOUN 6, PROPN 3), van (ADJ 4, PROPN 2), C (NOUN 2, PROPN 1), Importera (VERB 4, PROPN 1), Park (NOUN 1, PROPN 1), Style (PROPN 1, NOUN 1), Via (ADP 1, PROPN 1), Visual (ADJ 2, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 622.000000 (the average of all parts of speech is 693.647059).

The 1st highest number of forms (622) was observed with the lemma “_”: .xml, .xsd, ADOX, ANSI, ANSI-89, ANSI-92, Aaron, Absaloms, Access, Adam, Adamson, Afrika, Airways, Albanien, Alex, Alexandra, Allen, Amerika, Amerikas, Amin, Amins, Amsterdam, Andrej, Andrew, Anna, Anställda, Antal, Anwar, Apollo, Arafat, Arafats, Armenien, Arthur, Asahe, Asahes, Athen, Auerbach, Auschwitz, Auster, Austers, Australien, Autofilter, Avenue, BBC, BRAY, Balkan, Barry, Bartlebys, Barzanti, Bashi, Basic, Bassam, Bastiljen, Bayley, Bayleys, Belzecs, Ben-Gurion, Bernie, Bill, Blooms, Blotts, Blums, Bohéme, Bonino, Booz, Borgin, Borgins, Boston, Bray, British, Brittan, Broadway, Broek, Bull, Burkes, Burma, C, CIA, CTRL, Cabrols, Caesarea, Cambridge, Cascading, Cervantes, Charlie, Chicago, Christine, Clough, Coco, Compson, Compsons, ConnectionFile, ConnectionString, Copeland, Cornelissen, Cox, Cresson, Cummings, Curtis, Cypern, Cyperns, Cyprian, DISTINCT, Daily, Daladiers, Damaskus, Dando, Dandos, Dandy-Roly, Daniel, Danmark, David, Davis, Deptford, Derrick, Diagongränden, Dick, Dobby, Dobbys, Dolorosa, Don, Donnay, Doris, Dostojevskij, Dostojevskijs, Draco, Drake, Drive, Dudley, Dudleys, Dumbledore, Dursley, Dursleys, EDU:s, EE, EU, EU:s, Edouard, Edward, Edwards, Egypten, Eichelberger, Eichelbergers, Elie, End, Enghien, England, Englands, Entebbe, Erebus, Erith, Errols, Escola, Essen, Etiopien, Eugene, Europa, Europas, Europol, Evelyn, Evelyns, Excel, Explorer, F, FFP, FFP:s, FN-ambassadör, Faulkner, Ferte, Festus, Festus’, Fischler, Fitzsimons, Flandern, Flats, Fleet, Flourish, Forest, Forsyth, Forsyth-Byggen, Fort, Frances, Franklin, Frankrike, Frankrikes, Fred, Freds, Fricks, FrontPage, Förenta, GRANT, GYLLENROY, Gaeta, Gai-Hinnom, Gala, Gallierna, Gehenna, George, Georges, Getsemanes, Gibraltar, Ginny, Glase, Golden, Graham, Gran, Granger, Grangers, Grants, Gravesend, Gray’s, Green, Greenwich, Grekland, Gringotts, Gruson, Gryffindor, Gwenzi, Gwenzis, Gyllenroy, HTML, Hagrid, Haifa, Hamilton, Hannas, Harold, Harry, Harrys, Harvey, Harveys, Hatzidakis, Haug, Heathrow, Hedwig, Hedwigs, Heights, Henry, Hereford, Herman, Hermione, Hernando, Hind, Hindenburg, Hitler, Hjalmar, Hogwarts, Hongkong, Hopkirk, Howard, Hubert, Hudsonfloden, Importera, Inn, Internet, Ionesco, Israel, Israels, Italien, J, Jacques, Jaffa, Jakob, Jakov, Jalta, James, Jan-Feb, Japan, Jason, Jefferson, Jeremia, Jersey, Jerusalem, Jerusalems, Jesus, Jet, Jo-Ann, Johannes, John, Jordan, Josafats, Joseph, Joubert, Joyces, Judéens, Kabul, Kairo, Kano, Katherine, Kedourie, Kedouries, Kellet-Bowman, Kennedy, Kensington, Kente, Kenyatta, Kermit, Kina, Kinnock, Kippur, Kissinger, Kodaks, Kongo, Kreml, Kråkboet, Kuba, Kurtz, Kv3, Kvartal, LOCKMAN, Le, Ledley, Lee, Lejonporten, Leon, Lev, Leveransdatum, Linds, Little, Lockman, Lockmans, Lomas, London, Louis, Lucius, Lugard, Luncheonette, Lydda, MSDN, Macao, Mace, Machie, Madison, Mafalda, Malenga, Malfoy, Malfoys, Mallorca, Malone, Mandelstam, Manes, Manhattan, Manson, Manuel, Manyema, Mar-Apr, Margot, Maritain, Marlow, Marlows, Marocko, Martin, Mason, Masons, Maurice, Max, McGowan, Medelhavet, Medina, Mellanöstern, Melville, Melvilles, Messias, Mets, Microsoft, Miert, Miggs, Miles, Minnesota, Mishkenot, Mississippi, Moabs, Moby, Mohieddin, Molly, Monde, Morse, Moses, Mount, Mozart, Msaccess.exe, Mweta, Mwetas, NT, Nantucket, Napoleon, Nasser, Nassers, Nauvoo, Navrozov, Neapel, Neil, Neils, Nellie, New, News, Ni-vet-vem, Nicaragua, Nimbus, Nordafrika, Nordpolen, Norge, Northfield, Nz, OAS, OLE, Odara, Odaras, Oddy, Odysseus, Office, Ojala, Oklahoma, Olivia, Olivias, Onabu, Oomen-Ruijten, Order, Ortega, Osip, PATH, POTTER, Panza, Paris, Park, Paul, Percy, Peter, Peters, Pettigrew, Pettigrews, Petunia, Petunias, Pirker, Places, Platon, Polen, Polens, Popo, Port, Portugal, Post, Potsdam, Potter, Poulenc, Privet, Produkter, Quentin, Quijote, Quinn, Quinns, REVOKE, Rabin, Ramlah, Ras, Ravenna, Rebecca, Redvers, Reginald, ReportML, Rhino, Rhodesia, Rio, Riverside, Road, Roland, Roly, Rom, Ron, Ronalds, Rons, Roosevelt, Rosenberg, Ross-on-Wye, Ruhr, Ryssland, Rysslands, SA, SQL, STOA, SUD, Sadat, Sadats, Sambata, Sancho, Santer, Scabbers, Scherers, Sdot, Sellafield, Sha’ananim, Shahar, Sheets, Shinza, Shinzas, Shropshire, Sibirien, Silver, Sinjavskij, Sion, Sions, Siri, Sitting, Sjöstedt, Smith, Smiths, Sovjetunionen, Spanien, Spencer, Sperber, Sport, Sputnikbaren, St, Stendhals, Stephen, Stillman, Stillmans, Storbritannien, Stravinskij, Street, Stromboli, Style, Svartvändargränden, Sydafrika, Sydamerika, Sydney, Syrien, System, System32, Söderhavet, Talisman, Tanganyika, Tatu, Teheran, Telford, Terror, Thompson, Thompsons, Thors, TillåtBorttagning, TillåtRedigering, TillåtTillägg, Times, Timothy, Tindi, Toledo, Tolstoj, Tom, Transact-SQL, Tulsa, Tyskland, U, USA, V, Valverdes, Vatikanen, Venetia, Vermeers, Vernon, Vernons, Versailles, Via, Vietnam, Virginia, Visual, Vivien, Voldemort, Voldemorts, Västeuropa, Västprovinsen, WINNT, Walter, Warszawapaktens, Warszawas, Washington, Watergate, Weasley, Weasleys, Web, Weimarrepublikens, Wentz, Wentz’, West, William, Wilson, Wilsons, Wiltshire, Windows, Wolfsonstiftelsen, Word, Work, Wyre, XML, XP, XSD, XSLT, Yam, Yassir, Yom, York, Youngblood, Zakaria, Zion, _rapport.xml, april, den, des, februari, gud, guds, http://officeupdate.microsoft.com/, januari, juli, juni, maj, mars, november, orienten, september, sidfilnamn.bak.htm, van, Österrike, Östeuropa.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 23 different relations: sv-dep/nsubj (716; 33% instances), sv-dep/nmod (311; 14% instances), sv-dep/obl (280; 13% instances), sv-dep/flat (273; 13% instances), sv-dep/nmod:poss (188; 9% instances), sv-dep/obj (131; 6% instances), sv-dep/conj (124; 6% instances), sv-dep/root (32; 1% instances), sv-dep/vocative (29; 1% instances), sv-dep/appos (17; 1% instances), sv-dep/nsubj:pass (16; 1% instances), sv-dep/xcomp (14; 1% instances), sv-dep/dislocated (8; 0% instances), sv-dep/obl:agent (4; 0% instances), sv-dep/iobj (3; 0% instances), sv-dep/parataxis (3; 0% instances), sv-dep/advmod (2; 0% instances), sv-dep/discourse (2; 0% instances), sv-dep/acl:relcl (1; 0% instances), sv-dep/ccomp (1; 0% instances), sv-dep/compound (1; 0% instances), sv-dep/csubj (1; 0% instances), sv-dep/fixed (1; 0% instances)

Parents of PROPN nodes belong to 13 different parts of speech: VERB (1111; 51% instances), NOUN (514; 24% instances), PROPN (396; 18% instances), ADJ (48; 2% instances), ROOT (32; 1% instances), PRON (22; 1% instances), ADV (17; 1% instances), AUX (10; 0% instances), PUNCT (3; 0% instances), NUM (2; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)

1097 (51%) PROPN nodes are leaves.

660 (31%) PROPN nodes have one child.

249 (12%) PROPN nodes have two children.

152 (7%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 25 different relations: sv-dep/case (511; 30% instances), sv-dep/flat (290; 17% instances), sv-dep/nmod (265; 15% instances), sv-dep/conj (152; 9% instances), sv-dep/punct (152; 9% instances), sv-dep/cc (94; 5% instances), sv-dep/amod (54; 3% instances), sv-dep/acl:relcl (37; 2% instances), sv-dep/advmod (32; 2% instances), sv-dep/det (25; 1% instances), sv-dep/cop (24; 1% instances), sv-dep/appos (19; 1% instances), sv-dep/nsubj (11; 1% instances), sv-dep/nummod (11; 1% instances), sv-dep/mark (8; 0% instances), sv-dep/nmod:poss (8; 0% instances), sv-dep/parataxis (8; 0% instances), sv-dep/discourse (7; 0% instances), sv-dep/expl (7; 0% instances), sv-dep/aux (5; 0% instances), sv-dep/fixed (4; 0% instances), sv-dep/acl (3; 0% instances), sv-dep/dislocated (3; 0% instances), sv-dep/advcl (1; 0% instances), sv-dep/vocative (1; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: ADP (511; 30% instances), PROPN (396; 23% instances), NOUN (321; 19% instances), PUNCT (152; 9% instances), CCONJ (95; 5% instances), ADJ (55; 3% instances), VERB (51; 3% instances), ADV (30; 2% instances), AUX (29; 2% instances), PRON (26; 2% instances), DET (25; 1% instances), NUM (18; 1% instances), INTJ (8; 0% instances), SCONJ (7; 0% instances), PART (5; 0% instances), SYM (2; 0% instances), X (1; 0% instances)


PROPN in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]