Universal Dependencies
Universal Dependencies (UD) is a framework for cross-linguistically consistent grammatical annotation and an open community effort with over 200 contributors producing almost 100 treebanks in over 50 languages.
- Short introduction to UD
- UD annotation guidelines
- More information on UD:
- Query UD treebanks online:
  - SETS treebank search maintained by the University of Turku
  - PML Tree Query maintained by the Charles University in Prague
If you want to receive news about Universal Dependencies, you can subscribe to the UD mailing list.
Current UD Languages
Information about language families (and genera for families with multiple branches) is normally taken from WALS Online (IE = Indo-European).
Afrikaans treebanks
Ancient Greek treebanks
Arabic treebanks
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Luma Ateyah, Martin Popel, Daniel Zeman, Nizar Habash, Dima Taji
Basque treebanks
Belarusian treebanks
Bulgarian treebanks
Buryat treebanks
Catalan treebanks
Chinese treebanks
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Josie Li, Cheuk Ying Li, Martin Popel, Daniel Zeman, Herman Leung
Coptic treebanks
Croatian treebanks
Czech treebanks
- Contributors: Václava Kettnerová, Jan Hajič jr., Silvie Cinková, Zdeňka Urešová, Milan Straka, Jan Hajič, Jaroslava Hlaváčová, Daniel Zeman
Danish treebanks
Dutch treebanks
- Contributors: Daniel Zeman, Zdeněk Žabokrtský, Gosse Bouma, Gertjan van Noord
English treebanks
- Contributors: Natalia Silveira, Timothy Dozat, Christopher Manning, Sebastian Schuster, John Bauer, Miriam Connor, Marie-Catherine de Marneffe, Sam Bowman, Hanzhi Zhu, Daniel Galbraith
- Contributors: Yevgeni Berzak, Jessica Kenney, Carolyn Spadine, Jing Xian Wang, Lucia Lam, Keiko Sophie Mori, Sebastian Garza, Boris Katz
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Jesse Kirchner, Lorenzo Lambertino, Martin Popel, Daniel Zeman, Christopher Manning, Sebastian Schuster, Siva Reddy
Estonian treebanks
Finnish treebanks
- Contributors: Filip Ginter, Jenna Kanerva, Veronika Laippala, Anna Missilä, Stina Ojala, Sampo Pyysalo
French treebanks
- Contributors: Marie Candito, Bruno Guillaume, Teresa Lynn, Héctor Martínez Alonso, Benoit Sagot, Djamé Seddah, Eric de la Clergerie
- Contributors: Marie-Catherine de Marneffe, Bruno Guillaume, Ryan McDonald, Alane Suhr, Joakim Nivre, Matias Grioni
- Contributors: Marie Candito, Djamé Seddah, Guy Perrier, Bruno Guillaume
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Jana Strnadová, Gauthier Caron, Martin Popel, Daniel Zeman, Marie-Catherine de Marneffe
Galician treebanks
German treebanks
- Contributors: Slav Petrov, Wolfgang Seeker, Ryan McDonald, Joakim Nivre, Daniel Zeman
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Georg Rehm, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Michael Mandl, Sebastian Bank, Martin Popel, Daniel Zeman
Gothic treebanks
Greek treebanks
Hebrew treebanks
Hindi treebanks
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Esha Banerjee, Pinkey Nainwani, Martin Popel, Daniel Zeman
Hungarian treebanks
Indonesian treebanks
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Ruli Manurung, Muh Shohibussirri, Martin Popel, Daniel Zeman
Irish treebanks
Italian treebanks
- Contributors: Cristina Bosco, Alessandro Lenci, Simonetta Montemagni, Maria Simi
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Antonio Stella, Davide Rovati, Martin Popel, Daniel Zeman, Maria Simi, Manuela Sanguinetti
Japanese treebanks
- Contributors: Masayuki Asahara, Hiroshi Kanayama, Yuji Matsumoto, Yusuke Miyao, Shunsuke Mori, Takaaki Tanaka, Sumire Uematsu
- Contributors: Ryan McDonald, Joakim Nivre, Daniel Zeman, Masayuki Asahara, Hiroshi Kanayama, Yuji Matsumoto, Yusuke Miyao, Shunsuke Mori, Takaaki Tanaka, Sumire Uematsu
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Atsuko Shimada, Anna Trukhina, Martin Popel, Daniel Zeman, Hiroshi Kanayama
Kazakh treebanks
Korean treebanks
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Sookyoung Kwak, Yongseok Cho, Martin Popel, Daniel Zeman
Kurmanji treebanks
Latin treebanks
Latvian treebanks
Lithuanian treebanks
Marathi treebanks
North Sami treebanks
Norwegian treebanks
Old Church Slavonic treebanks
Persian treebanks
Polish treebanks
Portuguese treebanks
- Contributors: Cláudia Freitas, Eckhard Bick, Fabricio Chalub, Alexandre Rademaker, Livy Real, Valeria de Paiva, Daniel Zeman, Martin Popel, David Mareček, Natalia Silveira, André Martins
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Gustavo Mendonça, Larissa Rinaldi, Martin Popel, Daniel Zeman, Valeria de Paiva
Romanian treebanks
Russian treebanks
- Contributors: Kira Droganova, Olga Lyashevskaya, Daniel Zeman, Lena Shakurova, Nina Mustafina
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Tatiana Lando, Olga Loginova, Martin Popel, Daniel Zeman, Kira Droganova
Sanskrit treebanks
Serbian treebanks
Slovak treebanks
Slovenian treebanks
Spanish treebanks
- Contributors: Miguel Ballesteros, Héctor Martínez Alonso, Ryan McDonald, Elena Pascual, Natalia Silveira, Daniel Zeman, Joakim Nivre
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Hector Fernandez Alcalde, Laura Moreno Romero, Martin Popel, Daniel Zeman, Héctor Martínez Alonso
Swedish treebanks
Swedish Sign Language treebanks
Tamil treebanks
Thai treebanks
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Rattima Nitisaroj, Yanin Sawanakunanon, Martin Popel, Daniel Zeman
Turkish treebanks
- Contributors: Çağrı Çöltekin, Gülşen Cebiroğlu Eryiğit, Memduh Gökırmak, Hüner Kaşıkara, Umut Sulubacak, Francis Tyers
- Contributors: Hans Uszkoreit, Vivien Macketanz, Aljoscha Burchardt, Kim Harris, Katrin Marheinecke, Slav Petrov, Tolga Kayadelen, Mohammed Attia, Ali Elkahky, Zhuoran Yu, Emily Pitler, Saran Lertpradit, Savas Cetin, Martin Popel, Daniel Zeman, Francis Tyers, Çağrı Çöltekin
Ukrainian treebanks
Upper Sorbian treebanks
Urdu treebanks
Uyghur treebanks
Upcoming UD Languages
Amharic treebanks
Armenian treebanks
Bangla treebanks
Bengali treebanks
Cantonese treebanks
Dargwa treebanks
Erzya treebanks
Faroese treebanks
Maltese treebanks
Naija treebanks
Old French treebanks
Romansh treebanks
Somali treebanks
Sorani treebanks
Disclaimer: Our use of flags to symbolise languages is only intended as a visual enhancement of the website and should not be interpreted as a political statement in any way.
Download
The data is released through LINDAT/CLARIN.
- Version 2.0 treebanks are available at http://hdl.handle.net/11234/1-1983. 70 treebanks, 50 languages, released March 1, 2017.
- Test data 2.0 are available at http://hdl.handle.net/11234/1-2184, released May 18, 2017.
- Version 1.4 treebanks are archived at http://hdl.handle.net/11234/1-1827. 64 treebanks, 47 languages, released November 15, 2016.
- Version 1.3 treebanks are archived at http://hdl.handle.net/11234/1-1699. 54 treebanks, 40 languages, released May 15, 2016.
- Version 1.2 treebanks are archived at http://hdl.handle.net/11234/1-1548. 37 treebanks, 33 languages, released November 15, 2015.
- Version 1.1 treebanks are archived at http://hdl.handle.net/11234/LRT-1478. 19 treebanks, 18 languages, released May 15, 2015.
- Version 1.0 treebanks are archived at http://hdl.handle.net/11234/1-1464. 10 treebanks, 10 languages, released January 15, 2015.
- In general, we intend to release treebanks every six months. The v2.0 release was brought forward because it was used in the CoNLL 2017 Multilingual Parsing Shared Task.
- The next release (v2.1) is scheduled for November 15, 2017.