577 results found.
Language Type:
Multilingual
Languages:
Egyptian Arabic
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-finished
Use:
<Not Specified>
-
Paper title:Developing an Egyptian Arabic Treebank: Impact of Dialectal Morphology on Annotation and Tool Development
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Mohamed Maamouri | <Not Specified> | None | LDC | None | Linguistic Data Consortium | US |
| Author 2 | Ann Bies | Linguistic Data Consortium, University of Pennsylvania | US | Linguistic Data Consortium | US | ||
| Author 3 | Seth Kulick | <Not Specified> | None | LDC | None | Linguistic Data Consortium | US |
| Author 4 | Michael Ciul | Linguistic Data Consortium | US | ||||
| Author 5 | Nizar Habash | Center for Computational Learning Systems, Columbia University | US | ||||
| Author 6 | Ramy Eskander | Center for Computational Learning Systems, Columbia University | US | ||||
| Main Contact | Ann Bies | Linguistic Data Consortium, University of Pennsylvania | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Hungarian
Availability:
From Owner
License:
<Not Specified>
Size:
82000 sentences Production Status:
Existing-used
Use:
Language Modelling
-
Paper title:e-magyar -- A Digital Language Processing System
-
Paper track:Written
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Tamás Váradi | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 2 | Eszter Simon | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 3 | Bálint Sass | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 4 | Iván Mittelholcz | Research institute for Linguistics, Hungarian Academy of Sciences | HU |
| Author 5 | Attila Novák | MTA-PPKE Hungarian Language Technology Research Group, Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest | HU |
| Author 6 | Balázs Indig | Pázmány Péter Catholic University, Faculty of Information Technology and Bionics | HU |
| Author 7 | Richárd Farkas | University of Szeged | HU |
| Author 8 | Veronika Vincze | University of Szeged | HU |
| Main Contact | Eszter Simon | Research institute for Linguistics, Hungarian Academy of Sciences | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Croatian
Availability:
Not Applicable
License:
<Not Specified>
Size:
2800 <Not Specified>Production Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:Croatian Dependency Treebank: Recent Development and Initial Experiments
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Dasa Berovic | University of Zagreb | None |
| Author 2 | Željko Agić | University of Zagreb | None |
| Author 3 | Marko Tadić | University of Zagreb | None |
| Main Contact | Zeljko Agic | University of Zagreb | HR |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
<Not Specified>
Size:
40097 entries Production Status:
Newly created-finished
Use:
Morphological Analysis
-
Paper title:Building a Morphological Treebank for German from a Linguistic Database
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Petra Steiner | Institute of German Language (IDS) | DE |
| Author 2 | Josef Ruppenhofer | Institute for German Language | DE |
| Main Contact | Petra Steiner | Institute of German Language (IDS) | None |
Documentation:
<Not Specified>
Written
Treebank,
Language Type:
Multilingual
Languages:
Spanish
Availability:
From Owner
License:
N/A
Size:
52746 words Production Status:
Existing-updated
Use:
Discourse
-
Paper title:Developing the Bangla RST Discourse Treebank
-
Paper track:Infrastructural Issues/Large Projects
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Debopam Das | University of Potsdam | DE |
| Author 2 | Manfred Stede | University of Potsdam | DE |
| Main Contact | Debopam Das | University of Potsdam | None |
Documentation:
da Cunha, I., Torres-Moreno, J.-M., and Sierra, G. (2011). On the development of the rst spanish treebank. In Proceedings of the 5th Linguistic Annotation Workshop, LAW V ’11, pages 1–10, Stroudsburg, PA, USA. Association for Computational Linguistics.Language Type:
Multilingual
Languages:
Urdu
Availability:
<Not Specified>
License:
Open Source
Size:
199546 Production Status:
Existing-used
Use:
Multiple NLP tasks
-
Paper title:Improvised and Adaptable Statistical Morph Analyzer (SMA++)
-
Paper track:Short Paper
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Saikrishna Srirampur | IIIT Hyderabad | IN |
| Author 2 | Deepak Kumar Malladi | IIIT Hyderabad | IN |
| Author 3 | Radhika Mamidi | IIIT-Hyderabad, Professor | None |
| Main Contact | Saikrishna Srirampur | IIIT Hyderabad | None |
Documentation:
dl.acm.org/citation.cfm?id=2392773
Written
Corpus,
Language Type:
Multilingual
Languages:
Hungarian
Availability:
From Owner
License:
own
Size:
82K sentencesProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:An Empirical Evaluation of Automatic Conversion from Constituency to Dependency in Hungarian
-
Paper track:Syntax, grammar induction, syntactic and semantic parsing
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Katalin Ilona Simkó | University of Szeged | None |
| Author 2 | Veronika Vincze | University of Szeged | HU |
| Author 3 | Zsolt Szántó | University of Szeged | None |
| Author 4 | Richárd Farkas | University of Szeged | HU |
| Main Contact | Veronika Vincze | University of Szeged | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Freely Available
License:
<Not Specified>
Size:
1028 Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:Converting an HPSG-based Treebank into its Parallel Dependency-based Treebank
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Masood Ghayoomi | Freie Universitaet Berlin | DE | ||||
| Author 2 | Jonas Kuhn | Universität Stuttgart | None | University of Stuttgart | DE | Institute for Natural Language Processing, University of Stuttgart | DE |
| Main Contact | Masood Ghayoomi | Freie Universitaet Berlin | None |
Documentation:
The documentation is publicly available
Written
Treebank,
Language Type:
Multilingual
Languages:
Estonian
Availability:
Freely Available
License:
Creative Commons
Size:
400000 tokens Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Estonian Dependency Treebank: from Constraint Grammar tagset to Universal Dependencies
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Kadri Muischnek | senior researcher | EE |
| Author 2 | Kaili Müürisep | University of Tartu | EE |
| Author 3 | Tiina Puolakainen | University of Tartu | EE |
| Main Contact | Kaili Müürisep | University of Tartu | None |
Documentation:
Documentation mainly in Estonian
Written
Treebank,
Language Type:
Multilingual
Languages:
Japanese
Availability:
The original corpus is required (non-free)
License:
<Not Specified>
Size:
78 MByte (the whole UD treebanks) MByte Production Status:
Newly created-in progress
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies for Japanese
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Takaaki Tanaka | NTT CS lab | JP |
| Author 2 | Yusuke Miyao | National Instutite of Informatics | JP |
| Author 3 | Masayuki Asahara | National Institute for Japanese Language and Linguistics | JP |
| Author 4 | Sumire Uematsu | National Institute of Informatics | JP |
| Author 5 | Hiroshi Kanayama | IBM Research - Tokyo | JP |
| Author 6 | Shinsuke Mori | Kyoto University | JP |
| Author 7 | Yuji Matsumoto | Nara Institute of Science and Technology | JP |
| Main Contact | Takaaki Tanaka | NTT CS lab | None |
Documentation:
<Not Specified>




