577 results found.
Language Type:
Multilingual
Languages:
Bengali
Availability:
Not Applicable
License:
N/A
Size:
71009 <Not Specified>Production Status:
Newly created-in progress
Use:
Discourse
Paper:
N/A
Documentation:
Das, D., & Stede, M. (in progress). Developing the Bangla RST Discourse Treebank.
Written and Figures
Corpus,
Language Type:
Multilingual
Languages:
Chinese Spanish
Availability:
Freely Available
License:
Attribution Share Alike (CC BY-SA) license
Size:
39275 words Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
The documentation is available and is written in English
Written
Corpus,
Language Type:
Multilingual
Languages:
Finnish
Availability:
Freely Available
License:
<Not Specified>
Size:
17000 sentencesProduction Status:
Newly created-in progress
Use:
Language Modelling
Paper:
N/A
Documentation:
Finnish grammar corpus and dependency syntax description (In Finnish, translation in progress)Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
English
Written
Corpus,
Language Type:
Monolingual
Languages:
Japanese
Availability:
Freely Available
License:
Creative Commons Attribution 4.0 International License
Size:
4.2 MByte Production Status:
Newly created-finished
Use:
Parsing and Tagging
-
Paper title:Deep Inside-outside Recursive Autoencoder with All-span Objective
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jiong Cai | Keyaki Treebank | /N |
Documentation:
http://www.compling.jp/keyaki/manual_en/contents.html
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC User Agreement for Non-Members
Size:
None Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Learning Efficient Task-Specific Meta-Embeddings with Word Prisms
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | KC Tsiolis | Treebank-3 | /N |
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC99T42/
Speech/Written
Treebank,
Language Type:
Bilingual
Languages:
French North African Arabic
Availability:
Freely Available
License:
CC-BY-SA
Size:
1500 sentences Production Status:
Newly created-finished
Use:
Parsing and Tagging
-
Paper title:Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept - LREC
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Djamé Seddah | Narabizi Treebank | /N |
Documentation:
None
Written
Treebank,
Language Type:
Monolingual
Languages:
English
Availability:
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference
-
Paper track:Short/Syntax: Tagging, Chunking and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Nikita Kitaev | Penn Treebank | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
None Production Status:
Existing-used
Use:
Language Modelling
-
Paper title:On Importance Sampling-Based Evaluation of Latent Language Models
-
Paper track:Short/Machine Learning for NLP
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Robert L Logan IV | Penn Treebank | /N |
Documentation:
NoneLanguage Type:
Trilingual
Languages:
Egyptian Arabic English Mandarin Chinese
Availability:
The Data Will Be Published Via LDC General Catalogue
License:
<Not Specified>
Size:
2709094 words Production Status:
Newly created-finished
Use:
Parsing and Tagging
-
Paper title:Large Multi-lingual, Multi-level and Multi-genre Annotation Corpus
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Xuansong Li | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Author 2 | Martha Palmer | Department of Linguistics and Computer Science, University of Colorado | US | ||
| Author 3 | Nianwen Xue | Computer Science Department, Brandeis University | US | ||
| Author 4 | Lance Ramshaw | Raytheon BBN Technologies | US | ||
| Author 5 | Mohamed Maamouri | <Not Specified> | None | Linguistic Data Consortium, University of Pennsylvania | US |
| Author 6 | Ann Bies | <Not Specified> | None | Linguistic Data Consortium, University of Pennsylvania | US |
| Author 7 | Kathryn Conger | Department of Linguistics and Computer Science, University of Colorado | US | ||
| Author 8 | Stephen Grimes | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Author 9 | Stephanie Strassel | Linguistic Data Consortium, University of Pennsylvania | US | ||
| Main Contact | Xuansong Li | Linguistic Data Consortium, University of Pennsylvania | None |
Documentation:
<Not Specified>




