577 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
From Data Center(s)
License:
LDC
Size:
850000 tokens
Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Querying Diverse Treebanks in a Uniform Way
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 |
|---|---|---|---|---|---|
| Author 1 | Jan Štěpánek | Charles University in Prague, MFF, UFAL | None | | |
| Author 2 | Petr Pajas | Charles University in Prague | None | Charles University in Prague, MFF, UFAL | None |
| Main Contact | Jan Štěpánek | Univerzita Karlova v Praze | CZ | | |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC User Agreement for Non-Members
Size:
1000000 words
Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Character-Level Feature Extraction with Densely Connected Networks
-
Paper track:NLP engineering experiment paper
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Chanhee Lee | Korea University | KR |
| Author 2 | Young-Bum Kim | Amazon | N/A |
| Author 3 | Dongyub Lee | University of Korea at Seoul | KR |
| Author 4 | Heuiseok Lim | Korea University | N/A |
| Main Contact | Chanhee Lee | Korea University | None |
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
51447 sentences
Production Status:
Existing-used
Use:
Language Modelling
-
Paper title:MarsaGram: an excursion in the forests of parsing trees
-
Paper track:Written
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Philippe Blache | LPL | FR |
| Author 2 | Stéphane Rauzy | LPL | FR |
| Author 3 | Grégoire Montcheuil | Laboratoire Parole et Langage - UMR 7309 | FR |
| Main Contact | Grégoire Montcheuil | Laboratoire Parole et Langage - UMR 7309 | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
6.08 MByte
Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Graph-based Dependency Parsing with Bidirectional LSTM
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Poster - Tuesday
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Wenhui Wang | Institute of Computational Linguistics Dept of Computer Science & Technology, Peking University | CN |
| Author 2 | Baobao Chang | Institute of Computational Linguistics, Peking University | CN |
| Main Contact | Wenhui Wang | Institute of Computational Linguistics Dept of Computer Science & Technology, Peking University | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English
Availability:
From Data Center(s)
License:
LDC (planned)
Size:
1.2 mil <Not Specified>
Production Status:
Newly created-finished
Use:
parsing, parallel parsing, machine translation, coreference resolution, anaphora resolution, natural language generation, lexical acquisition
-
Paper title:Announcing Prague Czech-English Dependency Treebank 2.0
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jan Hajič | Charles University in Prague | None |
| Author 2 | Eva Hajičová | Charles University in Prague | None |
| Author 3 | Jarmila Panevová | Charles University in Prague | None |
| Author 4 | Petr Sgall | Charles University in Prague | None |
| Author 5 | Ondřej Bojar | Charles University in Prague | None |
| Author 6 | Silvie Cinková | Charles University in Prague | None |
| Author 7 | Eva Fučíková | Charles University in Prague | None |
| Author 8 | Marie Mikulová | Charles University in Prague | None |
| Author 9 | Petr Pajas | Charles University in Prague | None |
| Author 10 | Jan Popelka | Charles University in Prague | None |
| Author 11 | Jiří Semecký | Charles University in Prague | None |
| Author 12 | Jana Šindlerová | Charles University in Prague | None |
| Author 13 | Jan Štěpánek | Charles University in Prague | None |
| Author 14 | Josef Toman | Charles University in Prague | None |
| Author 15 | Zdeňka Urešová | Charles University in Prague | None |
| Author 16 | Zdeněk Žabokrtský | Charles University in Prague | None |
| Main Contact | Ondřej Bojar | Charles University in Prague | CZ |
Documentation:
http://ufal.mff.cuni.cz/pcedt2.0/en/documentation.html
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English
Availability:
Freely Available
License:
CC-BY-NC-SA + LDC
Size:
50K sentences
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
-
Paper title:Coreference in Prague Czech-English Dependency Treebank
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Anna Nedoluzhko | Charles University in Prague | CZ |
| Author 2 | Michal Novák | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 3 | Silvie Cinkova | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Author 4 | Marie Mikulová | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 5 | Jiří Mírovský | Charles University in Prague | CZ |
| Main Contact | Michal Novák | Charles University in Prague, Faculty of Mathematics and Physics | None |
Documentation:
yes, English & Czech, http://ufal.mff.cuni.cz/pcedt2.0
Written
Treebank,
Language Type:
Bilingual
Languages:
Czech English
Availability:
Freely Available
License:
CreativeCommons
Size:
50000 sentences
Production Status:
Existing-updated
Use:
-
Paper title:Prague Dependency Treebank - Consolidated 1.0
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marie Mikulová | Prague Czech English Dependency Treebank 2.0 | /N |
Documentation:
http://ufal.mff.cuni.cz/pcedt2.0
Written
Treebank,
Language Type:
Multilingual
Languages:
Czech English
Availability:
From Data Center(s)
License:
LDC
Size:
1000000 tokens
Production Status:
Existing-used
Use:
Lexicon Creation/Annotation
-
Paper title:Tools for Building an Interlinked Synonym Lexicon Network
-
Paper track:Written
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 | Affiliation 3 | Country 3 |
|---|---|---|---|---|---|---|---|
| Author 1 | Zdenka Uresova | Charles University in Prague | CZ | Charles University | CZ | | |
| Author 2 | Eva Fucikova | Charles University in Prague | CZ | Charles University | CZ | Charles University | N/A |
| Author 3 | Eva Hajicova | Charles University | CZ | | | | |
| Author 4 | Jan Hajic | Charles University in Prague | CZ | Charles University | CZ | | |
| Main Contact | Zdenka Uresova | Charles University | None | | | | |
Documentation:
http://ufal.mff.cuni.cz/pcedt2.0/
Written
Treebank,
Language Type:
Multilingual
Languages:
Czech English
Availability:
From Data Center(s)
License:
LDC
Size:
1000000 tokens
Production Status:
Existing-used
Use:
Lexicon Creation/Annotation
-
Paper title:Creating a Verb Synonym Lexicon Based on a Parallel Corpus
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | Affiliation 2 | Country 2 | Affiliation 3 | Country 3 |
|---|---|---|---|---|---|---|---|
| Author 1 | Zdenka Uresova | Charles University in Prague | CZ | Charles University | CZ | | |
| Author 2 | Eva Fucikova | Charles University in Prague | CZ | Charles University | CZ | Charles University | N/A |
| Author 3 | Eva Hajicova | Charles University | CZ | | | | |
| Author 4 | Jan Hajic | Charles University in Prague | CZ | Charles University | CZ | | |
| Main Contact | Zdenka Uresova | Charles University | None | | | | |
Documentation:
http://ufal.mff.cuni.cz/pcedt2.0/
Written
Dialogue dataset,
Language Type:
Monolingual
Languages:
Basque
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0)
Size:
1634 questions Other
Production Status:
Newly created-finished
Use:
Dialogue
-
Paper title:Conversational Question Answering in Low Resource Scenarios: A Dataset and Case Study for Basque
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Arantxa Otegi | ElkarHizketak v1.0 | /N |
Documentation:
None




