Written
Corpus,
Language Type:
Monolingual
Languages:
Uzbek
Availability:
From Data Center(s)
License:
Size:
4 GByte
Production Status:
Existing-used
Use:
Morphological Analysis
- Paper title: Morphological Segmentation for Low Resource Languages
- Paper track: Written/poster presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Justin Mott | BOLT LRL Uzbek representative language pack v1.0 | /N |
Documentation:
None
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
<Not Specified>
Corpus,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
<Not Specified>
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
Text, eye-tracking, electroencephalography
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
CreativeCommons
Size:
50 GByte
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
- Paper title: ZuCo 2.0: A Dataset of Physiological Recordings During Natural Reading and Annotation
- Paper track: Written/poster presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Nora Hollenstein | Zurich Cognitive Language Processing Corpus 2.0 | /N |
Documentation:
https://osf.io/2urht/wiki/home/
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English French German Hungarian Polish Spanish Swedish
Availability:
Freely Available
License:
CreativeCommons
Size:
2 MByte
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
- Paper title: Document Translation vs. Query Translation for Cross-Lingual Information Retrieval in the Medical Domain
- Paper track: Long/Information Retrieval and Text Mining
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shadi Saleh | Khresmoi Summary Translation Test Data 2.0 | /N |
Documentation:
None
Written
Tagger/Parser,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
From Owner
License:
part of The Prague Dependency Treebank 2.0, available under research licence from Institute of Formal and Applied Linguistics in Prague
Size:
12MB
Production Status:
Existing-used
Use:
Not Applicable
- Paper title: Czech Information Retrieval with Syntax-based Language Models
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Jana Straková | Charles University in Prague | None |
| Author 2 | Pavel Pecina | Charles University in Prague | None |
| Main Contact | Jana Straková | Charles University in Prague | CZ |
Documentation:
Jan Hajič. 2004. Disambiguation of Rich Inflection (Computational Morphology of Czech), volume 1. Charles University in Prague, Prague.
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech
Availability:
<Not Specified>
License:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Size:
694 MByte
Production Status:
Existing-updated
Use:
Document Classification, Text categorisation
- Paper title: Czech Text Document Corpus v 2.0
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Pavel Kral | University of West Bohemia, Dept. of Computer Science and Engineering | CZ |
| Author 2 | Ladislav Lenc | University of West Bohemia | CZ |
| Main Contact | Pavel Kral | University of West Bohemia, Dept. of Computer Science and Engineering | None |
Documentation:
<Not Specified>
Written
Treebank,
Language Type:
Monolingual
Languages:
Icelandic
Availability:
Freely Available
License:
CC-BY 4.0
Size:
1000 sentences
Production Status:
Newly created-in progress
Use:
Language Modelling
- Paper title: Creating a Parallel Icelandic Dependency Treebank from Raw Text to Universal Dependencies
- Paper track: Written/poster presentation with demo
- Paper status: Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hildur Jónsdóttir | Parallel Icelandic UD Treebank | /N |
Documentation:
None
Written
Treebank,
Language Type:
Monolingual
Languages:
Chinese
Availability:
From Owner
License:
Size:
500 documents Other
Production Status:
Existing-used
Use:
Discourse
- Paper title: Shallow Discourse Annotation for Chinese TED Talks
- Paper track: Infrastructural Issues/Large Projects/poster presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Xinyi Cai | Chinese Discourse Treebank (CDTB) | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
OpenSource
Size:
None
Production Status:
Newly created-in progress
Use:
Discourse
- Paper title: Shallow Discourse Annotation for Chinese TED Talks
- Paper track: Infrastructural Issues/Large Projects/poster presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Xinyi Cai | Discourse Treebank for Chinese | /N |
Documentation:
None




