577 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
Swedish
Availability:
Not Applicable
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Word space Modelling
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
30 MB Production Status:
Existing-used
Use:
Emotion Recognition/Generation
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian English
Availability:
Freely Available
License:
<Not Specified>
Size:
55083246 words Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Producing Monolingual and Parallel Web Corpora at the Same Time - SpiderLing and Bitextor's Love Affair
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Nikola Ljubešić | University of Zagreb | HR | ||
| Author 2 | Miquel Esplà-Gomis | Universitat d'Alacant | ES | ||
| Author 3 | Antonio Toral | Dublin City Unversity | IE | ||
| Author 4 | Sergio Ortiz Rojas | <Not Specified> | None | ||
| Author 5 | Filip Klubička | University of Zagreb | HR | ||
| Main Contact | Nikola Ljubešić | Jožef Stefan Institute | None | University of Zagreb | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
680KB Production Status:
Newly created-in progress
Use:
Temporal Reasoning
-
Paper title:TRIOS-TimeBank Corpus: Extended TimeBank Corpus with Help of Deep Understanding of Text
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Naushad UzZaman | University of Rochester | None |
| Author 2 | James Allen | University of Rochester | None |
| Main Contact | Naushad UzZaman | University of Rochester | US |
Documentation:
<Not Specified>
Speech/Written
Treebank,
Language Type:
Monolingual
Languages:
Czech
Availability:
Freely Available
License:
CreativeCommons
Size:
4000000 tokens Production Status:
Newly created-finished
Use:
-
Paper title:Prague Dependency Treebank - Consolidated 1.0
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Marie Mikulová | Prague Dependency Treebank - Consolidated 1.0 | /N |
Documentation:
https://ufal.mff.cuni.cz/pdt-c
Written
Treebank,
Language Type:
Multilingual
Languages:
Czech
Availability:
Freely Available
License:
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Size:
1128 sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Czech Legal Text Treebank 1.0
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Vincent Kríž | Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague | CZ | ||
| Author 2 | Barbora Hladka | Charles University in Prague | CZ | Charles University | CZ |
| Author 3 | Zdenka Uresova | Charles University in Prague | CZ | ||
| Main Contact | Vincent Kríž | Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague | None |
Documentation:
http://ufal.mff.cuni.cz/czech-legal-text-treebank
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
2114 surface realizations Production Status:
Newly created-finished
Use:
Natural Language Generation
Paper:
N/A
Documentation:
Yes, EnglishLanguage Type:
Multilingual
Languages:
Czech
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Czech is a morphologically rich language, so some sort of lemmatization is present in almost any NLP application for Czech.
Paper:
N/A
Documentation:
yes, in English, yesLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
1.5 GByte Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>




