7693 results found.
Składnica – a constituency and dependency treebank of Polish
Written
Treebank,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
Polish
Availability:
Freely Available
License:
GPLv3
Size:
10500 sentences Production Status:
Existing-updated
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Chinese evaluation information corpus
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Not Available
License:
<Not Specified>
Size:
6680 Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
ixa-pipe-dep-eu
Written
Tagger/Parser,
LREC2016
Expand/Collapse
Language Type:
Multilingual
Languages:
Basque
Availability:
Freely Available
License:
GPL v3
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
TimeBank
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
LDC
Size:
68.5 K words, 183 documents Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Lingua::JA::Summarize::Extract
Written
summarization module,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Summarisation
Paper:
N/A
Documentation:
<Not Specified>
HPRD50
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
92KByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
MeCab
Written
Tokenizer,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
<Not Specified>
License:
BSD/GPL/LGPL
Size:
<Not Specified> Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
Japanese
Multi-Domain Sentiment Dataset
Written
Corpus,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Sentiment
Paper:
N/A
Documentation:
<Not Specified>
T-PAS
Written
Lexicon/Corpus/Ontology,
LREC2014
Expand/Collapse
Language Type:
Multilingual
Languages:
italian
Availability:
Freely Available
License:
CreativeCommons
Size:
1000 entries Production Status:
Existing-updated
Use:
Language Modeling/Word Sense Disambiguation/Textual Entailment/Question Answering/Machine Translation
Paper:
N/A
Documentation:
<Not Specified>
OMProDat - Open Multilingual Prosodic Database
Speech/Written
Corpus,
IS2013
Expand/Collapse
Previous
|
Next
Language Type:
Trilingual
Languages:
English Mandarin Chinese french
Availability:
Freely Available
License:
OpenSource
Size:
5 GByte Production Status:
Newly created-finished
Use:
Analysis of Speech Prosody
Paper:
N/A
Documentation:
'Hirst, D.J.; Bigi, B.; Cho, H.-S.; Ding, H.; Herment, S.; Wang, T. ``Building OMProDat, an open multilingual prosodic database'', Proceedings of TRASP, Text and Resources for the Analysis of Speech Prosody, satellite workshop of Interspeech 2013, Aix-en-Provence, August 30, 2013 (submitted)'