7693 results found.
StackQA
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons
Size:
questions Other
Production Status:
Newly created-finished
Use:
Question Answering
Paper:
N/A
Documentation:
English documentation
SIGNUM Database
Sign Language
Corpus,
LREC2010
Language Type:
Multilingual
Languages:
German Sign Language
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-finished
Use:
Sign Language Recognition/Generation
Paper:
N/A
Documentation:
<Not Specified>
Korpusik US II PWr (Small Corpus for WSD)
Written
Corpus,
LTC2011
Language Type:
Multilingual
Languages:
Polish
Availability:
From Owner
License:
<Not Specified>
Size:
1344 text snippets
Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>
WordNet
Written
Lexicon,
RANLP2011
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>
SentiWordNet (Bengali)
Written
Lexicon,
COLING2010
Language Type:
Multilingual
Languages:
Bengali
Availability:
From Owner
License:
OpenSource for Research purpose only
Size:
2000 lexemes
Production Status:
Newly created-in progress
Use:
Emotion Recognition/Generation
Paper:
N/A
Documentation:
http://www.amitavadas.com/sentiwordnet.php
TOEFL11
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
3465000
Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
Blanchard et al. (to appear)
Kyoto University's Case Frame Data 1.0
Written
Lexicon,
IJCNLP2011
Language Type:
Multilingual
Languages:
Japanese
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Bi-sentences, lexicon LDC2005T34, Name Entity LDC2005T34
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English, Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
77M Chinese words, 81M English words
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
<Not Specified>
Written
Evaluation Data,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
7500
Production Status:
Newly created-finished
Use:
Semantic Role Labeling
Paper:
N/A
Documentation:
<Not Specified>
Penn Chinese Treebank 5.1
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Linguistic Data Consortium (LDC)
License:
<Not Specified>
Size:
0.5M words
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
English Document