7693 results found.
ACE
Speech/Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
From Owner
License:
LDC
Size:
<Not Specified>
Production Status:
<Not Specified>
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
<Not Specified>
Sogou Query Log
Written
Query Log,
COLING2010
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
4.3 GB
Production Status:
Existing-used
Use:
Session Detection
Paper:
N/A
Documentation:
<Not Specified>
Wiktionary
Multimodal/Multimedia
Lexicon,
IJCNLP2011
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
144 MB; 1,700,000 articles; 335,748 entries; 421,847 word senses
Production Status:
Existing-used
Use:
Lexical Resource Alignment
Paper:
N/A
Documentation:
Creative Commons Attribution/Share-Alike License
Tu&Roth
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
2162
Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
Y. Tu and D. Roth, Learning English Light Verb Constructions: Contextual or Statistical, ACL-HLT workshop: Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), Portland, Oregon, 2011.
Deceptive Opinion Spam Corpus v1
Written
Corpus,
ACLHT2011
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
Creative Commons Attribution-Only license.
Size:
455 KB
Production Status:
Newly created-finished
Use:
Deception Detection
Paper:
N/A
Documentation:
Documentation is available at http://framenet.icsi.berkeley.edu/index.php?option=com_wrapper&Itemid=126
RTE data set
Written
Evaluation Data,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
2400 pairs of text and hypothesis sentences
Production Status:
Existing-used
Use:
Textual Entailment and Paraphrasing
Paper:
N/A
Documentation:
RTE workshop proceedings
SenseLearner 2.0
Written
Word Sense Disambiguator,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
Available in English
Penn Treebank
Written
Grammar/Language Model,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
Large Movie Review Dataset
Written
Corpus,
ACLHT2011
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
GPL
Size:
100,000 documents
Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
English html (included with code)
LADL tables
Written
Lexicon,
COLING2012
Language Type:
Multilingual
Languages:
French
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>