7693 results found.
Tiger Treebank
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
Non-commercial
Size:
50k sentences
Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
See URL. English
Word Clipping Test Set
Written
Evaluation Data,
IJCNLP2011
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
7600 instances
Production Status:
Newly created-finished
Use:
Word Choice
Paper:
N/A
Documentation:
<Not Specified>
EPG
Written
Metadata,
IS2013
Language Type:
Multilingual
Languages:
American English
Availability:
From Owner
License:
<Not Specified>
Size:
10 GByte
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
Bitter Lemons
Written
Corpus,
EMNLP2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
7MB
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
NTCIR patent corpus
Written
Corpus,
IJCNLP2011
Language Type:
Multilingual
Languages:
English Japanese
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
10 GByte
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
CreativeCommons
Penn Discourse Treebank 2.0
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
American English
Availability:
From Data Center(s)
License:
LDC
Size:
30 MByte
Production Status:
Existing-used
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
CMU Let's Go Data
Speech
Evaluation Data,
IS2011
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
dialogs
Size:
4300
Production Status:
Existing-updated
Use:
Dialogue
Paper:
N/A
Documentation:
<Not Specified>
Text summarization corpus for the credibility of information on the Web
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
Japanese
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-in progress
Use:
Summarisation
Paper:
N/A
Documentation:
<Not Specified>
POV differences
Written
Evaluation Data,
COLING2012
Language Type:
Multilingual
Languages:
English Standard Arabic
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
<Not Specified>
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
RST Spanish Treebank
Written
Corpus,
RANLP2011
Language Type:
Multilingual
Languages:
Spanish
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-in progress
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>