10000 results found.
ANC (American National Corpus) MASC (Manually Annotated Sub-Corpus)
Speech/Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
American English
Availability:
Freely Avalable
License:
none
Size:
500 wordsProduction Status:
Existing-updated
Use:
Most of the above
Paper:
N/A
Documentation:
None
Reuters RCV1
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Brazilian Portuguese Danish Finland-Swedish Sign Language Germany Italian Spanish french
Availability:
From Owner
License:
<Not Specified>
Size:
13 <Not Specified>Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
None
JRC (Joint Research Centre)-Acquis
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Brazilian Portuguese Danish Finland-Swedish Sign Language Germany Italian Spanish french
Availability:
Freely Avalable
License:
<Not Specified>
Size:
464 <Not Specified>Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
None
Europarl
Speech/Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Brazilian Portuguese Danish Finland-Swedish Sign Language Germany Italian Spanish french
Availability:
Freely Avalable
License:
<Not Specified>
Size:
25600000 <Not Specified>Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
None
BAF (Bilingual corpus)
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Bilingual
Languages:
English Cajun French
Availability:
Freely Avalable
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
None
Porn Train Set
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
Freely Avalable
License:
<Not Specified>
Size:
106,000 filenames/titles OtherProduction Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
None
Simple English Wikipedia
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
Freely Avalable
License:
<Not Specified>
Size:
4389599 <Not Specified>Production Status:
Existing-used
Use:
Text Complexity Analysis
Paper:
N/A
Documentation:
None
Crisis Management Corpus
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
2728540 <Not Specified>Production Status:
Newly created-finished
Use:
Text Complexity analysis, Text Simplification
Paper:
N/A
Documentation:
None
Recognizing Narrative Similarity Task
Written
Evaluation Methodology/Standards/Guidelines,
LREC2012
Expand/Collapse
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
Not Applicable
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Evaluation of Machine Story Understanding
Paper:
N/A
Documentation:
None
DramaBank
Written
Corpus,
LREC2012
Expand/Collapse
Previous
|
Next
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
Freely Avalable
License:
<Not Specified>
Size:
791 <Not Specified>Production Status:
Newly created-in progress
Use:
Discourse
Paper:
N/A
Documentation:
None