Bitterlemons Corpus
Written
Corpus,
EMNLP2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
594 articles
Production Status:
Existing-used
Use:
Summarisation
Paper:
N/A
Documentation:
<Not Specified>
Catvar
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
4.5
Production Status:
Existing-used
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>
TREC-8 collection
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
NIST, USA
Size:
1.64 Gbytes
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
There is documentation publicly available in English
AURORA Project Database 2.0 - Evaluation Package
Speech
Corpus,
IS2011
Language Type:
Multilingual
Languages:
American English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
Standard Arabic Morphological Analyzer (SAMA)
Written
Annotation Tool,
EMNLP2010
Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
From Data Center(s)
License:
LDC
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Morphological Analysis, Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>
RWSCor
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
100 Mbyte
Production Status:
Newly created-in progress
Use:
Summarisation
Paper:
N/A
Documentation:
<Not Specified>
VerbNet
Written
Lexicon,
RANLP2011
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Web Service
Written
Language Learning Tool,
COLING2010
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Not Applicable
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-in progress
Use:
Not Applicable
Paper:
N/A
Documentation:
<Not Specified>
Europarl v6 French-English
Written
Corpus,
LTC2011
Language Type:
Multilingual
Languages:
English, French
Availability:
Freely Available
License:
<Not Specified>
Size:
1825077 sentences
Production Status:
Existing-used
Use:
Statistical phrase alignment
Paper:
N/A
Documentation:
Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005
20 newsgroups
Written
Evaluation Data,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
14 MB
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>