7693 results found.
FBIS and MTC data sets
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
10 M words Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Bengali Speech Corpus
Speech
Corpus,
O-COCOSDA2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Bengali
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
Chinese-English Translation Lexicon Version 3.0
Written
Lexicon,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
54k words Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Multilingual glossary of technical and popular medical terms
Written
Lexicon,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
SiBol/Port corpus
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
300 million Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>
TREC corpus Disk4&5
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
1.86GB Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
YouTube Comedy Slam Preference Data Data Set
Written
user ratings,
LREC2014
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
1138562 entries Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
AV-WordProminence
Multimodal/Multimedia
Corpus,
IS2013
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Not Available
License:
<Not Specified>
Size:
6000 sentences Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
dialogue data
Written
Corpus,
IS2013
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
Not Available
License:
<Not Specified>
Size:
324 OtherProduction Status:
Newly created-finished
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
TreeTagger
Written
Tagger/Parser,
COLING2010
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>