7693 results found.
Switchboard Corpus
Written
Corpus,
ACLHT2011
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
16 MByte
Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
Yes, English, Yes
Dot2Text
Speech/Written
text to phonetic transcription,
LREC2016
Language Type:
Multilingual
Languages:
Hebrew
Availability:
Freely Available
License:
GNU
Size:
100 KByte
Production Status:
Newly created-finished
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
European Parliament Proceedings Parallel Corpus v6 English-French-Spanish
Written
Corpus,
LREC2016
Language Type:
Trilingual
Languages:
English, Spanish, French
Availability:
Freely Available
License:
<Not Specified>
Size:
470 MByte
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
http://opus.lingfil.uu.se/Europarl.php
Penn Discourse Treebank Parser
Written
Tagger/Parser,
ACLHT2011
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
18 MByte
Production Status:
Newly created-in progress
Use:
Discourse
Paper:
N/A
Documentation:
Readme (English) distributed with dataset
A Large English-Chinese Parallel Corpus
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English, Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
14M parallel sentences
Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
TAC 2011 Summarization Track Data
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Summarisation
Paper:
N/A
Documentation:
<Not Specified>
Reuters
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Dimensionality calculation
Paper:
N/A
Documentation:
<Not Specified>
TiGer
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
TWSI lexical substitution dataset
Written
Corpus,
NAACL2013
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
24700
Production Status:
Existing-used
Use:
Lexical substitution
Paper:
N/A
Documentation:
<Not Specified>
The EMIME Mandarin/English Bilingual Database
Speech
Corpus,
IS2011
Language Type:
Multilingual
Languages:
English, Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
1.3 GByte
Production Status:
Newly created-finished
Use:
Speech Synthesis
Paper:
N/A
Documentation:
<Not Specified>