7693 results found.
Quaero named entity corpora
Speech
Corpus,
IS2011
Language Type:
Multilingual
Languages:
French
Availability:
Available in the future through the ETAPE project
License:
MByte
Size:
1.2
Production Status:
Existing-updated
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
<Not Specified>
bsWaC
Written
Corpus,
LREC2014
Language Type:
Trilingual
Languages:
Bosnian Croatian Serbian
Availability:
Freely Available
License:
CC-BY-SA 3.0
Size:
428925567
Production Status:
Newly created-in progress
Use:
Paper:
N/A
Documentation:
<Not Specified>
pre-test, training, post-test experimental design
Speech
Evaluation Methodology/Standards/Guidelines,
LTC2011
Language Type:
Multilingual
Languages:
French
Availability:
From Owner
License:
<Not Specified>
Size:
2 tests
Production Status:
Newly created-finished
Use:
Acquisition
Paper:
N/A
Documentation:
in French
WT10g
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
11 GB
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
TREC QA questions
Written
Corpus,
COLING2018
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
9.1 MByte
Production Status:
Existing-used
Use:
Question Answering
Paper:
N/A
Documentation:
<Not Specified>
BAF corpus
Written
Evaluation Data,
COLING2010
Language Type:
Multilingual
Languages:
English French
Availability:
Freely Available
License:
OpenSource
Size:
740
Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
Yes, French, publicly available
FAUST
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/370_Paper.pdf
Chinese Zero Anaphora corpus
Written
Corpus,
EMNLP2010
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
OpenSource
Size:
1.73 MByte
Production Status:
Newly created-in progress
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
DECA Species Corpus
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
372K gzipped
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Spanish-English Europarl
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English Spanish
Availability:
Freely Available
License:
none
Size:
169 MB, 1,689,850 parallel sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005. http://www.iccs.inf.ed.ac.uk/~pkoehn/publications/europarl-mtsummit05.pdf