7693 results found.
BTEC
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Trilingual
Languages:
English Mandarin Chinese Standard Arabic
Availability:
IWSLT evaluation
License:
<Not Specified>
Size:
80K sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
STEVIN Nederlandstalig Referentiecorpus (SoNaR)
Written
Corpus,
RANLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Dutch
Availability:
From Owner
License:
TST
Size:
500 MW Production Status:
Newly created-in progress
Use:
Anaphora, Coreference
Paper:
N/A
Documentation:
<Not Specified>
FrameNet
Written
Corpus,
RANLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
custom license, I believe
Size:
<Not Specified> Production Status:
Existing-used
Use:
Semantic Role Labeling
Paper:
N/A
Documentation:
Excellent documentation on the FrameNet website.
British National Corpus (BNC)
Speech/Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
6 million Production Status:
Existing-used
Use:
Vector space creation for distributional models
Paper:
N/A
Documentation:
<Not Specified>
L1-L2 Chinese parallel dependency treebank
Written
Treebank,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
Chinese
Availability:
From Owner
License:
CreativeCommons
Size:
1297 sentences Production Status:
Newly created-in progress
Use:
Acquisition
Paper:
N/A
Documentation:
Annotation guidelines are available on the Universal Dependencies web page (universal dependencies.org)
itwewina
Written
Lexicon,
LREC2016
Expand/Collapse
Language Type:
Multilingual
Languages:
Plains Cree
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> words Production Status:
Newly created-in progress
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>
apertium-mar
Written
Lexicon,
LREC2016
Expand/Collapse
Language Type:
Multilingual
Languages:
Marathi
Availability:
Freely Available
License:
GNU GPL
Size:
2 MByte Production Status:
Newly created-in progress
Use:
Morphological Analysis
Paper:
N/A
Documentation:
http://wiki.apertium.org/wiki/apertium-mar
Augmented Multi-party Interaction (AMI) meeting corpus
Multimodal/Multimedia
Corpus,
IS2013
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Not Applicable
Use:
Multimedia Document Processing
Paper:
N/A
Documentation:
<Not Specified>
The ILMT-s2s corpus
Multimodal/Multimedia
Corpus,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
English Portuguese
Availability:
Freely Available
License:
ELRA
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Dialogue
Paper:
N/A
Documentation:
<Not Specified>
TIMIT
Speech
Not Assigned,
IS2013
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
American English
Availability:
From Data Center(s)
License:
LDC
Size:
6300 Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
Available in English at the Resource URL