7693 results found.
GALE LDC Parallel Data
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Standard Arabic
Availability:
From Owner
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
The Undergraduate Learner Translator Corpus
Written and audiovisual
Corpus,
LREC2018
Expand/Collapse
Language Type:
Trilingual
Languages:
Arabic English french
Availability:
From Owner
License:
<Not Specified>
Size:
9000000 tokens Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>
Robocup sportscasting corpus
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English Korean
Availability:
Freely Available
License:
None
Size:
36MB Production Status:
Existing-used
Use:
Semantic Parsing, Language Generation, Alignment of Ambiguous Links
Paper:
N/A
Documentation:
None
data-emotions
Written
Corpus,
ACL2016
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Research License Agreement (Non-Commercial Use Only)
Size:
5000k Twitter user profiles <Not Specified>Production Status:
Existing-updated
Use:
Person Identification
Paper:
N/A
Documentation:
<Not Specified>
Prague Dependency Treebank 2.0 (PDT 2.0)
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Czech
Availability:
<Not Specified>
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Lexicon
Written
Lexicon,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Hindi
Availability:
Not Available
License:
<Not Specified>
Size:
30K words Production Status:
Existing-used
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
<Not Specified>
Penn Treebank
Written
Corpus,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
Lancaster Contemporary Mandarin Corpus
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
1001549 Production Status:
Existing-used
Use:
Morphological Analysis
Paper:
N/A
Documentation:
<Not Specified>
Wikipedia Edit Category Corpus
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
1995 Production Status:
Newly created-finished
Use:
Collaborative writing
Paper:
N/A
Documentation:
Annotation Guidelines and README file are in English and both publicly available.
UIUC Question Classification Data (Training set 5)
Written
Evaluation Data,
IJCNLP2011
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Question Answering
Paper:
N/A
Documentation:
<Not Specified>