7693 results found.
FBIS corpus
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
23 Million Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
LDC2003E14
ORTOFON v1: balanced corpus of informal spoken Czech with multi-tier transcription (transcriptions)
Speech
Corpus,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
Czech
Availability:
From Data Center(s)
License:
http://creativecommons.org/licenses/by-nc-sa/4.0/
Size:
1M words Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
http://wiki.korpus.cz/doku.php/en:cnk:ortofon
A corpus of German political speeches from the 21st century
Written
Corpus,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
CC BY SA
Size:
10.9 million tokens Production Status:
Existing-updated
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>
Stanford parser
Written
Tagger/Parser,
NAACL2013
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Sentiment Analysis
Paper:
N/A
Documentation:
<Not Specified>
NTU Sentiment Dictionary (NTUSD)
Written
Lexicon,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
http://nlg18.csie.ntu.edu.tw:8080/opinion/pub1.html
Size:
<Not Specified> Production Status:
Existing-updated
Use:
Emotion Recognition/Generation
Paper:
N/A
Documentation:
<Not Specified>
English Chinese Translation Treebank 1.0
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
5,777K Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
LGLex
Written
Lexicon,
IJCNLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
french
Availability:
Freely Available
License:
<Not Specified>
Size:
76672 lexical entries Production Status:
Existing-updated
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
LGPL-LR
RIKEN Japanese Mother-Infant Conversation Corpus
Speech
Corpus,
IS2013
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
Not Available
License:
<Not Specified>
Size:
80000 words Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>
Penn Treebank
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
694Mbyte Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
Documentation is avilable in English
i2b2 2009 shared task on medication extraction
Written
Corpus,
COLING2010
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
English
Availability:
Available from i2b2, need to sign a DUA
License:
<Not Specified>
Size:
326,474 tokens Production Status:
Existing-used
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
there is an annotation guideline from i2b2