7693 results found.
JUMAN
Written
Tokenizer,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
Freely Available
License:
Others
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Wikipedia
Multimodal/Multimedia
Ontology,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
7 Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>
TAC 2010 Guided Summarization
Written
Corpus,
NAACL2013
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
<Not Specified>
License:
<Not Specified>
Size:
920 documents OtherProduction Status:
Existing-updated
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
Xinhua of Gigaword
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
181M lexemes Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Verb Noun List
Written
Lexicon,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
1 Mbyte Production Status:
Newly created-in progress
Use:
Textual Entailment and Paraphrasing
Paper:
N/A
Documentation:
<Not Specified>
NECTEC corpus
Written
Corpus,
ACL2016
Expand/Collapse
Language Type:
Multilingual
Languages:
Thai
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Mathematical Linguistics
Paper:
N/A
Documentation:
<Not Specified>
Serbian Morphological Dictionary - SMD
Written
Lexicon,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
Serbian
Availability:
Within Unitex distribution freely available dictionaries with 88000 lemmas
License:
<Not Specified>
Size:
158000 lexemes Production Status:
Existing-updated
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
English
Czech-English Parallel Corpus (CzEng)
Written
Corpus,
ACLHT2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Czech English
Availability:
Freely Available
License:
<Not Specified>
Size:
23Gbyte Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
MWE 2008 data sets
Written
Lexicon,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
10.7Mbyte Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
In English
Fisher
Speech
Corpus,
IS2013
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
American English
Availability:
From Owner
License:
LDC
Size:
2000000 Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>