10000 results found.
Idiom dataset
Written
Idiomatic sentences,
LiNCR2018
Expand/Collapse
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Ortolang
License:
<Not Specified>
Size:
240 sentences OtherProduction Status:
<Not Specified>
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
Morphological analyser for Chukchi
Written
Tagger/Parser,
COLING2018
Expand/Collapse
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Freely Available
License:
OpenSourse
Size:
2 GByte Production Status:
Newly created-in progress
Use:
Morphological Analysis
Paper:
N/A
Documentation:
Yes, English, yes
Chinese Language Technology Platform
<Not Specified>
Tokenizer,
COLING2010
Expand/Collapse
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> Production Status:
<Not Specified>
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
<Not Specified>
<Not Specified>
Lexicon,
COLING2012
Expand/Collapse
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> Production Status:
<Not Specified>
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
Ubuntu dataset
<Not Specified>
Corpus,
COLING2018
Expand/Collapse
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
<Not Specified>
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
global vector for word representation
Written
Lexicon,
COLING2018
Expand/Collapse
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
822 MByte Production Status:
Existing-used
Use:
Question Answering
Paper:
N/A
Documentation:
<Not Specified>
Simple English Wiktionary
<Not Specified>
Lexicon,
COLING2010
Expand/Collapse
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
ARL BotLanguage Corpus
Transcribed speech, text messages, 2D map data
Multi-media corpus,
AREA2018
Expand/Collapse
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Not yet released
License:
<Not Specified>
Size:
33,561 words, 20 hours audio/video OtherProduction Status:
Under development
Use:
Machine learning training data
Paper:
N/A
Documentation:
<Not Specified>
TRECVID MED
Multimodal/Multimedia
Corpus,
IS2013
Expand/Collapse
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
5000 Production Status:
Existing-used
Use:
Multimedia Document Processing
Paper:
N/A
Documentation:
<Not Specified>
International Workshop on Spoken Language Translation (IWSLT) multilingual Corpora
Written
Corpus,
COLING2018
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> tokens Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>