7693 results found.
Beygingarlýsing íslensks nútímamáls (BÍN)
Written
Lexicon,
RANLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Icelandic
Availability:
From Data Center(s)
License:
Available for download but with restrictions on use
Size:
5.8 million word forms Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
The DHBB Corpus
Written
Corpus,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
Portuguese
Availability:
<Not Specified>
License:
Creative Commons
Size:
324054 sentences Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
http://github.com/cpdoc
The Database of Japanese Companies and Organizations
Written
Lexicon,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
From Owner
License:
<Not Specified>
Size:
600000 entries Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
http://www.cjk.org/cjk/samples/japorg.htm
Penn Discourse Treebank 2.0
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
All WSJ Production Status:
Existing-used
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
WordNet
<Not Specified>
Ontology,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Wordnet (MIT like)
Size:
117,000 synsets Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
yes
Service Quality Evaluation Data Set
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Gnu
Size:
16Mbytes Production Status:
Newly created-finished
Use:
Knowledge Discovery/Representation
Paper:
N/A
Documentation:
It will be publicly available soon.
MADA
Written
Tokenizer,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
Freely Available
License:
<Not Specified>
Size:
2Mbyte Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
IMDB dataset Maas et al. (Maas et al., 2011)
Written
Corpus,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
80.2 MByte Production Status:
Existing-used
Use:
Opinion Mining/Sentiment Analysis
Paper:
N/A
Documentation:
yes, it's documented in English and the documentation is publicly available
GDep
Written
Tagger/Parser,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
26MByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
SemDis V2 dataset
Written
Evaluation Data,
LREC2018
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
french
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License
Size:
13089 entries Production Status:
Newly created-finished
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>