7693 results found.
Multidomain Uncertainty Corpus, Bioscope
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
40000 sentences Production Status:
Existing-used
Use:
Text Mining
Paper:
N/A
Documentation:
English
ZPar
Written
Tagger/Parser,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
GPL
Size:
312Kbyte Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
http://www.cl.cam.ac.uk/~yz360/zpar.html
Prague Dependency Treebank 2.0 (PDT 2.0)
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Czech
Availability:
From Data Center(s)
License:
LDC
Size:
50 thousand Production Status:
Existing-updated
Use:
Discourse
Paper:
N/A
Documentation:
http://ufal.mff.cuni.cz/pdt2.0/doc/pdt-guide/en/html/ch05.html
MORESQUE
Written
Corpus,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
114 queries Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
ssj500k
Written
Corpus,
LREC2014
Expand/Collapse
Language Type:
Multilingual
Languages:
Slovenian
Availability:
Freely Available
License:
CreativeCommons
Size:
500000 words Production Status:
Newly created-finished
Use:
Parser training
Paper:
N/A
Documentation:
<Not Specified>
Clause Boundary Annotation Program
Written
Annotation Tool,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
clause boudary detection
Paper:
N/A
Documentation:
Japanese documentation, publicly unavailable
Porter Stemmer
Written
Stemmer,
COLING2012
Expand/Collapse
Language Type:
Trilingual
Languages:
English German french
Availability:
Freely Available
License:
BSD License
Size:
<Not Specified> Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
Publicly available in English
Hungarian WordNet
Written
Ontology,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Hungarian
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
42K synsets Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Japanese Wikipedia
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
Freely Available
License:
GNU
Size:
10GB Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
Paper:
N/A
Documentation:
<Not Specified>
ChaSen
Written
Tokenizer,
ACLHT2011
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
Japanese
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>