7693 results found.
Clause boundary annotation program (CBAP)
<Not Specified>
Annotation Tool,
COLING2016
Language Type:
Multilingual
Languages:
Japanese
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Dialogue
Paper:
N/A
Documentation:
<Not Specified>
The Vroniplag Corpus: A Dataset for Monolingual and Multilingual Plagiarism Detection
Written
Corpus,
LREC2018
Language Type:
Trilingual
Languages:
English German Spanish
Availability:
Freely Available
License:
Creative Commons Attribution-Share Alike 3.0 Unported license
Size:
4510338 words
Production Status:
Newly created-finished
Use:
Machine Learning
Paper:
N/A
Documentation:
<Not Specified>
Institute of Computing Technology Chinese Lexical Analysis System (ICTCLAS)
Written
Tagger/Parser,
COLING2012
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
unsure
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Chinese word segmentation
Paper:
N/A
Documentation:
http://ictclas.org/
Subtitles Dataset Ground Truth
Not Applicable
Evaluation Data,
COLING2012
Language Type:
Multilingual
Languages:
Dutch
Availability:
Freely Available
License:
<Not Specified>
Size:
1596 links Other
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Hercules Dalianis
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
Swedish
Availability:
Not Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-finished
Use:
Question Answering
Paper:
N/A
Documentation:
<Not Specified>
Penn Chinese Treebank
Written
Corpus,
ACLHT2011
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Data Center(s)
License:
Gnu Public License
Size:
4950 KB
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Mogura
Written
Tagger/Parser,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Enju License
Size:
56 MB
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
The LTH Constituent-to-Dependency Conversion Tool for Penn-style Treebanks
Written
Tagger/Parser,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
88.1 KB
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
WSJCAM0 Cambridge Read News
Speech
Corpus,
IS2011
Language Type:
Multilingual
Languages:
English
Availability:
Linguistic Data Consortium (LDC)
License:
DVDes
Size:
1
Production Status:
Existing-used
Use:
Person Identification
Paper:
N/A
Documentation:
<Not Specified>
Verb Ocean
Written
Lexicon,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Textual Entailment and Paraphrasing
Paper:
N/A
Documentation:
yes