7693 results found.
Web-based Bengali news corpus
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
Bengali
Availability:
From Owner
License:
NA
Size:
34 million wordforms
Production Status:
Existing-used
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
Described in the paper (Asif Ekbal and Sivaji Bandyopadhyay, 2008) in the Language Resources and Evaluation journal
Wikipedia datasets
Written
Corpus,
COLING2018
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC0 public domain dedication
Size:
130 MByte
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
GhoSt-PV: A Representative Gold Standard of German Particle Verbs
Written
Lexicon,
LREC2016
Language Type:
Multilingual
Languages:
German
Availability:
Free for non-commercial use
License:
TBA
Size:
400 entries
Production Status:
Newly created-finished
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>
JUMAN
<Not Specified>
Tagger/Parser,
COLING2010
Language Type:
Multilingual
Languages:
Japanese
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
CRF Chunker
Not Applicable
Tokenizer,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
NA
Size:
NA
Production Status:
Existing-used
Use:
Emotion Recognition/Generation
Paper:
N/A
Documentation:
NA
FBIS corpus
Speech/Written
Corpus,
NAACL2013
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
302000
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Chinese product reviews
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
OpenSource
Size:
2560 KB (zipped version)
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
English documentation is publicly available
Freebase
Written
Ontology,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
8.8 GByte
Production Status:
Existing-used
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
<Not Specified>
computer science terms
Written
Corpus,
NAACL2013
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
1255
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
No
Wikipedia English version
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
38
Production Status:
Existing-updated
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>