7693 results found.
DSim
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Danish
Availability:
From Owner
License:
not decided
Size:
48000 Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
in progress
natural language toolkit
Written
Word Sense Disambiguator,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Attribution-Noncommercial-No Derivative Works 3.0 United States
Size:
1.3MB Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
http://www.nltk.org/documentation
English GigaWord
Written
Corpus,
ACLHT2011
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
unknown
Size:
<Not Specified> Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
Paper:
N/A
Documentation:
<Not Specified>
DUC2002
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
DUC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Summarisation
Paper:
N/A
Documentation:
<Not Specified>
Subjectivity Lexicon
<Not Specified>
Lexicon,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
600KByte Production Status:
Existing-used
Use:
Opinion Mining/Sentiment Analysis
Paper:
N/A
Documentation:
Available at http://www.cs.pitt.edu/mpqa/lexiconrelease/collectinfo1.html
TIMIT
Speech
Corpus,
NAACL2013
Expand/Collapse
Language Type:
Multilingual
Languages:
American English
Availability:
From Data Center(s)
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
OpenNLP POS FraMed Model
Written
Machine-Learning Model,
LREC2014
Expand/Collapse
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
CC BY-NC 3.0
Size:
680 KByte Production Status:
Newly created-finished
Use:
POS tagging
Paper:
N/A
Documentation:
See OpenNLP documentation.
IPI PAN Corpus of Polish (manually disambiguated part)
Multimodal/Multimedia
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Polish
Availability:
From Owner
License:
Custom
Size:
880000 Production Status:
Existing-used
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
Yes, under above URL
Resources for CMIR based comparable corpora construction
Multimodal/Multimedia
Corpus,
COLING2016
Expand/Collapse
Language Type:
Multilingual
Languages:
Chinese English
Availability:
From Owner
License:
<Not Specified>
Size:
1.6 GByte Production Status:
Newly created-in progress
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>
GALE LDC Parallel Data
Written
Corpus,
IJCNLP2011
Expand/Collapse
Previous
|
Next
Language Type:
Monolingual
Languages:
Standard Arabic
Availability:
From Owner
License:
<Not Specified>
Size:
50 million words Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
LDC