7693 results found.
Standard Arabic Morphological Tagger
Written
Annotation Tool,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
Not Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Morphological Analysis, Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>
Very Large Pronunciation Vocabulary for Russian
Written
Lexicon,
IS2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Russian
Availability:
From Owner
License:
<Not Specified>
Size:
2300 Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
Penn Treebank
Written
Corpus Tool,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
<Not Specified>
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
IGN DVD Reviews
<Not Specified>
Corpus,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
665 documents Production Status:
Newly created-finished
Use:
Sentiment Analysis
Paper:
N/A
Documentation:
<Not Specified>
PAN Plagiarism Corpus PAN-PC-11
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Trilingual
Languages:
English German Spanish
Availability:
Freely Available
License:
<Not Specified>
Size:
2 Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
in English
NAIST Text Corpus
Written
Corpus,
IJCNLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
Freely Available
License:
<Not Specified>
Size:
51 Mbyte Production Status:
Existing-used
Use:
Training Data for Domain Adaptation
Paper:
N/A
Documentation:
open source
Persian WordNet
Written
Ontology,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
From Owner
License:
<Not Specified>
Size:
10000 synsets Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Darpa TIDES Surprise Language Dataset
Written
Corpus,
IJCNLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Hindi
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
200,000 sentence-pairs Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Turkish-English gold morpheme alignments
Written
Evaluation Data,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Turkish
Availability:
Freely Available
License:
<Not Specified>
Size:
100 Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
GigaWord
Written
Corpus,
EMNLP2010
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
11709Mb Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
Paper:
N/A
Documentation:
<Not Specified>