7693 results found.
MC_v2_ENV_IT
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
Italian
Availability:
<Not Specified>
License:
<Not Specified>
Size:
37M
Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>
Lemuoklis
Written
Annotation Tool,
COLING2012
Language Type:
Multilingual
Languages:
Lithuanian
Availability:
From Owner
License:
<Not Specified>
Size:
197 MByte
Production Status:
Existing-used
Use:
Morphological Analysis
Paper:
N/A
Documentation:
No documentation.
Celebrity
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
DFKI Research License
Size:
16.6 MB
Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
not yet
obi2/B9
Written
tool for measuring readability,
LREC2014
Language Type:
Multilingual
Languages:
Japanese
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
readability assignment
Paper:
N/A
Documentation:
<Not Specified>
Mörkuð íslensk málheild (MÍM)
Written
Corpus,
RANLP2011
Language Type:
Multilingual
Languages:
Icelandic
Availability:
From Data Center(s)
License:
Available for research purposes
Size:
25 million tokens
Production Status:
Newly created-in progress
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
The IIT Bombay English-Hindi Parallel Corpus
Written
Corpus,
LREC2018
Language Type:
Multilingual
Languages:
English Hindi
Availability:
Freely Available
License:
CreativeCommons
Size:
1.49 million parallel segments
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
http://www.cfilt.iitb.ac.in/iitb_parallel/
Penn Treebank
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
American English
Availability:
<Not Specified>
License:
<Not Specified>
Size:
45000
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Chinese Temporal Annotation Data Set
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
80 newswire articles
Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Tigrinya Language Word List, Stop Words and Affix List
Written
Lexicon,
COLING2012
Language Type:
Multilingual
Languages:
Tigrinya
Availability:
Freely Available
License:
<Not Specified>
Size:
24.3 MByte
Production Status:
Newly created-in progress
Use:
For developing a Tigrinya stemmer
Paper:
N/A
Documentation:
<Not Specified>
NLTK Sentence FraMed Model
Written
Machine-Learning Model,
LREC2014
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
CC BY-NC 3.0
Size:
374 KByte
Production Status:
Newly created-finished
Use:
Sentence boundary detection
Paper:
N/A
Documentation:
See NLTK documentation.