5493 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Written
Treebank,
Language Type:
Multilingual
Languages:
Afrikaans
Availability:
From Data Center(s)
License:
CreativeCommons
Size:
44715 words Production Status:
Newly created-finished
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
7200 sentences Production Status:
Newly created-in progress
Use:
Morphological Analysis
Paper:
N/A
Documentation:
Trips (2012), Trips (2014), Trips and Kornfilt (2015)
Written
Corpus,
Language Type:
Multilingual
Languages:
Japanese
Availability:
need Mainichi Shinbun '95 corpus
License:
OpenSource
Size:
2929 articles Production Status:
Existing-used
Use:
Discourse
Paper:
N/A
Documentation:
English
Speech
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
CDs
Size:
20 Production Status:
Existing-used
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Japanese
Availability:
Eventually to be publicly available (not now).
License:
<Not Specified>
Size:
Apprx. 900 Production Status:
Newly created-in progress
Use:
Emotion Recognition/Generation
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
1.8 million articles Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
LDC
Multimodal/Multimedia
Proficiency testing tool,
Language Type:
Multilingual
Languages:
English Japanese
Availability:
From Owner
License:
Creative Commons Attribution 3.0 Unported
Size:
24MB, 12000 lines of code Production Status:
Newly created-in progress
Use:
Web Services
Paper:
N/A
Documentation:
Basic setup and install documentation available in English and bundled with source




