7693 results found.
m-grams
Written
Corpus,
LREC2016
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC
Size:
500 MByte Production Status:
Newly created-in progress
Use:
Text Mining Applications for Music Industry
Paper:
N/A
Documentation:
<Not Specified>
DIDEC: The Dutch Image Description and Eye-tracking Corpus
Multimodal/Multimedia
Corpus,
COLING2018
Expand/Collapse
Language Type:
Multilingual
Languages:
Dutch
Availability:
Freely Available
License:
Apache 2.0
Size:
307 images, 4.6K descriptions, plus eye-tracking data <Not Specified>Production Status:
Newly created-finished
Use:
Natural Language Generation
Paper:
N/A
Documentation:
Documentation in English, available on the website
English Lexical Substitution Dataset
Written
Corpus,
NAACL2013
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
2000 Production Status:
Existing-used
Use:
Lexical substitution
Paper:
N/A
Documentation:
<Not Specified>
NTCIR's Japanese patent document corpus
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Japanese
Availability:
From Owner
License:
NTCIR's contract
Size:
12Gbyte Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
http://research.nii.ac.jp/ntcir/permission/ntcir-6/perm-en-PATENT.html
General Inquirer
Written
Lexicon,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
8K Production Status:
Existing-used
Use:
Text Mining
Paper:
N/A
Documentation:
<Not Specified>
English TTS speech corpus of air traffic (pilot) messages - Czech accent
Speech/Written
Corpus,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Size:
1692 sentences Production Status:
Newly created-finished
Use:
Speech Synthesis
Paper:
N/A
Documentation:
<Not Specified>
JRC-Acquis partitioned in domains
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Modern Greek
Availability:
Freely Available
License:
<Not Specified>
Size:
1059883 Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
JCore Token FraMed Model
Written
Machine-Learning Model,
LREC2014
Expand/Collapse
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
CC BY-NC 3.0
Size:
1.5 MByte Production Status:
Newly created-finished
Use:
Tokenization
Paper:
N/A
Documentation:
See JULIE Token Boundary Detector documentation.
AIMed
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
127KByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
KNP
Written
Tagger/Parser,
COLING2010
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
Japanese Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>