7693 results found.
hr500k
Written
Corpus,
LREC2016
Language Type:
Multilingual
Languages:
Croatian
Availability:
From Owner
License:
<Not Specified>
Size:
496989 tokens
Production Status:
Newly created-finished
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Japanese Speaking Style Parallel Database for Expressive Speech Synthesis
Speech
Corpus,
IS2013
Language Type:
Multilingual
Languages:
Japanese
Availability:
Not Available
License:
<Not Specified>
Size:
20 minutes
Production Status:
Not Applicable
Use:
Speech Synthesis
Paper:
N/A
Documentation:
Not Available
Cantonese-Mandarin Parallel Corpus
Written
Corpus,
IJCNLP2011
Language Type:
Multilingual
Languages:
Mandarin Chinese, Yue Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
40K characters
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
VIT
Speech/Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
Italian
Availability:
From Data Center(s)
License:
ELRA
Size:
20 Mb
Production Status:
Existing-updated
Use:
Language Modelling
Paper:
N/A
Documentation:
Yes, in English
<Not Specified>
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English, Urdu
Availability:
From Owner
License:
None
Size:
<Not Specified>
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
TimeBank
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
68K lexemes
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
JMdict
Written
Lexicon,
COLING2010
Language Type:
Multilingual
Languages:
English, Japanese
Availability:
Freely Available
License:
EDRDG General Dictionary License
Size:
133715 entries
Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
Available at above site in English
People’s Daily from 1993-1997
Written
Corpus,
COLING2010
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
212M
Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
PAN Plagiarism Corpus PAN-PC-10
Written
Corpus,
COLING2012
Language Type:
Trilingual
Languages:
English, German, Spanish
Availability:
Freely Available
License:
<Not Specified>
Size:
1.6
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
Web-service reviews
Written
Corpus,
EMNLP2010
Language Type:
Multilingual
Languages:
English
Availability:
Not Available
License:
<Not Specified>
Size:
234 documents, 6091 sentences
Production Status:
Newly created-in progress
Use:
Opinion Mining
Paper:
N/A
Documentation:
<Not Specified>