7693 results found.
Stockholm Umeå Corpus
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Swedish
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>
Corpus CINTIL - Corpus Internacional do Português
Written
Corpus,
LREC2016
Expand/Collapse
Language Type:
Multilingual
Languages:
Portuguese
Availability:
From Data Center(s)
License:
ELRA END USER
Size:
1000000 tokens Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Movie Review Data
Written
Corpus,
ACLHT2011
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
<Not Specified> Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
Subjectivity Lexicon
Written
Lexicon,
RANLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Opinion Mining/Sentiment Analysis
Paper:
N/A
Documentation:
documentation in English publicly available
A Parallel Corpus of Thesis and Dissertations Abstracts
Written
Corpus,
COLING2018
Expand/Collapse
Language Type:
Multilingual
Languages:
English Portuguese
Availability:
Freely Available
License:
OpenSource
Size:
1289372 sentences Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
CiNii articles
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English Japanese
Availability:
From Owner
License:
National Institute of Informatics
Size:
15 million entries Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
English Incremental Right-Corner Grammar for HHMM
Written
Grammar/Language Model,
ACLHT2011
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
228Mb Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
2014 French Diachronic News Corpora
Written
Corpus,
LREC2016
Expand/Collapse
Language Type:
Multilingual
Languages:
french
Availability:
Not Available
License:
Copyright by respective news sources
Size:
78 million words Production Status:
Newly created-in progress
Use:
OOV Proper Name recovery
Paper:
N/A
Documentation:
To appear in the corresponding LREC paper
IRNA newspaper text corpus
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Persian
Availability:
Freely Available
License:
<Not Specified>
Size:
3000 document Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
Alpino
Multimodal/Multimedia
Grammar/Language Model,
EMNLP2010
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
Dutch
Availability:
Freely Available
License:
<Not Specified>
Size:
800 rules, 100K lexical entries, 200K named entities Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>