7693 results found.
EmotiBlog
Written
Corpus,
RANLP2011
Language Type:
Trilingual
Languages:
English Spanish Italian
Availability:
Free on request from authors
License:
CreativeCommons
Size:
270000 words
Production Status:
Newly created-in progress
Use:
Emotion Recognition/Generation
Paper:
N/A
Documentation:
In Spanish and English
Persian TimeBank (PerTimeBank)
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Not Available
License:
<Not Specified>
Size:
26949
Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
The documentation is written in Persian
MSNBC News test set
Written
Evaluation Data,
COLING2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
NA
Size:
200 news articles
Production Status:
Existing-used
Use:
Word Sense Disambiguation
Paper:
N/A
Documentation:
Cucerzan, Silviu. 2007. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning.
Wiki50
Written
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution Share Alike
Size:
4350
Production Status:
Existing-used
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
Vincze, Veronika; Nagy T., István; Berend, Gábor. 2011. Multiword expressions and Named Entities in the Wiki50 corpus. In: Proceedings of RANLP 2011. Hissar, Bulgaria, pp. 289-295.
Sejong morphological analyzed corpus for written language
Written
Corpus,
COLING2016
Language Type:
Multilingual
Languages:
Korean
Availability:
Freely Available
License:
CC BY-NC-ND 4.0
Size:
871889 sentences
Production Status:
Existing-used
Use:
Morphological Analysis
Paper:
N/A
Documentation:
<Not Specified>
metaTED
Written
Corpus,
LREC2016
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
16 concepts
Production Status:
Newly created-finished
Use:
Discourse
Paper:
N/A
Documentation:
Provided in a README file with the resource. Documentation is in English.
taln-archives
Written
Metadata,
LREC2016
Language Type:
Multilingual
Languages:
French
Availability:
Freely Available
License:
<Not Specified>
Size:
3,5 MByte
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
https://github.com/boudinfl/taln-archives
<Not Specified>
Written
Evaluation Data,
EMNLP2010
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
SentiWordNet
Written
Lexicon,
IJCNLP2011
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Opinion Mining/Sentiment Analysis
Paper:
N/A
Documentation:
<Not Specified>
Corpus of Bilingual Emphasized Speech
Speech
Corpus,
COLING2012
Language Type:
Multilingual
Languages:
English Japanese
Availability:
Freely Available
License:
CreativeCommons
Size:
1000 utterances Other
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
In Preparation