7693 results found.
ACE 2004 training data
Written
Evaluation Data,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
471 documents Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
CONLL-X Shared Task
Written
Corpus,
EMNLP2010
Expand/Collapse
Language Type:
Trilingual
Languages:
Portuguese Spanish Swedish
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Parsing
Paper:
N/A
Documentation:
<Not Specified>
Minho Quotation Bank
Speech
Corpus,
RANLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
In progress. Please contact lead author
License:
OpenSource - research only
Size:
300000 Individual Quotations Production Status:
Newly created-in progress
Use:
Not Applicable
Paper:
N/A
Documentation:
<Not Specified>
Icelandic Frequency Dictionary
Written
Corpus,
RANLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
Icelandic
Availability:
From Data Center(s)
License:
Available for research purposes
Size:
590000 tokens Production Status:
Existing-updated
Use:
Language Modelling
Paper:
N/A
Documentation:
Íslensk orðtíðnibók (Pind et al., 1990). A book In Icelandic.
English Wikipedia
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike
Size:
200MB Production Status:
Existing-used
Use:
Natural Language Generation
Paper:
N/A
Documentation:
<Not Specified>
Korean emotional speech corpus
Speech
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Korean
Availability:
From Data Center(s)
License:
Commercial
Size:
96Mbyte Production Status:
Existing-used
Use:
Speech Synthesis
Paper:
N/A
Documentation:
documentation written in Korean
TREC Entity 2010
Written
Evaluation Data,
IJCNLP2011
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
50 test questions and their answers Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
TREC
<Not Specified>
Multimodal/Multimedia
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Polish
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>
TIMIT
Speech
Corpus,
IS2013
Expand/Collapse
Language Type:
Multilingual
Languages:
American English
Availability:
From Data Center(s)
License:
LDC
Size:
6300 sentences Production Status:
Existing-used
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
NLTK POS FraMed Model
Written
Machine-Learning Model,
LREC2014
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
CC BY-NC 3.0
Size:
5.8 MByte Production Status:
Newly created-finished
Use:
POS tagging
Paper:
N/A
Documentation:
See NLTK documentation.