7693 results found.
Tycho Brahe parsed corpus
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
Portuguese
Availability:
Freely Available
License:
free
Size:
20,000 sentences Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
RST Discourse Treebank
Written
Corpus,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
Dicovalence
Written
Lexicon,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
<Not Specified>
German Reference Corpus DeReKo
Written
Corpus,
LREC2014
Expand/Collapse
Language Type:
Multilingual
Languages:
German
Availability:
Freely available for scientific, non-commercial research purposes through search and analysis software COSMAS II
License:
non-standard license: acadamic, non-commercial use only, only via special software (EULA: http://www.ids-mannheim.de/cosmas2/projekt/registrierung/ )
Size:
24 billion tokens Production Status:
Existing-updated
Use:
Linguistic Research
Paper:
N/A
Documentation:
German documentation: http://www.ids-mannheim.de/kl/projekte/korpora/
HPSG-WSJ
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
~48K sentences Production Status:
Newly created-in progress
Use:
Not Applicable
Paper:
N/A
Documentation:
<Not Specified>
TDT3, TDT4, TDT5
Speech and Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Trilingual
Languages:
English Mandarin Chinese Standard Arabic
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Topic Detection and Tracking
Paper:
N/A
Documentation:
<Not Specified>
ACE 2005
Written
Corpus,
EMNLP2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Named Entity Recognition
Paper:
N/A
Documentation:
Serbian
C&C Tools
Written
Tagger/Parser,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
C&C System Licence
Size:
1.1MByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
ACE 2007
Written
Corpus,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English Standard Arabic
Availability:
From Data Center(s)
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Europarl Corpus of Native, Non-native and Translated Texts
Written
Corpus,
LREC2016
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
230 MByte Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
http://nlp.unibuc.ro/resources.html