577 results found.
Written
Treebank,
Language Type:
Multilingual
Languages:
Afrikaans
Availability:
From Data Center(s)
License:
CreativeCommons
Size:
44715 words Production Status:
Newly created-finished
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
Non-commercial
Size:
50k sentences Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
See URL. English
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
The GENIA License
Size:
2,347,058 bytes, 1999 PubMed abstracts Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
CC
Size:
18Mbyte Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
Written
Treebank,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
1 Million texts Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
http://www.cis.upenn.edu/~treebank/, English, publicLanguage Type:
Multilingual
Languages:
German
Availability:
Freely Available
License:
<Not Specified>
Size:
50000 Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
<Not Specified>
License:
LDC
Size:
<Not Specified> Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Hindi
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
~1M tokens Production Status:
Existing-used
Use:
Tagging
Paper:
N/A
Documentation:
<Not Specified>




