577 results found.
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Linguistic Data Consortium (LDC)
License:
LDC
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
English documentation
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
LDC
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
From Owner
License:
<Not Specified>
Size:
1M
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
Chinese Documentation
Written
Treebank,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
51000 words
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
Yes English Yes
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
250K
Production Status:
Existing-used
Use:
analysis
Paper:
N/A
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
Size:
438 articles Other
Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
- Paper title: Temporal Histories of Epidemic Events (THEE): A Case Study in Temporal Annotation for Public Health
- Paper track: Written/oral presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jingcheng Niu | TheeBank | /N |
Documentation:
There will be a publicly available annotation standard (written in English) provided with the corpus
Written
Corpus,
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Freely Available
License:
OpenSource
Size:
Around 70000 <Not Specified>
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
- Paper title: TimeBankPT: A TimeML Annotated Corpus of Portuguese
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Francisco Costa | <Not Specified> | None |
| Author 2 | António Branco | Universidade de Lisboa | None |
| Main Contact | Francisco Costa | University of Lisbon | PT |
Documentation:
<Not Specified>
Language Type:
Bilingual
Languages:
<Not Specified>
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-in progress
Use:
Lexicon Creation/Annotation
- Paper title: Grammar Extraction from Treebanks for Hindi and Telugu
- Paper track: Written
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Prasanth Kolachina | Affiliation | None |
| Author 2 | Sudheer Kolachina | <Not Specified> | None |
| Author 3 | Anil Kumar Singh | <Not Specified> | None |
| Author 4 | Samar Husain | <Not Specified> | None |
| Author 5 | Viswanath Naidu | <Not Specified> | None |
| Author 6 | Rajeev Sangal | <Not Specified> | None |
| Author 7 | Aksar Bharati | <Not Specified> | None |
| Main Contact | Sudheer Kolachina | IIIT | IN |
Documentation:
<Not Specified>
text
Corpus,
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
open
License:
CC BY
Size:
10786651 words <Not Specified>
Production Status:
production
Use:
any
Paper:
N/A
Documentation:
<Not Specified>




