10000 results found.
Prague Czech English Dependency Treebank 2.0
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Bilingual
Languages:
English Czech
Availability:
From Data Center(s)
License:
Linguistic Data Consortium - LDC2004T25
Size:
1.2 million <Not Specified>Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
None
Lemur retrieval tool
Written
Information Retrieval Tool,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
Freely Avalable
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
None
LOB Corpus
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
ICAME
Size:
approx. 1,000,000 <Not Specified>Production Status:
Existing-used
Use:
diachronic comparison
Paper:
N/A
Documentation:
None
Frown Corpus
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
ICAME
Size:
approx. 1,000,000 <Not Specified>Production Status:
Existing-used
Use:
diachronic comparison
Paper:
N/A
Documentation:
None
FLOB
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
ICAME
Size:
approx. 1,000,000 <Not Specified>Production Status:
Existing-used
Use:
diachronic comparison
Paper:
N/A
Documentation:
None
Connexor Machinese Syntax
Written
Tagger/Parser,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
Connexor
Size:
<Not Specified> <Not Specified>Production Status:
Existing-used
Use:
tokenisation and lemmatisation
Paper:
N/A
Documentation:
None
Brown Corpus
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
ICAME
Size:
approx. 1,000,000 <Not Specified>Production Status:
Existing-used
Use:
diachronic comparison
Paper:
N/A
Documentation:
None
RoTC
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Monolingual
Languages:
Aromanian; Arumanian; Macedo-Romanian
Availability:
From Owner
License:
<Not Specified>
Size:
341320 <Not Specified>Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
None
Russian-Ukrainian parallel corpus
Written
Corpus,
LREC2012
Expand/Collapse
Language Type:
Bilingual
Languages:
Old Russian Ukrainian
Availability:
Freely Avalable
License:
<Not Specified>
Size:
2,5 million wordsProduction Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
None
Russian-Belorussian corpus
Written
Corpus,
LREC2012
Expand/Collapse
Previous
|
Next
Language Type:
Bilingual
Languages:
Belarusian Old Russian
Availability:
Freely Avalable
License:
<Not Specified>
Size:
1 million tokensProduction Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
None