7693 results found.
Turkmen web corpus
Written
Corpus,
LREC2012
Language Type:
Monolingual
Languages:
Turkmen
Availability:
From Owner
License:
<Not Specified>
Size:
2000000 <Not Specified>
Production Status:
Newly created-finished
Use:
Language Modelling
Paper:
N/A
Documentation:
None
Turkish web corpus
Written
Corpus,
LREC2012
Language Type:
Monolingual
Languages:
Balkan Gagauz Turkish
Availability:
From Owner
License:
<Not Specified>
Size:
3370000000 <Not Specified>
Production Status:
Newly created-finished
Use:
Language Modelling
Paper:
N/A
Documentation:
None
Morfessor
Written
Language Modeling Tool,
LREC2012
Language Type:
Monolingual
Languages:
No linguistic content; Not applicable
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>
Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
None
Kyrgyz web corpus
Written
Corpus,
LREC2012
Language Type:
Monolingual
Languages:
Kirghiz
Availability:
From Owner
License:
<Not Specified>
Size:
19000000 <Not Specified>
Production Status:
Newly created-finished
Use:
Language Modelling
Paper:
N/A
Documentation:
None
Kazakh web corpus
Written
Corpus,
LREC2012
Language Type:
Monolingual
Languages:
Kazakh
Availability:
From Owner
License:
<Not Specified>
Size:
136000000 <Not Specified>
Production Status:
Newly created-finished
Use:
Language Modelling
Paper:
N/A
Documentation:
None
Azerbaijani web corpus
Written
Corpus,
LREC2012
Language Type:
Monolingual
Languages:
Azerbaijani
Availability:
From Owner
License:
<Not Specified>
Size:
92000000 <Not Specified>
Production Status:
Newly created-finished
Use:
Language Modelling
Paper:
N/A
Documentation:
None
HaberOzetleri
Written
Evaluation Data,
LREC2012
Language Type:
Monolingual
Languages:
Balkan Gagauz Turkish
Availability:
Not Applicable
License:
<Not Specified>
Size:
160 <Not Specified>
Production Status:
Newly created-in progress
Use:
Summarisation
Paper:
N/A
Documentation:
None
Wikipedia
Written
Corpus,
LREC2012
Language Type:
Monolingual
Languages:
Balkan Gagauz Turkish
Availability:
Freely Available
License:
<Not Specified>
Size:
1.12 Gbyte
Production Status:
Existing-used
Use:
Acquisition
Paper:
N/A
Documentation:
None
METU Turkish Microphone Speech Corpus
Speech
Corpus,
LREC2012
Language Type:
Monolingual
Languages:
Balkan Gagauz Turkish
Availability:
Free for LDC members
License:
<Not Specified>
Size:
5.2 hours
Production Status:
Existing-used
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
None
h2p.learnpunjabi.org
Written
Machine Translation Tool,
LREC2012
Language Type:
Bilingual
Languages:
Hindi Punjabi
Availability:
SpeechToSpeech Translation
License:
Freely Available
Size:
online system Other
Production Status:
Existing-used
Use:
Machine Translation
Paper:
N/A
Documentation:
None