5493 results found.
Speech
Evaluation Data,
Language Type:
Trilingual
Languages:
Basque Catalan English
Availability:
From Owner
License:
Size:
125 hours
Production Status:
Newly created-finished
Use:
Language Identification
Paper:
N/A
Documentation:
<Not Specified>
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English
Availability:
Freely Available
License:
<Not Specified>
Size:
2 million sentences
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>
Written
Tool: discourse coherence model,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Newly created-finished
Use:
Discourse
Paper:
N/A
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC BY-SA 3.0 US
Size:
351 MByte
Production Status:
Newly created-finished
Use:
Discourse
Paper:
N/A
Documentation:
English documentation available in the README file that comes with the dataset.
Written
Tokenizer,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Apache 2.0
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
Yes, in English, publicly available at http://www.nltk.org/
Language Type:
Multilingual
Languages:
Hungarian
Availability:
From Owner
License:
<Not Specified>
Size:
9500 sentences
Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
50914 words
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
English
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
Language Type:
Multilingual
Languages:
Chinese
Availability:
Freely Available
License:
GPL
Size:
340 KByte
Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>




