7693 results found.
Lectra
Speech
Corpus,
IS2013
Expand/Collapse
Language Type:
Multilingual
Languages:
Portuguese
Availability:
Partially available
License:
<Not Specified>
Size:
32 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
Paper:
N/A
Documentation:
<Not Specified>
Event Extractor
Written
Evaluation Tool,
COLING2010
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
Paper:
N/A
Documentation:
<Not Specified>
multilingual_multidomain_dataset
Written
Corpus,
LREC2018
Expand/Collapse
Language Type:
Multilingual
Languages:
English italian
Availability:
Freely Available
License:
CreativeCommons (strictly for research use only)
Size:
444744 users OtherProduction Status:
Newly created-in progress
Use:
Knowledge Discovery/Representation
Paper:
N/A
Documentation:
The documentation is written in english and located in the same folder of the dataset, publicly available
SQUAD question generation dataset
Not Applicable
Corpus,
COLING2018
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC BY-SA 4.0
Size:
30 MByte Production Status:
Existing-used
Use:
Question Answering
Paper:
N/A
Documentation:
<Not Specified>
FarsNet
Written
Not Assigned,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
From Owner
License:
<Not Specified>
Size:
9266 Production Status:
Existing-used
Use:
Evaluation/Validation
Paper:
N/A
Documentation:
<Not Specified>
WordNet
Written
Lexicon,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Anaphora, Coreference
Paper:
N/A
Documentation:
<Not Specified>
hrWac
Written
Corpus,
LREC2014
Expand/Collapse
Language Type:
Multilingual
Languages:
Croatian
Availability:
From Data Center(s)
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
Hand aligned data
Written
Evaluation Data,
NAACL2013
Expand/Collapse
Language Type:
Multilingual
Languages:
English Turkish
Availability:
Freely Available
License:
<Not Specified>
Size:
75 Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
Paper:
N/A
Documentation:
<Not Specified>
HSK corpus
Written
Corpus,
COLING2012
Expand/Collapse
Language Type:
Multilingual
Languages:
Mandarin Chinese
Availability:
Freely Available
License:
<Not Specified>
Size:
2.83 million Production Status:
Existing-used
Use:
Computer Assisted Language Learning
Paper:
N/A
Documentation:
Documents are in the web site and in Chinese.
Arabic Segmentation test set
Written
Corpus,
LREC2016
Expand/Collapse
Previous
|
Next
Language Type:
Multilingual
Languages:
Standard Arabic
Availability:
From Owner
License:
Research
Size:
387 KByte Production Status:
Newly created-finished
Use:
Morphological Analysis
Paper:
N/A
Documentation:
<Not Specified>