10000 results found.
Written
Corpus,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
2 MByte
Production Status:
Existing-used
Use:
Document Classification, Text categorisation
Paper:
N/A
Documentation:
<Not Specified>
Not Applicable
Tagger/Parser,
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
Freely Available
License:
GNU
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Parsing and Tagging
Paper:
N/A
Documentation:
<Not Specified>
Transcribed speech
telephone conversations,
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
LDC
Size:
9.61 Gbyte Other
Production Status:
conversational corpus
Use:
speech recognition, speaker identification
Paper:
N/A
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC97S62/
Word Alignment Tool,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
<Not Specified>
Use:
Paper:
N/A
Documentation:
<Not Specified>
Speech/Written
semi-automatic transliterator,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>
Production Status:
Existing-used
Use:
Transliteration
Paper:
N/A
Documentation:
<Not Specified>
Speech/Written
Language Modeling Tool,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>
Production Status:
Newly created-in progress
Use:
Language Modelling
Paper:
N/A
Documentation:
Publicly available documentation is provided in English on GitHub.
Written
Optical Character Recognition,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
Freely Available
License:
Apache License 2.0
Size:
2.4 MByte
Production Status:
Existing-used
Use:
Optical Character Recognition
Paper:
N/A
Documentation:
GitHub repository: https://github.com/tesseract-ocr/tesseract; "Training Tesseract" (English) on GitHub: https://github.com/tesseract-ocr/tesseract/wiki/TrainingTesseract; manpage in English: https://tesseract-ocr.googlecode.com/svn/trunk/doc/tesseract.1.html
<Not Specified>
Tagger/Parser,
Language Type:
Language Independent
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-used
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified>
Production Status:
Existing-updated
Use:
General LT infrastructure
Paper:
N/A
Documentation:
<Not Specified>
Language Type:
Monolingual
Languages:
<Not Specified>
Availability:
From Owner
License:
<Not Specified>
Size:
180 synsets
Production Status:
Newly created-in progress
Use:
Temporal IR/NLP applications
Paper:
N/A
Documentation:
<Not Specified>




