925 results found.
Written
Lexicon,
Language Type:
Monolingual
Languages:
Arabic
Availability:
Freely Available
License:
OpenSource
Size:
78915 words
Production Status:
Newly created-finished
Use:
Opinion Mining/Sentiment Analysis
- Paper title: Toward Qualitative Evaluation of Embeddings for Arabic Sentiment Analysis
- Paper track: Evaluation/oral presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Amira Barhoumi | ArSentLex | /N |
Documentation:
English documentation will be publicly available.
Written
Lexicon,
Language Type:
Multilingual
Languages:
Acadian Aragonese Aromanian Asturian Auvergnat Campidanese Sardinian
Availability:
Freely Available
License:
GPLv3
Size:
102698 words
Production Status:
Newly created-finished
Use:
Study of language evolution
- Paper title: Opening the Romance Verbal Inflection Dataset 2.0: A CLDF lexicon
- Paper track: Written/poster presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sacha Beniamine | Romance Verbal Inflection Datase | /N |
Documentation:
Publicly available English documentation.
Written
Lexicon,
Language Type:
Multilingual
Languages:
Dutch English French
Availability:
From Owner
License:
Size:
51220 entries
Production Status:
Newly created-finished
Use:
Evaluation/Validation
- Paper title: Identifying Cognates in English-Dutch and French-Dutch by means of Orthographic Information and Cross-lingual Word Embeddings
- Paper track: Written/oral presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Els Lefever | Gold Standard for Cognate Pairs in English-Dutch and French-Dutch | /N |
Documentation:
Labat, S., Vandevoorde, L., and Lefever, E. (2019). Annotation Guidelines for Labeling English-Dutch Cognate Pairs, version 1.0. Technical report, Ghent University, LT3 15-01
Written
Lexicon,
Language Type:
Monolingual
Languages:
Afrikaans Albanian Arabic Armenian Bangla Basque Bosnian Breton Bulgarian Catalan Croatian Czech Danish Dutch English Esperanto Estonian Filipino Finnish French Galician Georgian German Greek Hebrew Hindi Hungarian Icelandic Indonesian Italian Japanese Kazakh Korean Latvian Lithuanian Macedonian Malay Malayalam Norwegian Persian Polish Portuguese Romanian Russian Serbian Sinhala Slovak Slovenian Spanish Swedish Tamil Telugu Thai Turkish Ukrainian Urdu Vietnamese pt_br ze_en ze_zh zh_cn zh_tw
Availability:
Freely Available
License:
CreativeCommons Attribution 4.0 International
Size:
41 GByte
Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: word2word: A Collection of Bilingual Lexicons for 3,564 Language Pairs
- Paper track: Written/oral presentation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yo Joong Choe | word2word | /N |
Documentation:
Yes, on the website.
Written
Lexicon,
Language Type:
Multilingual
Languages:
Abron Acehnese Afar Arabic Baharna Arabic Mesopotamian Arabic
Availability:
Freely Available
License:
CreativeCommons
Size:
50000 tokens
Production Status:
Newly created-in progress
Use:
Text Mining
- Paper title: Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus
- Paper track: Long paper/
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Isaac Caswell | TF-IDF-IIF top100 wordlists | /N |
Documentation:
https://github.com/google-research-datasets/TF-IDF-IIF-top100-wordlists
Lexicon,
Language Type:
Monolingual
Languages:
highly multilingual
Availability:
Freely Available
License:
Size:
None
Production Status:
Newly created-finished
Use:
- Paper title: Learning and Evaluating Emotion Lexicons for 91 Languages
- Paper track: Long/Resources and Evaluation
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sven Buechel | MEmoLon – The Multilingual Emotion Lexicon | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Multilingual
Languages:
English French German Italian Spanish
Availability:
Freely Available
License:
Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Size:
None
Production Status:
Newly created-in progress
Use:
Word Sense Disambiguation
- Paper title: CluBERT: A Cluster-Based Approach for Learning Sense Distributions in Multiple Languages
- Paper track: Long/Semantics: Lexical
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bianca Scarlini | CluBERT Distributions | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Opinion Mining/Sentiment Analysis
- Paper title: Agreement Prediction of Arguments in Cyber Argumentation for Detecting Stance Polarity and Intensity
- Paper track: Long/Sentiment Analysis, Stylistic Analysis, and Argum
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joseph Sirrianni | NRC opinion lexicon | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Emotion Recognition/Generation
- Paper title: Agreement Prediction of Arguments in Cyber Argumentation for Detecting Stance Polarity and Intensity
- Paper track: Long/Sentiment Analysis, Stylistic Analysis, and Argum
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joseph Sirrianni | SentiWordNet 3.0 | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Emotion Recognition/Generation
- Paper title: Agreement Prediction of Arguments in Cyber Argumentation for Detecting Stance Polarity and Intensity
- Paper track: Long/Sentiment Analysis, Stylistic Analysis, and Argum
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joseph Sirrianni | MPQA subjectivity lexicon | /N |
Documentation:
None




