Conference: ACL 2020

Year: 2020
Description: VoxClamantis v1.0 is the first large-scale corpus for phonetic typology, with aligned segments and estimated phoneme-level labels in 690 recorded spoken readings of the Bible spanning 635 languages, along with acoustic-phonetic measures of vowels and sibilants. For 57 readings, language-specific resources were used to estimate phoneme labels. Access to such data can greatly facilitate investigation of phonetic typology at a large scale and across many languages. Our corpus and scripts are publicly available for non-commercial use at https://voxclamantisproject.github.io.
URL: http://voxclamantisproject.github.io/