TY - GEN
T1 - Low-resource grapheme-to-phoneme mapping with phonetically-conditioned transfer
AU - Hammond, Michael
N1 - Publisher Copyright:
© 2023 Association for Computational Linguistics.
PY - 2023
Y1 - 2023
N2 - In this paper we explore a very simple nonneural approach to mapping orthography to phonetic transcription in a low-resource context with transfer data from a related language. We start from a baseline system and focus our efforts on data augmentation. We make three principal moves. First, we start with an HMMbased system (Novak et al., 2012). Second, we augment our basic system by recombining legal substrings in restricted fashion (Ryan and Hulden, 2020). Finally, we limit our transfer data by only using training pairs where the phonetic form shares all bigrams with the target language.
AB - In this paper we explore a very simple nonneural approach to mapping orthography to phonetic transcription in a low-resource context with transfer data from a related language. We start from a baseline system and focus our efforts on data augmentation. We make three principal moves. First, we start with an HMMbased system (Novak et al., 2012). Second, we augment our basic system by recombining legal substrings in restricted fashion (Ryan and Hulden, 2020). Finally, we limit our transfer data by only using training pairs where the phonetic form shares all bigrams with the target language.
UR - http://www.scopus.com/inward/record.url?scp=85175400254&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85175400254&partnerID=8YFLogxK
U2 - 10.18653/v1/2023.sigmorphon-1.29
DO - 10.18653/v1/2023.sigmorphon-1.29
M3 - Conference contribution
AN - SCOPUS:85175400254
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 245
EP - 248
BT - ACL 2023 - 20th SIGMORPHON Workshop on Computational Morphology, Phonology, and Phonetics, CMPP 2023
A2 - Nicolai, Garrett
A2 - Chodroff, Eleanor
A2 - Coltekin, Cagri
A2 - Mailhot, Fred
PB - Association for Computational Linguistics (ACL)
T2 - 20th SIGMORPHON Workshop on Computational Morphology, Phonology, and Phonetics, CMPP 2023, as part of ACL 2023
Y2 - 14 July 2023
ER -