Is there a bilingual disadvantage for word segmentation? A computational modeling approach

Laia Fibla; Nuria Sebastian-Galles; Alejandrina Cristia

doi:10.1017/S0305000921000568

Is there a bilingual disadvantage for word segmentation? A computational modeling approach

J Child Lang. 2022 Nov;49(6):1119-1146. doi: 10.1017/S0305000921000568. Epub 2021 Nov 3.

Authors

Laia Fibla^{1

2}, Nuria Sebastian-Galles³, Alejandrina Cristia²

Affiliations

¹ School of Psychology, The University of East Anglia, UK.
² Laboratoire de Sciences Cognitives et de Psycholinguistique, Département d'études cognitives, ENS, EHESS, CNRS, PSL University, France.
³ Center for Brain and Cognition, Universitat Pompeu Fabra, Spain.

PMID: 39656025
DOI: 10.1017/S0305000921000568

Abstract

Since there are no systematic pauses delimiting words in speech, the problem of word segmentation is formidable even for monolingual infants. We use computational modeling to assess whether word segmentation is substantially harder in a bilingual than a monolingual setting. Seven algorithms representing different cognitive approaches to segmentation are applied to transcriptions of naturalistic input to young children, carefully processed to generate perfectly matched monolingual and bilingual corpora. We vary the overlap in phonology and lexicon experienced by modeling exposure to languages that are more similar (Catalan and Spanish) or more different (English and Spanish). We find that the greatest variation in performance is due to different segmentation algorithms and the second greatest to language, with bilingualism having effects that are smaller than both algorithm and language effects. Implications of these computational results for experimental and modeling approaches to language acquisition are discussed.

Keywords: computational modeling; infancy; word segmentation.

Abstract

Grants and funding