Published November 21, 2024 | Version v1
Journal article Open

Using computational modeling to validate the onset of productive determiner–noun combinations in English-learning children

  • 1. University of Amsterdam
  • 2. University of Chicago
  • 3. Allen Institute for AI
  • 4. Tilburg University

Description

Language is a productive system––we routinely produce well-formed utterances that we have never heard before. It is, however, difficult to assess when children first achieve linguistic productivity simply because we rarely know all the utterances a child has experienced. The onset of linguistic productivity has been at the heart of a long-standing theoretical question in language acquisition––do children come to language learning with abstract categories that they deploy from the earliest moments of acquisition? We address the problem of when linguistic productivity begins by marrying longitudinal behavioral observations and computational modeling to capitalize on the strengths of each. We used behavioral data to assess when a sample of 64 English-learning children began to productively combine determiners and nouns, a linguistic construction previously used to address this theoretical question. After the onset of productivity, the children produced determiner–noun combinations that were not attested in our sample of their linguistic input from caregivers. We used computational techniques to model the onsets and trajectories of determiner–noun combinations in these 64 children, as well as characteristics of their utterances in which the determiner was omitted. Because we knew exactly what input the model was trained on, we could, with confidence, know that the model had gone beyond its input. The parallels found between child and model in the timing and number of novel combinations suggest that the children too were creatively going beyond their input.

Data availability

Some study data are available. We will share all code associated with this study, along with in-depth descriptive summaries of the behavioral data underlying the models; however, we do not yet have consent to share the raw child language data previously collected and reported in (15).

Files

alhama-et-al-2024-using-computational-modeling-to-validate-the-onset-of-productive-determiner-noun-combinations-in.pdf

Files (875.6 kB)

Name Size Download all
Article
md5:60ad3480c3ef0271601cda73cbc963a1
619.2 kB Preview Download
Supporting information
md5:29fa93f9e557bfbc9ad6073e6986bf60
256.4 kB Preview Download

Additional details

Identifiers

DOI
10.1073/pnas.2316527121
Other
oai:uchicago.tind.io:14098

Funding

National Institute of Child Health and Human Development
P01 HD 40605
Institute of Education Sciences
R305A190467

UChicago Information

Division(s)
Social Sciences Division
Department(s)
Comparative Human Development, Education, Psychology