Published November 21, 2024
| Version v1
Journal article
Open
Using computational modeling to validate the onset of productive determiner–noun combinations in English-learning children
Creators
- 1. University of Amsterdam
- 2. University of Chicago
- 3. Allen Institute for AI
- 4. Tilburg University
Description
Language is a productive system––we routinely produce well-formed utterances that we have never heard before. It is, however, difficult to assess when children first achieve linguistic productivity simply because we rarely know all the utterances a child has experienced. The onset of linguistic productivity has been at the heart of a long-standing theoretical question in language acquisition––do children come to language learning with abstract categories that they deploy from the earliest moments of acquisition? We address the problem of when linguistic productivity begins by marrying longitudinal behavioral observations and computational modeling to capitalize on the strengths of each. We used behavioral data to assess when a sample of 64 English-learning children began to productively combine determiners and nouns, a linguistic construction previously used to address this theoretical question. After the onset of productivity, the children produced determiner–noun combinations that were not attested in our sample of their linguistic input from caregivers. We used computational techniques to model the onsets and trajectories of determiner–noun combinations in these 64 children, as well as characteristics of their utterances in which the determiner was omitted. Because we knew exactly what input the model was trained on, we could, with confidence, know that the model had gone beyond its input. The parallels found between child and model in the timing and number of novel combinations suggest that the children too were creatively going beyond their input.
Data availability
Some study data are available. We will share all code associated with this study, along with in-depth descriptive summaries of the behavioral data underlying the models; however, we do not yet have consent to share the raw child language data previously collected and reported in (15).Files
alhama-et-al-2024-using-computational-modeling-to-validate-the-onset-of-productive-determiner-noun-combinations-in.pdf
Files
(875.6 kB)
| Name | Size | Download all |
|---|---|---|
|
Article md5:60ad3480c3ef0271601cda73cbc963a1 |
619.2 kB | Preview Download |
|
Supporting information md5:29fa93f9e557bfbc9ad6073e6986bf60 |
256.4 kB | Preview Download |
Additional details
Identifiers
- DOI
- 10.1073/pnas.2316527121
- Other
- oai:uchicago.tind.io:14098
Funding
- National Institute of Child Health and Human Development
- P01 HD 40605
- Institute of Education Sciences
- R305A190467