
A year of nouns from English-learning infants’ daily lives: The SEEDLingS-Nouns dataset

  • Evgenii Kalenkovich
  • Sharath Koorathota
  • Shaelise Tor
  • Andrei Amatuni
  • Shannon Egan-Dailey
  • Charlotte Moore
  • Catherine Laing
  • Hallie Garrison
  • Gladys Baudet
  • Federica Bulgarelli
  • Sarp Uner
  • Lillianna Righter
  • Elika Bergelson

  • Harvard University
  • University of Rochester
  • University of Texas at Austin
  • Duke University
  • Concordia University
  • University of York

Research output: Contribution to journal › Article › peer-review

Abstract

This paper describes a dataset consisting of manually annotated nouns from a corpus of longitudinal day-long audio and hour-long video recordings collected monthly from 44 babies from age 6 months to age 17 months. This dataset was created as part of a larger project, called SEEDLingS, that examines the development of infants’ language comprehension before and after their first birthday, from earliest comprehension to the early days of word production. This paper provides an overview of the corpus, describes how and why the nouns from the corpus were annotated, and discusses considerations for the reuse of this dataset for future work. The described annotations and relevant metadata are publicly available alongside this manuscript.

Original language: English
Article number: 298
Journal: Behavior Research Methods
Volume: 57
Issue number: 11
State: Published - Nov 2025

Keywords

  • Corpus
  • Home recordings
  • Infancy
  • Language acquisition
