A baby's first word typically arrives around their first birthday — and parents understandably treat it as the beginning of language development. But language development begins at birth, possibly earlier. By the time a baby says 'mama,' they have spent twelve months building the perceptual, social, and neural architecture that makes spoken language possible. Understanding this hidden work changes how parents can support it.
Newborns arrive with a remarkable statistical learning system. Research by Patricia Kuhl at the University of Washington showed that fetuses in the third trimester have already begun to process the specific sound patterns of their mother's native language. Newborns prefer their mother's voice over other voices, and prefer the language spoken during pregnancy over foreign languages.
In the first two months, babies communicate through crying, and parents should respond consistently — not to 'spoil' the baby, but because responsive caregiving is the first lesson in conversational turn-taking: you communicate, someone responds, communication works.
At 6–8 weeks, most babies begin social smiling — smiling in response to a human face. Shortly after, cooing begins: soft, vowel-like sounds produced during quiet, alert states. These are not random vocalizations. High-speed video research shows that babies modulate their cooing in response to parental voice — pausing when the parent speaks, then cooing again. They are practicing conversational turn-taking.
Singing to babies at this stage is particularly powerful. The stretched vowels and exaggerated pitch contours of infant-directed singing help babies map their own cooing onto the phonological structure of language.
Around 4 months, babies begin canonical babbling — strings of consonant-vowel combinations like 'bababababa' or 'mamamama.' This is a significant milestone: it requires control of the lips, tongue, and jaw in coordination with vocalization — a complex motor task.
Babies babble more with caregivers who respond contingently to their babbling. Studies show that infants whose babbles are answered with matching sounds develop more complex and linguistically diverse babbling patterns. In other words, conversation practice — even before words — accelerates language development.
Between 6 and 9 months, receptive language — the ability to understand words and phrases before producing them — begins to develop rapidly. Many babies recognize their own name by 6 months and understand 'no' by 8 months.
This is also when the 'word gap' research becomes relevant. Hart and Risley's landmark longitudinal study found that the number of words children hear per day by age 3 strongly predicts vocabulary and school readiness. Songs are extraordinarily efficient word-delivery systems — a single short song repeats its vocabulary many times per play session, and children who hear the same songs repeatedly accumulate substantial vocabulary exposure.
In the months before a first word, babies show a cluster of milestones that indicate language readiness: joint attention (following a caregiver's gaze or pointing gesture), proto-declarative pointing (pointing at things to share attention, not just to request), and recognizable word-like vocalizations that are used consistently for specific objects or people.
First words typically emerge between 10 and 14 months. They are usually nouns for objects of high personal relevance (names, 'ball,' 'dog,' 'more') or social-pragmatic words ('hi,' 'bye,' 'no'). The words babies say first are the words they have heard most often in meaningful contexts.
Songs work at every stage of this development because they deliver language in a multi-sensory, emotionally engaged format that the infant brain is specifically calibrated to process. Specifically:
- •Birth–2 months: Lullabies provide emotional security and introduce the music of language (prosody, rhythm, pitch) before words are processed
- •2–4 months: Action songs with face and eye contact support social-communicative development
- •4–6 months: Call-and-response songs train turn-taking and contingent vocalization
- •6–9 months: Songs with repeated vocabulary efficiently build receptive vocabulary
- •9–12 months: Songs with gaps ('Old MacDonald had a farm, E-I-E-I-...') invite filling-in, a precursor to word production
