Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

Americans pronounce city as SIH-tee (/ˈsɪɾi/). In "city", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. This is called the Flap T, a hallmark of natural-sounding American speech. It comes out as SIH·tee. Stress falls on the first syllable — keep everything else short and quick. You'll hear it in sentences like "What a beautiful city this is" or "He's from a city near the coast" — more examples below.
Record yourself saying "city" and play it back. The mic stays on your device — nothing's uploaded.
2 syllables, 4 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.
Click any sentence to see the full breakdown — every link, every reduction, every flap-T.
The textbook way isn't wrong — it's just not how anyone actually says it.
In "city", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.
Stress falls on the first syllable, not the others. Stretch SIH — keep everything else short and quick.