Americans pronounce "What are you going to do now?" as "WUHT er yuh GOH-uhng tuh DOO NOW" in casual speech. Two things bend the textbook pronunciation. The headline is the Flap T Across Words — the T at the end of one word flaps into the vowel that starts the next. It lands on what, what turns word-by-word reading into actual conversation. Keep stressed words long, unstressed words short, and link the consonants forward into the vowels.
Now you try.
Read the sentence out loud at native speed. The mic stays on your device — nothing's uploaded.
What's happening in this sentence.
Small tricks that turn a textbook sentence into how an American actually says it.
Looking for a different word or sentence?
Common pronunciation mistakes in American English.
The textbook way isn't wrong — it's just not how anyone actually says it.
Hard T at the end of a word, not a flap.
The "t" at the end of "what" links to the vowel starting "are" — it flaps to sound like a quick "d", with the tongue briefly tapping the ridge behind the upper teeth. Same flap as within-word (R1) but spanning two words.
Pronouncing the function word too fully.
"are" is a function word — in connected speech, the full vowel reduces to a quick "er" sound and consonants may simplify. Full vowel reduces to schwa /ə/ or other weak vowel. Consonants may simplify.