In casual American English, "He submitted the rough draft for peer review before finalizing" sounds like "hee suhb-MIH-duhd dhuh RUHF DRAFT fer PEER ree-VYOO buh-FOR FAHY-nuh-lahy-zuhng". Several things happen here, and the headline one is the Flap T: the T between vowels turns into a quick D-like flap. Keep stressed words long, unstressed words short, and link the consonants forward into the vowels.
What makes this sentence sound American.
In "submitted", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. This is called the Flap T, a hallmark of natural-sounding American speech. It comes out as suhb-MIH-duhd.
What's happening in this sentence.
Small tricks that turn a textbook sentence into how an American actually says it.
Common pronunciation mistakes in American English.
The textbook way isn't wrong — it's just not how anyone actually says it.
Saying a hard "T" in the middle.
In "submitted", the "t" between vowels should not be a hard, fully released T. The /t/ (or a /d/ in the same position) becomes a quick tap [ɾ] that sounds like a soft D: the tongue briefly taps the ridge behind the upper teeth.
Saying a clean "dr" instead of a "j" sound.
In "draft", the "dr" cluster blends into a "jr" sound, a natural American English pronunciation: the /d/ shifts toward /dʒ/ ("j") before the R.
Releasing the final consonant with a puff of air.
In "submitted", the final consonant is not released: the airflow stops and the articulators move into position, but they hold there without the burst of air.
Pronouncing every consonant in the cluster.
When a word ends in a consonant cluster with /t/ or /d/ and the next word starts with a consonant, the /t/ or /d/ is often dropped, so the surrounding consonants flow directly together. This is common in flowing natural speech; in careful or formal speech, the sound is often kept.