In casual American English, "He said he was feeling very sad yesterday" sounds like "hee sehd hee wuhz FEE-luhng VEH-ree SAD YEH-ster-day". Several things happen here, and the headline one is the Y-Merging (gotcha, didja): the T/D/S/Z fuses with the following Y into CH or J. Keep stressed words long, unstressed words short, and link the consonants forward into the vowels.
Now you try.
Read the sentence out loud at native speed. The mic stays on your device — nothing's uploaded.
What makes this sentence sound American.
The "" at the end of "sad" and the "y" starting "yesterday" blend together into "" — natural in casual conversation; in formal or careful speech, the two sounds stay separate. This is called the Y-Merging (gotcha, didja), what turns word-by-word reading into actual conversation. It comes out as SAD.
What's happening in this sentence.
Small tricks that turn a textbook sentence into how an American actually says it.
Tap any word for its full breakdown.
Each word has its own page with examples, common mistakes, and related words.
Common pronunciation mistakes in American English.
The textbook way isn't wrong — it's just not how anyone actually says it.
Saying the consonants separately.
The "" at the end of "" and the "y" starting "" blend together into "" — natural in casual conversation; in formal or careful speech, the two sounds stay separate. The two sounds merge: T+Y → CH, D+Y → J, S+Y → SH, Z+Y → ZH.
Pausing between the words.
The "" at the end of "" flows directly into the vowel starting "" — the consonant migrates to the next word with no pause between. Final consonant "migrates" to next word — no pause between.
Pronouncing the function word too fully.
"he" is a function word — in connected speech, the full vowel reduces to a quick "" sound and consonants may simplify. Full vowel reduces to schwa /ə/ or other weak vowel. Consonants may simplify.
Pronouncing the H clearly.
The "h" in "he" is dropped in connected speech — the preceding word's final consonant links directly to the remaining vowel — most natural in casual, rapid speech; in careful or formal speech, the H is typically kept. /h/ is dropped entirely — preceding word links directly into the remaining vowel (works after both consonants and vowels).