Americans pronounce "Let's try to fix this broken screen" as "LEHTS TRAHY tuh FIHKS dhihs BROH-kuhn SKREEN" in casual speech. Several things bend the textbook pronunciation. The headline is the TR Sounds Like CHR — the TR sounds more like CH than two crisp consonants. It lands on try, a hallmark of natural-sounding American speech. Keep stressed words long, unstressed words short, and link the consonants forward into the vowels.
Now you try.
Read the sentence out loud at native speed. The mic stays on your device — nothing's uploaded.
What's happening in this sentence.
Small tricks that turn a textbook sentence into how an American actually says it.
Tap any word for its full breakdown.
Each word has its own page with examples, common mistakes, and related words.
Looking for a different word or sentence?
Common pronunciation mistakes in American English.
The textbook way isn't wrong — it's just not how anyone actually says it.
Saying a clean "tr" instead of a "ch" sound.
In "try", the "t" cluster blends into a "chr" sound — a natural American English pronunciation. /t/ shifts toward /tʃ/ ("ch"), so TR sounds like "chr".
Inserting a vowel before the syllabic consonant.
In "broken", the short unstressed vowel before "n" disappears — the schwa is absorbed and the "n" becomes the syllable nucleus on its own. Schwa is absorbed — consonant becomes the syllable nucleus.
Pronouncing the function word too fully.
"to" is a function word — in connected speech, the full vowel reduces to a quick "tuh" sound and consonants may simplify. Full vowel reduces to schwa /ə/ or other weak vowel. Consonants may simplify.
Pronouncing the vowel inside the contraction.
In fast speech, the vowel in "let's" vanishes — the "eh" is completely elided, leaving only a quick "ts" cluster — this is a feature of casual, connected speech; in careful speech, the vowel is retained. In single-syllable -ts contractions (it's = it + is, that's = that + is, what's = what + is, let's = let + us), the unstressed vowel of the enclitic ("is" /ɪ/ or "us" /ə/) is completely elided in fast speech, leaving only the final /ts/ cluster.