How to pronounce It is a pity that the kitten bit the mitten. in American English

Words 10 Difficulty Intermediate Featured sound Glottal T
iht it ihz is uh a PIH·dee pity dhuht that dhuh the KIH·tuhn kitten BIHT bit dhuh the MIH·tuhn mitten
Start here

Americans pronounce "It is a pity that the kitten bit the mitten" as "iht ihz uh PIH-dee dhuht dhuh KIH-tuhn BIHT dhuh MIH-tuhn" in casual speech. Several things bend the textbook pronunciation. The headline is the Glottal T — the T closes off into a tiny silent pause instead of a clean release. You'll hear it on kitten and again on mitten — and it's why Americans sound more relaxed than the textbook. Keep stressed words long, unstressed words short, and link the consonants forward into the vowels.

Now you try.

Read the sentence out loud at native speed. The mic stays on your device — nothing's uploaded.

Ready when you are
Tap the mic to start
Preview your accent profile

Get your accent profile and 5-axes assessment.

Sounds
75%
Clarity
68%
Stress
78%
Intonation
65%
Fluency
62%

Overall assessment

Our AI coach listens to your recording and grades 5 dimensions of pronunciation — then tells you exactly what to fix next.

72% Noticeable accent

Common mistakes

Saying a hard "T" in the middle.

In "pity", the "d" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.

Releasing the T before the syllabic N.

In "kitten", the "t" before the syllabic nasal becomes a glottal stop — a catch in the throat where the schwa drops and the nasal becomes syllabic. /t/ becomes a glottal stop [ʔ] — a catch in the throat. The schwa in the following syllable is dropped, making the nasal syllabic.

Unlock the full report in the app
The breakdown

What's happening in this sentence.

Small tricks that turn a textbook sentence into how an American actually says it.

ɾ
Flap T Across Words between "it" & "is"The "t" at the end of "it" links to the vowel starting "is" — it flaps to sound like a quick "d", with the tongue briefly tapping the ridge behind the upper teeth.
→ə
Reduced Words (to, for, of) in "it""it" is a function word — in connected speech, the full vowel reduces to a quick "iht" sound and consonants may simplify.
C–V
Consonant-to-Vowel Linking between "is" & "a"The "z" at the end of "is" flows directly into the vowel starting "a" — the consonant migrates to the next word with no pause between.
t→ɾ
Flap T in "pity"In "pity", the "d" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth.
Unreleased Stops in "that"In "that", the "t" is not released — the articulators get into position but hold without the burst of air.
t→ʔ
Glottal T in "kitten"In "kitten", the "t" before the syllabic nasal becomes a glottal stop — a catch in the throat where the schwa drops and the nasal becomes syllabic.
Word by word

Tap any word for its full breakdown.

Each word has its own page with examples, common mistakes, and related words.

Find another

Looking for a different word or sentence?

Search the entire library
/
Press / anywhere to focus the search box.
Watch out

Common pronunciation mistakes in American English.

The textbook way isn't wrong — it's just not how anyone actually says it.

01

Saying a hard "T" in the middle.

In "pity", the "d" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.

PIH-teePIH·dee
02

Releasing the T before the syllabic N.

In "kitten", the "t" before the syllabic nasal becomes a glottal stop — a catch in the throat where the schwa drops and the nasal becomes syllabic. /t/ becomes a glottal stop [ʔ] — a catch in the throat. The schwa in the following syllable is dropped, making the nasal syllabic.

KIH-tuhnKIH·tuhn
03

Releasing the final consonant with a puff of air.

In "that", the "t" is not released — the articulators get into position but hold without the burst of air. Air stops but there's no release burst — the articulators hold position.

dhuhtdhuht
04

Inserting a vowel before the syllabic consonant.

In "kitten", the short unstressed vowel before "n" disappears — the schwa is absorbed and the "n" becomes the syllable nucleus on its own. Schwa is absorbed — consonant becomes the syllable nucleus.

KIH-tuhnKIH·tuhn
Questions

Questions people ask about this.

Why do the T sounds turn into D-like sounds in this sentence?
That's the flap-T rule: when /t/ sits between two vowels — inside a single word, or across the boundary between two words — Americans replace the crisp T with a quick D-like flap. It's one of the most distinctive sounds of casual American speech and one of the first things to copy if you want to sound less textbook.
Why does the T in "kitten" sound silent here?
It isn't fully silent — the T closes off into a tiny throat catch (a glottal stop), then the next sound continues. Americans replace clean-T with this glottal-stop T whenever /t/ sits at the end of a stressed syllable before an N or a similar consonant. The textbook T release sounds over-articulated in everyday speech.
Why is "it" said so quickly in this sentence?
Function words — articles, prepositions, auxiliaries, pronouns — reduce to short, unstressed schwa shapes in casual American speech. Pronouncing them fully like the dictionary entry is a dead giveaway of a textbook accent. Native speakers stress only the content words and let everything else collapse.
How are the words connected in casual American speech?
Americans don't pause between words. A consonant at the end of one word links forward into the vowel that starts the next; two vowels in a row get bridged by a tiny W or Y glide; an identical consonant repeated across a word boundary is held just once. The result is a continuous flow rather than a textbook word-by-word delivery.

Practice this sentence with an AI coach.

SayWaader is the AI pronunciation coach for American English. Practice 5 minutes a day. Get a 5-axes accent assessment. Sound like you live here.