How to pronounce photos in American English

IPA /ˈfoʊɾoʊz/ Syllables 2 · foh·tohz Stress 1st syllable
FOH·tohz
Start here

Americans pronounce photos as FOH-tohz (/ˈfoʊɾoʊz/). In "photos", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. This is called the Flap T, a small move that separates 'classroom' from 'native'. It comes out as FOH·tohz. Stress falls on the first syllable — keep everything else short and quick. You'll hear it in sentences like "Let's find some old photos" or "I felt a wave of nostalgia when I saw my old photos" — more examples below.

Now you try.

Record yourself saying "photos" and play it back. The mic stays on your device — nothing's uploaded.

Ready when you are
Tap the mic to start
Preview your accent profile

Get your accent profile and 5-axes assessment.

Sounds
75%
Clarity
68%
Stress
78%
Intonation
65%
Fluency
62%

Overall assessment

Our AI coach listens to your recording and grades 5 dimensions of pronunciation — then tells you exactly what to fix next.

72% Noticeable accent

Common mistakes

Saying a hard "T" in the middle.

In "photos", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.

Stressing the wrong syllable.

Stress falls on the first syllable, not the others. Stretch FOH — keep everything else short and quick.

Unlock the full report in the app
Sound by sound

Every sound in "photos".

2 syllables, 5 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.

f/f/

Lift your bottom lip to touch the very bottom of your top front teeth. Blow air through this contact point without voicing.

Mouth position for /f/ as in FAN
oh/oʊ/

Start with your mouth slightly open, then close your jaw slightly as your lips round. Shift your tongue back slightly, then stretch the back up.

t/t/
Flap

Quickly bounce the front of your tongue against the roof of your mouth. Don't stop the airflow — just a quick tap.

Mouth position for /t/ as in TEN
oh/oʊ/

Start with your mouth slightly open, then close your jaw slightly as your lips round. Shift your tongue back slightly, then stretch the back up.

z/z/

Same position as S, but add vocal cord vibration. Feel the buzz.

Mouth position for /z/ as in ZOO
In real conversation

Hear "photos" in the wild.

Click any sentence to see the full breakdown — every link, every reduction, every flap-T.

"I felt a wave of nostalgia when I saw my old photos."
ahy FEHLT uh WAYV uhv nah·STAL·juh wehn ahy SAH mahy OHLD FOH·dohz
"Let's find some old photos."
LEHTS FAHYND suhm OHLD FOH·dohz
Find another

Looking for a different word or sentence?

Search the entire library
/
Press / anywhere to focus the search box.
Watch out

Common pronunciation mistakes in American English.

The textbook way isn't wrong — it's just not how anyone actually says it.

01

Saying a hard "T" in the middle.

In "photos", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.

FOH-tohzFOH·tohz
02

Stressing the wrong syllable.

Stress falls on the first syllable, not the others. Stretch FOH — keep everything else short and quick.

foh·TOHZFOH·tohz
Questions

Questions people ask about this.

How is "photos" stressed in American English?
Stress falls on the first syllable — say "FOH" with a longer, fuller vowel and keep every other syllable short and quick. The respell "FOH-tohz" marks the stressed syllable in capitals so the rhythm is easy to read at a glance.
Why doesn't the T sound like a T in "photos"?
In American English, when /t/ sits between two vowels with the second one unstressed, it turns into a quick D-like flap. So "photos" sounds closer to "FOH-tohz" than to a crisp-T pronunciation. This is the flap-T rule, one of the most distinctive sounds of casual American speech.
Is the American pronunciation of "photos" different from British English?
American English uses different vowel shapes, a relaxed retroflex R, and connected-speech tricks like flap-T and glottal-stop T that British Received Pronunciation generally avoids. The respell "FOH-tohz" reflects the casual American form; British dictionaries typically print a citation form with crisper consonants and different vowel choices.

Stop reading about "photos". Start saying it.

SayWaader is the AI pronunciation coach for American English. Practice 5 minutes a day. Get a 5-axes accent assessment. Sound like you live here.