How to pronounce Zip /z/ vs Sip /s/ in American English

/z/
z
zip · zoo · zero · zone
vs
/s/
s
sip · see · say · sit
Start here

Z /z/ and S /s/ are identical in the mouth. Your tongue hovers just behind your upper teeth to push air through a narrow gap. The difference lives in your throat. /z/ is voiced, meaning your vocal cords vibrate to create a buzz, while /s/ is voiceless, using just a quiet hiss of air. Speakers of Spanish, Mandarin, and German often replace /z/ with /s/ at the ends of words, turning eyes into ice and his into hiss.

Side by side

How the two sounds differ.

3 small mouth adjustments. Get any one of them wrong and the sound slides into its neighbor.

Lip shape shared by zip and sip
Same lip shape. The contrast is voicing and aspiration, which you hear rather than see. Look at the dimensions table below for the difference your ear is doing.
Dimension
/z/ Zip
/s/ Sip
Tongue
Tip hovers near the alveolar ridge (the bumpy area behind the top teeth) to make a narrow groove.
Exactly the same position as /z/.
Voicing
Voiced, vocal cords vibrate. Place a hand on your throat to feel a strong buzz.
Voiceless, vocal cords don't vibrate. Hand on throat: no buzz, just a steady hiss.
Airflow
A buzzing, vibrating stream of air, like a bee.
A sharp, quiet stream of air, like a snake.
Try saying
zoo, zip, buzz, his, eyes
sue, sip, bus, hiss, ice

Now you try.

Record yourself saying "Zip" and "Sip" a few times. Listen back — your own ear is the best feedback for nailing the contrast.

Ready when you are
Tap the mic to start
Preview your accent profile

Get your accent profile and 5-axes assessment.

Sounds
75%
Clarity
68%
Stress
78%
Intonation
65%
Fluency
62%

Overall assessment

Our AI coach listens to your recording and grades 5 dimensions of pronunciation — then tells you exactly what to fix next.

72% Noticeable accent
Unlock the full report in the app
Minimal pairs

Words that change with one sound.

Every pair below differs by exactly one sound: flip /z/ to /s/ and the meaning flips with it. Tap any word for its full breakdown.

/z/ Zip
/s/ Sip
Why people mix them up

If your ear blurs them, here's why.

The biggest trap with these two sounds is spelling, not anatomy. In American English, the letter S is pronounced as a buzzing /z/ almost as often as it's pronounced as a hissing /s/. Every time you make a word plural after a voiced sound (dogs, shoes), or use basic verbs like is, was, has, and does, you need a /z/. Many languages, like Spanish and Mandarin, either don't use the /z/ sound at all or never allow it at the ends of words. German speakers actively swap them, turning an initial S into /z/ and a final Z into /s/. When learners trust the spelling, they end up using a sharp /s/ for words like please and boys, which sounds abrupt and confusing to American ears.

How to practice

Train the muscle, then the ear.

4 short drills. Do them out loud: feel the change inside your mouth before you try to hear it.

Place two fingers on your throat and say a long ssssss. You shouldn't feel anything. Now switch to zzzzzz without moving your tongue. You should instantly feel a strong vibration. Switch back and forth: sss-zzz-sss-zzz.

Use the vowel length trick: vowels are always slightly longer before a voiced consonant like /z/. Say eyes and stretch out the vowel, then finish with a gentle buzz. Now say ice with a short, clipped vowel and a sharp hiss.

Read minimal pairs out loud, focusing on the endings: eyes / ice, bus / buzz, plays / place, laws / loss. Make sure the words with /z/ have a distinct throat vibration at the end.

Practice common sight words where S makes a /z/ sound. Say out loud: is, was, has, does, these, those, because. Getting the buzz right on these everyday words is a simple way to make your connected speech sound much more fluid.

FAQ

Common questions about Zip vs Sip.

Why does the letter S sometimes sound like Z?
It happens because of a strict English pronunciation rule. When the grammatical "-s" ending (used for plurals, verbs, and possessives) is added to a word ending in a voiced sound (a vowel or a voiced consonant), it takes on the voiced /z/ sound. That's why the S in dogs and boys buzzes, but the S in cats and books hisses. Your vocal cords are already vibrating for the vowel or the G/B, so they just keep vibrating right through the ending. This rule is mandatory in English at every speed and register.
Do I really need to pronounce the /z/ at the end of words like "is" and "was"?
Yes. Pronouncing is as iss or was as wass is a very noticeable marker of a foreign accent. These tiny function words glue English sentences together, and Americans expect them to have a soft, buzzing /z/. If you replace them with a sharp /s/, your speech sounds choppy and abrupt. Smoothing out these little words with a /z/ makes your English sound much more fluid.
How can I tell if a word ends in /s/ or /z/ just by looking at it?
If the S is an added ending (a plural or a verb), look at the sound right before it. If the base word ends in a voiceless sound like P, T, K, or F (cups, hats, takes), the S is a hissing /s/. If it ends in a vowel or a voiced consonant like B, D, G, L, M, N, or R (shoes, beds, dogs, calls), the S is a buzzing /z/. However, for base words that just happen to end in the letter S (like bus, yes, this, us, gas, famous), the S is almost always a hissing /s/, even after a vowel (with a few very common exceptions like is, was, his, and has). The voicing rule only applies to suffix endings, not to the original spelling of standalone words.

Master Zip vs Sip with an AI coach.

SayWaader is the AI pronunciation coach for American English. Practice 5 minutes a day. Get a 5-axes accent assessment. Sound like you live here.