How to pronounce Vowel-to-Vowel Linking V–V in American English
A brief glide (y or w) bridges two vowels for smooth flow.
When one word ends with a vowel and the next starts with one, American English slips a brief glide in to bridge them — Y or W, depending on what your lips are doing. Front/spread vowels like /i/ trigger a Y glide: see it → see-yit. Back/rounded vowels like /oʊ/ trigger a W glide: go out → go-wout. Do it uses W; the answer uses Y. The glide is subtle enough that most speakers don't notice they're making it, but it's what keeps the airflow going instead of dropping into a glottal stop.
Watch it happen in real phrases.
Three example phrases showing exactly when this rule fires.
see it
See ends on a high front /i/ — your tongue is already arched up and forward, the same position that makes a /j/ (Y). As you transition to /ɪ/ in it, that tongue position naturally produces a fleeting Y glide. The result lands as see-yit, one continuous voiced phrase.
go out
Go ends on the rounded /oʊ/ diphthong — your lips are pushed forward and almost closed. As they open into the /aʊ/ of out, the brief lip-rounded vibration produces an automatic /w/. Same physics as so easy, too often, no other.
the answer
The changes pronunciation before a vowel: from /ðə/ (with schwa) to /ði/ (rhyming with see). Once it ends in the high front /i/, it triggers a Y glide just like see does — thee-yanswer. Same shift in the end (thee-yend), the apple (thee-yapple). The glide isn't coming from the schwa; it's coming from the /i/ that replaces the schwa before vowels.
Where two words run together.
Real phrases where this rule fires across the word boundary.
In real American conversation.
Listen for any phrase where two vowels would otherwise butt against each other. Do it (do-wit), the end (the-yend), too often (too-woften), so easy (so-weasy), he is (he-yiz) — the glide is automatic. Stop the airflow in your throat between the vowels instead and you get a textbook glottal break that immediately reads as non-native.
Five sentences where Y and W bridge the gap.
Each one runs a vowel-final word straight into a vowel-initial word — tap to hear the glide land instead of a stop.