Americans pronounce "canyon" as KAN-yuhn (/ˈkænjən/). The "a" vowel before M or N raises and fronts toward [eə]: the tongue pulls up and forward, breaking the vowel into a tense glide as it anticipates the nasal. This is called the Cat-Vowel Before M/N, a hallmark of natural-sounding American speech. Stress falls on the first syllable, so keep the second syllable short and quick. You'll hear it in sentences like "The canyon walls are steep and rocky" or "The canyon was formed by the erosion of the river over millions of years".
Record yourself saying "canyon" and play it back.
2 syllables, 6 sounds.
Raise the back of your tongue to touch the soft palate (velum). Stop the air, then release.

The tongue relaxes down in the back and the corners of the lips relax before the consonant. This adds a schwa-like 'uh' relaxation after the /æ/. Think of it as 'relaxing out of the vowel' — it is no longer a pure /æ/ sound.

Touch the tip or front edge of your tongue to the roof of your mouth behind your teeth. Air flows through your nose.

Lift the middle of your tongue toward the roof of your mouth, but stop just short of touching. /j/ is an approximant, not a stop. The tongue tip stays down, lightly resting near the back of your bottom front teeth. Voice runs through the whole gesture, and the tongue glides smoothly down into the next vowel. The lips stay neutral or pre-shape for the upcoming vowel (rounding early for OO in "youth", for example).

Relax your lips, jaw, and tongue completely. Drop your jaw slightly and keep the tongue neutral.
The schwa before N disappears — N becomes the vowel of the syllable. Go straight from the previous consonant to N.

The textbook way isn't wrong — it's just not how anyone actually says it.
In "canyon", the /æ/ vowel raises and fronts before the nasal: the tongue pulls up and forward, breaking the vowel into a tense [eə] glide (moving from /e/ toward /ə/) as it anticipates the N. The result is not a pure /æ/.
In "canyon", the short unstressed vowel before the final "n" disappears: the schwa is absorbed and the "n" becomes the syllable nucleus on its own.
Stress falls on the first syllable, not the second. Stretch KAN and keep the second syllable short and quick.
Don't pronounce the second syllable too fully. The unstressed syllable reduces to a schwa, the lazy "uh" sound, in casual speech.