Relax your lips and drop your jaw significantly. The tongue tip lightly touches behind the bottom front teeth and the back part of the tongue presses down a little to create more dark space in the back of the mouth.

Americans pronounce automation as ah-tuh-MAY-shuhn (/ˌɑɾəˈmeɪʃən/). In "automation", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. This is called the Flap T, and it's one of the defining features of casual American English. It comes out as AH·tuh·MAY·shuhn. Stress falls on the third syllable — keep everything else short and quick. You'll hear it in sentences like "Automation is expected to replace numerous jobs in manufacturing".
Record yourself saying "automation" and play it back. The mic stays on your device — nothing's uploaded.
4 syllables, 8 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.
The schwa before M disappears — M becomes the vowel of the syllable. Go straight from the previous consonant to M.

Start with your jaw slightly open and the front of your tongue forward and slightly up. Glide upward, your jaw closes a little more and your tongue arches higher toward the roof of the mouth.
Flare your lips and lift the mid-front tongue close to the roof of your mouth. Blow air through without voicing.

Relax your lips, jaw, and tongue completely. Drop your jaw slightly and keep the tongue neutral.
The schwa before N disappears — N becomes the vowel of the syllable. Go straight from the previous consonant to N.

Click any sentence to see the full breakdown — every link, every reduction, every flap-T.
The textbook way isn't wrong — it's just not how anyone actually says it.
In "automation", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.
In "automation", the short unstressed vowel before "n" disappears — the schwa is absorbed and the "n" becomes the syllable nucleus on its own. Schwa is absorbed — consonant becomes the syllable nucleus.
Stress falls on the third syllable, not the others. Stretch MAY — keep everything else short and quick.
Don't pronounce the first syllable too fully. The unstressed syllable reduces to a schwa — the lazy "uh" sound — in casual speech.