Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

Americans pronounce stadium as STAY-dee-uhm (/ˈsteɪdiəm/). In "stadium", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. This is called the Flap T, a hallmark of natural-sounding American speech. So instead of STAY·tee·uhm, you get STAY·dee·uhm. Stress falls on the first syllable — keep everything else short and quick. You'll hear it in sentences like "The stadium was packed with cheering fans wearing team colors".
Record yourself saying "stadium" and play it back. The mic stays on your device — nothing's uploaded.
3 syllables, 7 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.
Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

Touch the tip or front edge of your tongue to the roof of your mouth just behind your teeth. Keep your jaw relaxed. Stop the air, then release with a puff.

Start with your jaw slightly open and the front of your tongue forward and slightly up. Glide upward, your jaw closes a little more and your tongue arches higher toward the roof of the mouth.
The textbook way isn't wrong — it's just not how anyone actually says it.
In "stadium", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.
In "stadium", the short unstressed vowel before "m" disappears — the schwa is absorbed and the "m" becomes the syllable nucleus on its own. Schwa is absorbed — consonant becomes the syllable nucleus.
Stress falls on the first syllable, not the others. Stretch STAY — keep everything else short and quick.
Don't pronounce the second syllable too fully. The unstressed syllable reduces to a schwa — the lazy "uh" sound — in casual speech.