Press your lips together. Air flows through your nose. Vocal cords vibrate.

Americans pronounce monitored as MAH-nuh-terd (/ˈmɑnəɾərd/). In "monitored", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. This is called the Flap T, a hallmark of natural-sounding American speech. So instead of MAH·nuh·tert, you get MAH·nuh·terd. Stress falls on the first syllable — keep everything else short and quick. You'll hear it in sentences like "He monitored the active volcano for signs of an eruption" or "The patrol car monitored the neighborhood throughout the night" — more examples below.
Record yourself saying "monitored" and play it back. The mic stays on your device — nothing's uploaded.
3 syllables, 7 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.
Quickly bounce the front of your tongue against the roof of your mouth. Don't stop the airflow — just a quick tap.

Relax your mouth and lift the tongue back and up. Keep the lips neutral.

Touch the tip of your tongue to the roof of your mouth just behind your teeth. Add vocal cord vibration as you release.

Click any sentence to see the full breakdown — every link, every reduction, every flap-T.
The textbook way isn't wrong — it's just not how anyone actually says it.
In "monitored", the "t" between vowels sounds like a quick "d" — the tongue briefly taps the ridge behind the upper teeth. /t/ or /d/ becomes a quick tap [ɾ] — sounds like a soft D. The tongue briefly taps the ridge behind the upper teeth.
In "monitored", the "d" is not released — the articulators get into position but hold without the burst of air. Air stops but there's no release burst — the articulators hold position.
Stress falls on the first syllable, not the others. Stretch MAH — keep everything else short and quick.
Don't pronounce the first syllable too fully. The unstressed syllable reduces to a schwa — the lazy "uh" sound — in casual speech.