Press your lips together. Air flows through your nose. Vocal cords vibrate.

Americans pronounce midterm as MIHD-turm (/ˈmɪdˌtɜrm/). Stress falls on the first syllable — keep everything else short and quick. You'll hear it in sentences like "I studied for hours to prepare for the midterm exam tomorrow".
Record yourself saying "midterm" and play it back. The mic stays on your device — nothing's uploaded.
2 syllables, 6 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.
Press your lips together. Air flows through your nose. Vocal cords vibrate.

Drop your jaw slightly with relaxed lips. Touch the tongue tip behind the bottom front teeth and arch the top-front toward the roof.

Touch the tip of your tongue to the roof of your mouth just behind your teeth. Add vocal cord vibration as you release.

Touch the tip or front edge of your tongue to the roof of your mouth just behind your teeth. Keep your jaw relaxed. Stop the air, then release with a puff.

Flare your lips and push them away from the face. Lift the middle of your tongue toward the roof of the mouth.

Press your lips together. Air flows through your nose. Vocal cords vibrate.

The textbook way isn't wrong — it's just not how anyone actually says it.
In "midterm", the "t" is not released — the articulators get into position but hold without the burst of air. Air stops but there's no release burst — the articulators hold position.
Stress falls on the first syllable, not the others. Stretch MIHD — keep everything else short and quick.
Americans use a relaxed retroflex R — the tongue curls back rather than rolling. The R is one continuous sound with the vowel before it, not two separate sounds.