In American English, "step" is pronounced STEHP (/stɛp/). You'll hear it in sentences like "I hope this apology can be the first step toward reconciliation."
Record yourself saying "step" and play it back.
"Step" has 1 syllable and 4 sounds: /s/, /t/, /ɛ/, /p/. Here is how each sound is made, in order.
/s/: Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

/t/: Touch the tip or front edge of your tongue to the roof of your mouth just behind your teeth. Keep your jaw relaxed. Stop the air, then release with a puff. After "s", as in "step", that puff is much weaker than at the start of a word like "top".

/ɛ/: Drop your jaw moderately. Touch the tongue tip behind the bottom front teeth and lift the mid-front part slightly toward the roof.

/p/: Press your lips together to stop the air, then release. No vocal cord vibration.
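
If it helps to see the breakdown laid out, here is a minimal sketch of the four sounds as a small Python table. The `SOUNDS` table and the `describe` helper are hypothetical names made up for this illustration, and the sketch assumes one character per sound and one syllable per vowel, which works for "stɛp" but not for IPA in general.

```python
# Hypothetical feature table for the four sounds in "step" (/stɛp/).
# The labels are ordinary phonetic terms, not taken from any library.
SOUNDS = {
    "s": {"type": "consonant", "manner": "fricative", "voiced": False},
    "t": {"type": "consonant", "manner": "stop", "voiced": False},
    "ɛ": {"type": "vowel", "manner": "vowel", "voiced": True},
    "p": {"type": "consonant", "manner": "stop", "voiced": False},
}


def describe(ipa: str) -> dict:
    """Count the sounds and syllables in a simple IPA string.

    Assumption: one character per sound and one syllable per vowel.
    """
    segments = list(ipa)
    vowels = [s for s in segments if SOUNDS[s]["type"] == "vowel"]
    return {"sounds": len(segments), "syllables": len(vowels)}


summary = describe("stɛp")
print(summary)
```

The count matches the breakdown above: four sounds, with the single vowel /ɛ/ giving the word its one syllable.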

The textbook pronunciation isn't wrong, but it's not how the word usually sounds in everyday speech.
In "step", the final "p" is often unreleased: the lips close and stop the air, then hold that position without the burst of air.