Press your lips together to stop the air, then release. No vocal cord vibration.

Americans pronounce perhaps as per-HAPS (/pərˈhæps/). Stress falls on the second syllable — keep everything else short and quick. You'll hear it in sentences like "Perhaps the hero can help the hungry horse".
Record yourself saying "perhaps" and play it back. The mic stays on your device — nothing's uploaded.
2 syllables, 6 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.
Push a stream of air from your throat through your open mouth. No tongue or lip contact.

Drop the jaw noticeably. Keep the body of the tongue low and forward, and don't let the back of the tongue raise toward the soft palate. Pull the lip corners back slightly, almost a starting smile.

Press your lips together to stop the air, then release. No vocal cord vibration.

Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

The textbook way isn't wrong — it's just not how anyone actually says it.
Stress falls on the second syllable, not the others. Stretch HAPS — keep everything else short and quick.
Americans use a relaxed retroflex R — the tongue curls back rather than rolling. The R is one continuous sound with the vowel before it, not two separate sounds.