Press your lips together to stop the air, then release. No vocal cord vibration.

Americans pronounce perspectives as per-SPEHK-tuhvz (/pərˈspɛktəvz/). Stress falls on the second syllable — keep everything else short and quick. You'll hear it in sentences like "I value the diverse perspectives that each team member brings" or "Undoubtedly, there are multiple valid perspectives on this issue" — more examples below.
3 syllables, 10 sounds. Each sound has its own note describing the mouth shape and how the sound is made.
Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

Press your lips together to stop the air, then release. No vocal cord vibration.

Drop your jaw moderately. Touch the tongue tip behind the bottom front teeth and lift the mid-front part slightly toward the roof.

Raise the back of your tongue to touch the soft palate (velum). Stop the air, then release.

Touch the tip or front edge of your tongue to the roof of your mouth just behind your teeth. Keep your jaw relaxed. Stop the air, then release with a puff.

Relax your lips, jaw, and tongue completely. Drop your jaw slightly and keep the tongue neutral.
Lift your bottom lip so its inner edge (where the wet part meets the dry part) touches the very bottom of your top front teeth. Add vocal cord vibration as you blow air through.

Same position as S, but add vocal cord vibration. Feel the buzz.

The textbook way isn't wrong — it's just not how anyone actually says it.
In "perspectives", the "t" is not released: the articulators move into position and stop the air, then hold there without the release burst.
Stress falls on the second syllable, not the others. Stretch SPEHK — keep everything else short and quick.
Don't pronounce the unstressed syllables too fully. The vowels in the first and third syllables reduce to a schwa (the lazy "uh" sound) in casual speech.
Americans use a relaxed retroflex R — the tongue curls back rather than rolling. The R is one continuous sound with the vowel before it, not two separate sounds.