Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

Americans pronounce smartphones as SMART-fohnz (/ˈsmɑrtˌfoʊnz/). Stress falls on the first syllable — keep everything else short and quick. You'll hear it in sentences like "The store sells laptops, tablets, and smartphones".
Record yourself saying "smartphones" and play it back. The mic stays on your device — nothing's uploaded.
2 syllables, 8 sounds. Tap a syllable to jump to its row, then explore each sound's mouth shape and how it's made.
Place your tongue tip near the roof of your mouth behind your top teeth. Push air through the narrow gap. No voicing.

Press your lips together. Air flows through your nose. Vocal cords vibrate.

Open wide for the 'ah' vowel. Lift the tongue back and up while flaring the lips for the 'r'.
Touch the tip or front edge of your tongue to the roof of your mouth just behind your teeth. Keep your jaw relaxed. Stop the air, then release with a puff.

Lift your bottom lip to touch the very bottom of your top front teeth. Blow air through this contact point without voicing.

Start with your mouth slightly open, then close your jaw slightly as your lips round. Shift your tongue back slightly, then stretch the back up.
Touch the tip or front edge of your tongue to the roof of your mouth behind your teeth. Air flows through your nose.

Same position as S, but add vocal cord vibration. Feel the buzz.

The textbook way isn't wrong — it's just not how anyone actually says it.
In "smartphones", the "t" is not released — the articulators get into position but hold without the burst of air. Air stops but there's no release burst — the articulators hold position.
Stress falls on the first syllable, not the others. Stretch SMART — keep everything else short and quick.
Americans use a relaxed retroflex R — the tongue curls back rather than rolling. The R is one continuous sound with the vowel before it, not two separate sounds.