Tap play: the contour draws itself in sync with the audio, mora by mora. Pattern: stroke-dashoffset driven by audio.currentTime. MVP timing: the audio duration is divided evenly across the morae, so every mora gets the same time slice. Phase 2 will use forced alignment (narabas / torchaudio MMS_FA) for real per-mora boundaries.
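
A minimal sketch of the pattern, not the actual implementation. All names here (`evenMoraBoundaries`, `contourProgress`, `dashOffsetAt`, `syncContour`, and the `AudioLike` / `PathLike` shapes) are hypothetical. It also assumes each mora occupies an equal share of the path length, which the note does not state. Timing is expressed as a list of per-mora end times so that Phase 2 alignment output could replace the even split without touching the drawing code.

```typescript
// Structural types so the sketch does not depend on DOM typings.
// In a browser, HTMLAudioElement and SVGPathElement satisfy these shapes.
interface AudioLike {
  currentTime: number;
  duration: number;
}
interface PathLike {
  getTotalLength(): number;
  style: { strokeDasharray: string; strokeDashoffset: string };
}

// MVP timing: split the clip into moraCount equal slices.
// Returns the end time (in seconds) of each mora.
function evenMoraBoundaries(duration: number, moraCount: number): number[] {
  const slice = duration / moraCount;
  return Array.from({ length: moraCount }, (_, i) => slice * (i + 1));
}

// Fraction of the contour to reveal at time t, in [0, 1].
// Piecewise linear over the mora boundaries; with even boundaries this
// reduces to t / duration. Assumption: each mora covers 1/n of the path.
function contourProgress(t: number, boundaries: number[]): number {
  const n = boundaries.length;
  // audio.duration is NaN until metadata has loaded: draw nothing yet.
  if (n === 0 || !Number.isFinite(boundaries[n - 1])) return 0;
  const i = boundaries.findIndex((end) => t < end);
  if (i === -1) return 1; // past the last mora: fully drawn
  const start = i === 0 ? 0 : boundaries[i - 1];
  const within = (t - start) / (boundaries[i] - start);
  return Math.max(0, (i + within) / n);
}

// stroke-dashoffset for a path of the given length: the full length hides
// the stroke, 0 shows all of it.
function dashOffsetAt(t: number, boundaries: number[], pathLength: number): number {
  return pathLength * (1 - contourProgress(t, boundaries));
}

// Apply the current audio position to the path. Call this once per frame
// (or on each timeupdate event) while the audio is playing.
function syncContour(path: PathLike, audio: AudioLike, moraCount: number): void {
  const length = path.getTotalLength();
  const boundaries = evenMoraBoundaries(audio.duration, moraCount);
  path.style.strokeDasharray = String(length);
  path.style.strokeDashoffset = String(dashOffsetAt(audio.currentTime, boundaries, length));
}
```

In a browser, `syncContour` would typically be called from a `requestAnimationFrame` loop started on play, since `timeupdate` events fire too coarsely for a smooth stroke. For Phase 2, only the source of `boundaries` would need to change.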