Abstract: Personalized voice cloning increasingly requires not only high speaker fidelity but also fine-grained control over rhythm, pitch, intensity, and expressive prosody. However, many existing ...