Prompt for the source video: A horse is jumping
Prompt for each reference image: A photo of a horse
Prompt for the synthesized video: A zebra is jumping