Microsoft VASA-1 (Unreleased)

Lifelike Audio-Driven Talking Faces Generated in Real Time

VASA-1: single portrait photo + speech audio = hyper-realistic talking face video with precise lip-audio sync, lifelike facial behavior, and naturalistic head movements, generated in real time.

Realism and liveliness

Our method not only produces precise lip-audio synchronization, but also generates a wide spectrum of expressive facial nuances and natural head motions. It can handle arbitrary-length audio and stably output seamless talking face videos.


