Multimodal Avatar Generation and Animation

MagicAvatar is a multimodal framework that converts various input modalities (text, video, and audio) into motion signals, which are then used to generate or animate an avatar.
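The two-stage flow described above (input modality to motion signal, then motion signal to avatar) can be sketched as follows. This is a minimal illustration only: every name here (`MotionSignal`, `AvatarVideo`, `text_to_motion`, `video_to_motion`, `motion_to_avatar`, `generate`) is hypothetical and is not MagicAvatar's actual API, and the string placeholders stand in for real motion and video data.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


# Hypothetical types for illustration; not MagicAvatar's actual API.
@dataclass
class MotionSignal:
    source_modality: str
    frames: List[str]  # placeholder per-frame motion descriptors


@dataclass
class AvatarVideo:
    identity: str
    frames: List[str]  # placeholder rendered frames


def text_to_motion(prompt: str) -> MotionSignal:
    # Stage 1 (text): placeholder that emits one motion frame per word.
    return MotionSignal("text", [f"pose<{word}>" for word in prompt.split()])


def video_to_motion(video_frames: List[str]) -> MotionSignal:
    # Stage 1 (video): placeholder that emits one motion frame per source frame.
    return MotionSignal("video", [f"pose<{frame}>" for frame in video_frames])


# Audio is deliberately left unregistered in this sketch.
MOTION_ENCODERS: Dict[str, Callable[..., MotionSignal]] = {
    "text": text_to_motion,
    "video": video_to_motion,
}


def motion_to_avatar(motion: MotionSignal, identity: str = "generic") -> AvatarVideo:
    # Stage 2: placeholder renderer that tags each motion frame with an identity.
    return AvatarVideo(identity, [f"{identity}:{pose}" for pose in motion.frames])


def generate(modality: str, payload, identity: str = "generic") -> AvatarVideo:
    # Route the input through its modality-specific encoder, then render.
    if modality not in MOTION_ENCODERS:
        raise ValueError(f"unsupported modality: {modality}")
    motion = MOTION_ENCODERS[modality](payload)
    return motion_to_avatar(motion, identity)


if __name__ == "__main__":
    print(generate("text", "a person dancing").frames)
```

The point of the shared `MotionSignal` type in this sketch is that the rendering stage does not need to know which modality produced the motion, so a new input type only requires a new encoder entry.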

Introduction Video:

Text-guided Avatar Generation

Create avatars from simple text prompts. Each example shows the text prompt, the generated motion, and the generated video.

Video-guided Avatar Generation

Given a source video, create avatars that follow its motion.

Multimodal Avatar Animation

Animate an avatar of a specific subject. Each example shows the identity, the driving signal, the motion, and the generated video.

Audio-guided Avatar Generation (coming soon)

Create an avatar based on audio input.
