Image-to-Video
Diffusers
Safetensors
English
Chinese
video generation
conversational video generation
talking human video generation
How to use MeiGen-AI/MeiGen-MultiTalk with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import load_image, export_to_video

# switch to "mps" for Apple devices
pipe = DiffusionPipeline.from_pretrained("MeiGen-AI/MeiGen-MultiTalk", dtype=torch.bfloat16, device_map="cuda")
pipe.to("cuda")

prompt = "A man with short gray hair plays a red electric guitar."
image = load_image(
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/guitar-man.png"
)

output = pipe(image=image, prompt=prompt).frames[0]
export_to_video(output, "output.mp4")
What kind of hardware does this require to run locally?
#1
by wangleineo - opened
What kind of hardware does this require to run locally?
Our code has been optimized to require less than 24GB of VRAM when using torch.bfloat16 for computation. The optimized version will be released within the next two days.
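As a rough illustration of why the computation dtype matters for a VRAM budget like the 24GB mentioned above, here is a minimal sketch that estimates how much memory the model weights alone would occupy. The helper names and the parameter count are hypothetical and chosen only for the example; the count is not MultiTalk's actual size, and activations or auxiliary models are ignored unless you pass them in as overhead_gib.

```python
# Bytes needed to store one parameter in each dtype.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1}


def weight_memory_gib(num_params: int, dtype: str = "bfloat16") -> float:
    """Memory needed to hold the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


def fits_in_vram(num_params: int, dtype: str, vram_gib: float,
                 overhead_gib: float = 0.0) -> bool:
    """True if weights plus a caller-supplied overhead fit in the budget."""
    return weight_memory_gib(num_params, dtype) + overhead_gib <= vram_gib


if __name__ == "__main__":
    # Hypothetical parameter count, NOT the real size of MultiTalk.
    example_params = 10_000_000_000
    for dtype in ("float32", "bfloat16"):
        gib = weight_memory_gib(example_params, dtype)
        ok = fits_in_vram(example_params, dtype, vram_gib=24)
        print(f"{dtype}: {gib:.1f} GiB for weights, fits in 24 GiB: {ok}")
```

Halving the bytes per parameter halves the weight footprint, which is the main reason a bfloat16 run can fit on a card where float32 would not. Real usage will be higher than this estimate, so treat it as a lower bound.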
The multi-GPU inference code has been updated, and low-memory inference is now supported as well.