ComfyUI-Qwen-Omni

by SXQBW | 0 stars | success

ComfyUI-Qwen-Omni is the first ComfyUI plugin to support end-to-end multimodal interaction, enabling joint generation and editing of text, images, and audio. In a single operation, with no intermediate steps, the model understands and processes multiple input modalities at once and generates coherent text descriptions and voice output, which makes for a smoother AI creation workflow.


Nodes (0)

No node definitions found for this pack.
