ComfyUI-Qwen3-ASR

by kaushiknishchay (0 stars)

ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports high-accuracy ASR and language identification for 52 languages/dialects, including 22 Chinese dialects and various English accents. Features word-level timestamps, long audio transcription, and VRAM-optimized inference.

Nodes (2)

Qwen3 ASR Transcriber (class: Qwen3ASRTranscriber, category: Qwen3-ASR)
Qwen3 Forced Aligner Config (class: Qwen3ForcedAlignerConfig, category: Qwen3-ASR)
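
Each node entry above carries three pieces: a display name, a class identifier, and a menu category. As a rough illustration of where those pieces usually live in a ComfyUI custom node package, here is a minimal sketch using the standard ComfyUI registration conventions (CATEGORY, INPUT_TYPES, NODE_CLASS_MAPPINGS, NODE_DISPLAY_NAME_MAPPINGS). It is not this repository's code: the single `audio` input, the `text` output, the `transcribe` method name, and the empty placeholder body are all assumptions made for illustration.

```python
# Hypothetical sketch only: the real node's inputs, outputs, and logic may differ.


class Qwen3ASRTranscriber:
    """Placeholder node; it does not load or run any model."""

    CATEGORY = "Qwen3-ASR"        # menu category, as shown in the listing
    RETURN_TYPES = ("STRING",)    # assumed: one text output
    RETURN_NAMES = ("text",)      # assumed output name
    FUNCTION = "transcribe"       # assumed name of the method ComfyUI calls

    @classmethod
    def INPUT_TYPES(cls):
        # Assumed input: a single ComfyUI AUDIO value.
        return {"required": {"audio": ("AUDIO",)}}

    def transcribe(self, audio):
        # Placeholder body: a real node would run the ASR model here.
        return ("",)


# ComfyUI discovers nodes through these two module-level dictionaries.
NODE_CLASS_MAPPINGS = {"Qwen3ASRTranscriber": Qwen3ASRTranscriber}
NODE_DISPLAY_NAME_MAPPINGS = {"Qwen3ASRTranscriber": "Qwen3 ASR Transcriber"}
```

The dictionary key is the class identifier used in saved workflows, while the display-name mapping supplies the human-readable label shown in the node menu.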