Sa2VA Video Segmentation

XJSa2VAVideoSegmentation

Sa2VA (Segment Anything 2 with Vision Assistant) for video/batch processing. This node generates consistent segmentation masks across multiple frames or images using text prompts, with optional morphological refinement for cleaner results. Ideal for video processing or batch image segmentation.

Pack: ComfyUI-Sa2VA-XJ

custom_nodes.ComfyUI-Sa2VA-XJ

Inputs (11)

NameTypeRequired
model_nameCOMBOrequired
imagesIMAGErequired
segmentation_promptSTRINGrequired
thresholdFLOATrequired
use_8bitBOOLEANrequired
use_flash_attnBOOLEANrequired
unloadBOOLEANrequired
morphCOMBOrequired
erode_kernelINTrequired
dilate_kernelINTrequired
iterationsINTrequired

Outputs (2)

NameType
text_outputSTRING
masksMASK