Audio Encoder

Whisper Large V3 FP16 in ComfyUI

1 workflow uses this model

What is Whisper Large V3 FP16?

Whisper Large V3 FP16 is an audio encoder that converts audio into embeddings for audio-conditioned generation. You can run it locally in ComfyUI with full control over every parameter, or access it through Comfy Cloud. ComfyUI's node-based workflow editor lets you connect Whisper Large V3 FP16 with ControlNets, LoRAs, upscalers, and custom nodes to build the pipeline you need. There is 1 community workflow template using Whisper Large V3 FP16 on Comfy Hub, ready to load and customize.
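To give a rough sense of what "converts audio into embeddings" means in practice, the sketch below estimates the shape and FP16 memory footprint of the encoder output for one audio window. It assumes the published Whisper Large V3 settings (16 kHz audio, 30-second windows, a 160-sample hop, 128 mel bins, a stride-2 convolution, and a 1280-wide hidden state). The class and function names are hypothetical, and this page does not say how the ComfyUI node chunks or pads audio, so treat the numbers as an illustration, not a description of the node's internals.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WhisperLargeV3Config:
    """Assumed Whisper Large V3 encoder settings (not taken from this page)."""
    sample_rate: int = 16_000   # samples per second expected by the model
    chunk_seconds: int = 30     # fixed window length the encoder processes
    hop_length: int = 160       # samples between successive mel frames
    n_mels: int = 128           # mel bins per frame
    conv_stride: int = 2        # downsampling factor of the convolutional stem
    d_model: int = 1280         # width of each output embedding
    bytes_per_value: int = 2    # FP16 stores each value in 2 bytes


def encoder_output_shape(cfg: WhisperLargeV3Config) -> tuple[int, int]:
    """Return (positions, embedding_width) for one audio window."""
    n_samples = cfg.sample_rate * cfg.chunk_seconds
    n_frames = n_samples // cfg.hop_length      # mel frames per window
    n_positions = n_frames // cfg.conv_stride   # positions after downsampling
    return n_positions, cfg.d_model


def embedding_bytes(cfg: WhisperLargeV3Config) -> int:
    """Return the size in bytes of one window's embeddings at the given precision."""
    positions, width = encoder_output_shape(cfg)
    return positions * width * cfg.bytes_per_value


if __name__ == "__main__":
    cfg = WhisperLargeV3Config()
    print(encoder_output_shape(cfg))
    print(embedding_bytes(cfg))
```

Under these assumptions, each 30-second window becomes a sequence of fixed-width vectors that a downstream audio-conditioned node can consume, and storing them in FP16 takes half the memory an FP32 copy would.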