-
ComfyUI_Prompt-All-In-One Public
A ComfyUI node that generates prompts for video, audio, image, and text. Supports DeepSeek, Alibaba Cloud Qwen, Google Gemini, locally selected models, and more.
-
ComfyUI_OneButtonPrompt Public
A ComfyUI node for one-click assisted prompt generation (for image and video generation, etc.).
-
ComfyUI_AudioTools Public
A ComfyUI node containing multiple audio processing tools.
-
ComfyUI_PortraitTools Public
Portrait Tools: face-detection cropping, alignment, ID photos, etc.
-
ComfyUI_parakeet-tdt Public
parakeet-tdt-0.6b-v2: Automatic speech recognition (ASR) model designed for high-quality English transcription, featuring support for punctuation, capitalization, and accurate timestamp prediction.
-
ComfyUI_OuteTTS Public
OuteTTS: Multilingual Text-To-Speech, Voice Cloning. A ComfyUI node.
-
ComfyUI_Seed-VC Public
Seed-VC: voice and singing-voice conversion.
-
ComfyUI_MegaTTS3 Public
Lightweight and efficient 🎧 ultra-high-quality voice cloning in Chinese and English.
-
ComfyUI_NotaGen Public
Symbolic Music Generation, NotaGen node for ComfyUI.
-
ComfyUI_KokoroTTS_MW Public
A text-to-speech node using Kokoro TTS in ComfyUI. Supports 8 languages and 150 voices.
-
ComfyUI_CSM Public
A ComfyUI node for the Conversational Speech Model (CSM).
-
ComfyUI_IndexTTS Public
IndexTTS voice cloning. Supports two-person dialogue.
-
ComfyUI_gemmax Public
ComfyUI translation nodes: XiaoMi GemmaX, QuickMT, etc.
-
ComfyUI_DiffRhythm Public
Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation. A node for ComfyUI.
-
ComfyUI_Dia Public
Dia: a TTS model capable of generating ultra-realistic dialogue in one pass. A ComfyUI node.
-
ComfyUI_ASR-zh Public
Automatic speech recognition (ASR) for Chinese.
-
ComfyUI_ACE-Step Public
ACE-Step: A Step Towards Music Generation Foundation Model
-
ComfyUI_StepAudioTTS Public
A text-to-speech node using Step-Audio-TTS in ComfyUI. Can speak, rap, sing, or clone voices.
-
ComfyUI_SparkTTS Public
A node for using Spark-TTS in ComfyUI. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens.
-
ComfyUI_EraX-WoW-Turbo Public
A very fast multilingual speech recognition model based on Whisper Large-v3 Turbo. A node for ComfyUI.