	---
	tags:
	- gguf
	- comfyui
	- workflow-generation
	- qwen
	- text-generation
	- sentence-transformers
	library_name: gguf
	base_model: Qwen/Qwen2.5-14B
	license: gpl-3.0
	language:
	- en
	pipeline_tag: text-generation
	---

	# ComfyUI-WorkflowGenerator Models

	This repository contains the quantized GGUF models required for [ComfyUI-WorkflowGenerator](https://github.com/danielpflorian/ComfyUI-WorkflowGenerator), a custom node implementation that generates ComfyUI workflows from natural language descriptions.

	## Models Included

	### 1. Workflow Generator Model (Required)

	- File: `workflow-generator-q8_0.gguf`
	- Tokenizer: `workflow-generator/` directory
	- Purpose: Generates workflow diagrams from natural language instructions
	- Base Model: Qwen2.5-14B
	- Training: Fine-tuned from Qwen2.5-14B using LLaMA-Factory (see [original ComfyGPT repository](https://github.com/comfygpt/comfygpt/tree/main))
	- Status: Required - This model is always needed

	### 2. Embedding Model (Required)

	- Directory: `paraphrase-multilingual-MiniLM-L12-v2/`
	- Purpose: Semantic search for node name matching and validation
	- Base Model: [sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2)
	- Status: Required - Always needed for semantic search in NodeValidator

	### 3. Node Validator Model (Optional)

	- File: `Qwen2.5-7B-Instruct-q8_0.gguf`
	- Tokenizer: `Qwen2.5-7B-Instruct/` directory
	- Purpose: Refines and corrects node names in workflow diagrams (LLM refinement mode)
	- Base Model: Qwen2.5-7B-Instruct (stock release, not fine-tuned)
	- Status: Optional - Only needed if using LLM refinement (`use_llm_refinement=True`)

	## Model Training Information

	### Workflow Generator Model

	The `workflow-generator-q8_0.gguf` model was trained following the [ComfyGPT](https://github.com/comfygpt/comfygpt/tree/main) methodology:

	- Original Model Source: [xiatianzs/resources](https://huggingface.co/xiatianzs/resources/tree/main) - Original fine-tuned model from ComfyGPT research team
	- Base Model: [Qwen/Qwen2.5-14B](https://huggingface.co/Qwen/Qwen2.5-14B) from HuggingFace
	- Training Method: Full fine-tuning (Supervised Fine-Tuning / SFT)
	- Training Framework: [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory)
	- Training Dataset: `FlowDataset.json`, which contains instruction-input-output pairs where:
	  - Instruction: "Based on the description I provided, generate a JSON example of the required ComfyUi workflow."
	  - Input: Natural language workflow descriptions
	  - Output: JSON diagrams (list of edges representing workflow connections)
	- Training Hyperparameters:
	  - Learning rate: 1.0e-5
	  - Epochs: 3.0
	  - Batch size: 1 per device (gradient accumulation: 4 steps)
	  - LR scheduler: Cosine with 0.1 warmup ratio
	  - Precision: bf16
	  - Cutoff length: 8,192 tokens (training cutoff; model architecture supports up to 131,072 tokens)
	  - DeepSpeed: ZeRO-3 optimization
	- Quantization: q8_0 (8-bit quantization for efficient inference)

	For more details on the training process, see the [original ComfyGPT repository](https://github.com/comfygpt/comfygpt/tree/main) and [training configuration](https://github.com/comfygpt/comfygpt/tree/main/train/sft).
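
The per-device batch size and gradient accumulation listed above combine with the number of GPUs to give the effective batch size per optimizer step. The number of devices used for training is not stated here, so the device count below is a hypothetical value, and `effective_batch_size` is an illustrative helper rather than part of the original training setup:

```bash
#!/usr/bin/env bash
# Effective batch size = per-device batch size x gradient accumulation steps x device count.
effective_batch_size() {
  local per_device=$1 grad_accum=$2 num_devices=$3
  echo $(( per_device * grad_accum * num_devices ))
}

# 1 and 4 come from the hyperparameter list; 8 devices is an assumption for illustration only.
effective_batch_size 1 4 8
```

With those inputs the sketch prints 32; on a single device it would be 4.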

	### Embedding Model

	The `paraphrase-multilingual-MiniLM-L12-v2` model is a SentenceTransformer model used for semantic search in the NodeValidator. It encodes node names into embeddings and finds the most similar nodes when correcting invalid node names in workflow diagrams.

	- Original Model: [sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2](https://huggingface.co/sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2)
	- Type: SentenceTransformer (HuggingFace format)
	- Size: ~420 MB
	- Dimensions: 384 (embedding vector size)
	- Use Case: Semantic similarity search for node name matching

	### Node Validator Model

	The `Qwen2.5-7B-Instruct-q8_0.gguf` model is the stock [Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) model (not fine-tuned). Its built-in instruction-following ability is used to select the best node from the semantic search candidates.

	## Quick Download

	Download all models at once:

	```bash
	huggingface-cli download DanielPFlorian/comfyui-workflowgenerator-models \
	--local-dir ./ComfyUI/models/LLM/
	```

	Or download specific models:

	```bash
	# Download only the required models (workflow-generator + embedding model)
	huggingface-cli download DanielPFlorian/comfyui-workflowgenerator-models \
	--include "workflow-generator-q8_0.gguf" "workflow-generator/" "paraphrase-multilingual-MiniLM-L12-v2/" \
	--local-dir ./ComfyUI/models/LLM/
	```

	## Installation and Setup

	### Step 1: Download Models

	Download the models to your ComfyUI models directory:

	```bash
	# Navigate to ComfyUI directory
	cd /path/to/ComfyUI

	# Download all models
	huggingface-cli download DanielPFlorian/comfyui-workflowgenerator-models \
	--local-dir ./models/LLM/
	```

	### Step 2: Organize Files in LLM Directory

	After downloading, organize the files in `ComfyUI/models/LLM/` as follows:

	```
	ComfyUI/models/LLM/
	├── workflow-generator-q8_0.gguf              # Main model (required)
	├── workflow-generator/                       # Main tokenizer (required)
	│   ├── tokenizer.json
	│   ├── tokenizer_config.json
	│   ├── vocab.json
	│   ├── merges.txt
	│   ├── special_tokens_map.json
	│   ├── added_tokens.json
	│   ├── config.json
	│   ├── generation_config.json
	│   └── model.safetensors.index.json
	├── paraphrase-multilingual-MiniLM-L12-v2/    # Embedding model (required)
	│   ├── config.json
	│   ├── model.safetensors
	│   ├── modules.json
	│   ├── sentence_bert_config.json
	│   ├── config_sentence_transformers.json
	│   ├── tokenizer.json
	│   ├── tokenizer_config.json
	│   ├── special_tokens_map.json
	│   ├── sentencepiece.bpe.model
	│   ├── unigram.json
	│   └── 1_Pooling/
	│       └── config.json
	├── Qwen2.5-7B-Instruct-q8_0.gguf             # NodeValidator model (optional)
	└── Qwen2.5-7B-Instruct/                      # NodeValidator tokenizer (optional)
	    ├── tokenizer.json
	    ├── tokenizer_config.json
	    ├── vocab.json
	    ├── merges.txt
	    ├── config.json
	    └── generation_config.json
	```
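
A quick way to confirm that the top-level entries from the layout above are in place is a small shell check. This is a minimal sketch: `check_llm_layout` is a hypothetical helper name, not something shipped with ComfyUI-WorkflowGenerator, and it only tests that each entry exists, not that its contents are complete.

```bash
#!/usr/bin/env bash
# Report which top-level entries from the expected layout are missing.
# Usage: check_llm_layout <llm_dir> [yes|no]   (second argument: also check the optional validator model)
check_llm_layout() {
  local llm_dir=$1 with_validator=${2:-no} missing=0 entry
  local entries=(
    "workflow-generator-q8_0.gguf"
    "workflow-generator"
    "paraphrase-multilingual-MiniLM-L12-v2"
  )
  if [ "$with_validator" = "yes" ]; then
    entries+=("Qwen2.5-7B-Instruct-q8_0.gguf" "Qwen2.5-7B-Instruct")
  fi
  for entry in "${entries[@]}"; do
    if [ ! -e "$llm_dir/$entry" ]; then
      echo "missing: $entry"
      missing=1
    fi
  done
  # Exit status 0 means every checked entry exists.
  return "$missing"
}

if check_llm_layout "./ComfyUI/models/LLM" yes; then
  echo "layout looks complete"
fi
```

Pass `no` (or omit the second argument) when LLM refinement is not used, so the optional Qwen2.5-7B-Instruct files are not reported as missing.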

	Important: The tokenizer directory name must match the model filename with the `.gguf` extension and the quantization suffix (for example `-q8_0`) removed. The auto-detection code looks for:
	- `workflow-generator-q8_0.gguf` → `workflow-generator/` tokenizer
	- `Qwen2.5-7B-Instruct-q8_0.gguf` → `Qwen2.5-7B-Instruct/` tokenizer
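
The naming rule above can be sketched in shell as follows. This only illustrates the rule as described; the actual auto-detection code in ComfyUI-WorkflowGenerator may differ. The helper name `tokenizer_dir_for` and the exact suffix pattern are assumptions.

```bash
#!/usr/bin/env bash
# Derive the expected tokenizer directory name from a GGUF filename by
# dropping the ".gguf" extension and a trailing quantization suffix such as "-q8_0".
tokenizer_dir_for() {
  local name
  name=$(basename "$1" .gguf)
  # Strip a trailing suffix like -q8_0, -q4_0 or -q4_k_m (assumed pattern).
  printf '%s\n' "$name" | sed -E 's/[-_.][qQ][0-9]+(_[0-9A-Za-z]+)*$//'
}

tokenizer_dir_for "workflow-generator-q8_0.gguf"    # workflow-generator
tokenizer_dir_for "Qwen2.5-7B-Instruct-q8_0.gguf"   # Qwen2.5-7B-Instruct
```

If you rename a model file or use a different quantization, running the filename through a check like this shows which directory name the tokenizer would need.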

	## Usage

	### Required Models

	- `workflow-generator-q8_0.gguf` + `workflow-generator/` tokenizer - Always needed
	- `paraphrase-multilingual-MiniLM-L12-v2/` - Always needed for semantic search

	### Optional Models

	- `Qwen2.5-7B-Instruct-q8_0.gguf` + `Qwen2.5-7B-Instruct/` tokenizer - Only needed if using LLM refinement (`use_llm_refinement=True`)

	### Model Usage in ComfyUI-WorkflowGenerator

	1. WorkflowGenerator Node: Uses `workflow-generator-q8_0.gguf` to generate workflow diagrams from natural language
	2. NodeValidator Node:
	   - Uses `paraphrase-multilingual-MiniLM-L12-v2` for semantic search (always)
	   - Uses `Qwen2.5-7B-Instruct-q8_0.gguf` for LLM refinement (optional, when `use_llm_refinement=True`)
	3. WorkflowBuilder Node: No models needed (deterministic code)

	## Model Specifications

	### Workflow Generator Model

	- Format: GGUF (q8_0 quantization)
	- Base: Qwen2.5-14B
	- Size: ~15-16 GB (quantized; q8_0 stores roughly 8.5 bits per weight, and the base model has about 14.7B parameters)
	- Context Window: 131,072 tokens (128K) supported by the model architecture; fine-tuning used an 8,192-token cutoff
	- Quantization: q8_0 (8-bit, good balance of quality and size)

	### Embedding Model

	- Format: SentenceTransformer (HuggingFace format)
	- Base: sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
	- Size: ~420 MB
	- Dimensions: 384 (embedding vector size)

	### Node Validator Model

	- Format: GGUF (q8_0 quantization)
	- Base: Qwen2.5-7B-Instruct
	- Size: ~8 GB (quantized; q8_0 stores roughly 8.5 bits per weight, and the base model has about 7.6B parameters)
	- Context Window: 32,768 tokens
	- Quantization: q8_0 (8-bit, good balance of quality and size)

	## System Requirements

	- VRAM:
	  - Minimum: 8 GB (for workflow-generator with CPU offloading)
	  - Recommended: 16+ GB (for both models on GPU)
	- RAM: 16+ GB recommended
	- Storage: ~24-25 GB for all models, tokenizers, and embedding model (about 16 GB if the optional Node Validator model is skipped)
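
Before downloading, it can help to confirm that the target filesystem has enough free space. The following is a minimal sketch using POSIX `df`; `has_free_space` is a hypothetical helper, and the threshold in the example is an assumed value that you should replace with the storage figure that applies to the files you plan to download.

```bash
#!/usr/bin/env bash
# Succeed if the filesystem holding <dir> has at least <need_gb> GiB free.
has_free_space() {
  local dir=$1 need_gb=$2 avail_kb
  # df -Pk prints POSIX-format output in 1024-byte blocks; column 4 is "Available".
  avail_kb=$(df -Pk "$dir" | awk 'NR==2 {print $4}')
  [ "$avail_kb" -ge $(( need_gb * 1024 * 1024 )) ]
}

# Hypothetical threshold: the storage requirement plus any headroom you want.
need_gb=25
if has_free_space "." "$need_gb"; then
  echo "enough free space"
else
  echo "not enough free space"
fi
```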

	## Performance Tips

	1. Use GGUF models: Quantized GGUF files are smaller and need less VRAM than unquantized HuggingFace checkpoints
	2. GPU Layers: Use the "auto" setting for GPU layer allocation
	3. LLM Refinement: Enable only if you need higher node-matching accuracy; it is slower than semantic search alone
	4. Semantic Search Only: Faster and deterministic (recommended for most use cases)

	## Troubleshooting

	### Model Not Found

	- Verify the models are in the `ComfyUI/models/LLM/` directory
	- Check that the tokenizer directory name matches the model filename (without `.gguf` and the quantization suffix)
	- Restart ComfyUI after moving files

	### Tokenizer Not Found

	- Ensure tokenizer directory exists with the correct name
	- Verify tokenizer files (`tokenizer.json`, `tokenizer_config.json`, `vocab.json`) are present
	- Check directory structure matches the expected format
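
The file check above can be scripted. This is a small sketch under the assumption that the three files named above are the ones that matter; `missing_tokenizer_files` is a hypothetical helper, and the path in the example is only the default location used in this README.

```bash
#!/usr/bin/env bash
# Print any of the expected tokenizer files that are absent from <tokenizer_dir>.
missing_tokenizer_files() {
  local tokenizer_dir=$1 f
  for f in tokenizer.json tokenizer_config.json vocab.json; do
    [ -f "$tokenizer_dir/$f" ] || echo "$f"
  done
}

# Prints nothing when all three files are present.
missing_tokenizer_files "./ComfyUI/models/LLM/workflow-generator"
```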

	### Out of Memory

	- Reduce `n_gpu_layers` (try "auto" or a lower number)
	- Use a smaller quantization (for example q4_0 instead of q8_0); note that you will need to re-quantize the model yourself
	- Set `device_preference` to "cpu" for some operations

	## Related Resources

	- ComfyUI-WorkflowGenerator: [GitHub Repository](https://github.com/danielpflorian/ComfyUI-WorkflowGenerator)
	- Original ComfyGPT Research: [GitHub Repository](https://github.com/comfygpt/comfygpt)
	- Research Paper: [arXiv:2503.17671](https://arxiv.org/abs/2503.17671)
	- Project Website: [https://comfygpt.github.io/](https://comfygpt.github.io/)

	## Citation

	If you use these models in your research, please cite the original ComfyGPT paper:

	```bibtex
	@article{huang2025comfygpt,
	title={ComfyGPT: A Self-Optimizing Multi-Agent System for Comprehensive ComfyUI Workflow Generation},
	author={Huang, Oucheng and Ma, Yuhang and Zhao, Zeng and Wu, Mingrui and Ji, Jiayi and Zhang, Rongsheng and Hu, Zhipeng and Sun, Xiaoshuai and Ji, Rongrong},
	journal={arXiv preprint arXiv:2503.17671},
	year={2025}
	}
	```

	## License

	These models are provided for use with ComfyUI-WorkflowGenerator. Please refer to:
	- Original ComfyGPT repository for model training details and licensing
	- Qwen2.5 model licenses from HuggingFace
	- ComfyUI-WorkflowGenerator repository for usage terms

	## Support

	For issues, questions, or contributions:
	- Issues: [ComfyUI-WorkflowGenerator Issues](https://github.com/danielpflorian/ComfyUI-WorkflowGenerator/issues)
	- Documentation: [ComfyUI-WorkflowGenerator Wiki](https://github.com/danielpflorian/ComfyUI-WorkflowGenerator/wiki)