Improved stability when switching between Qwen3-VL GGUF models
Fixed mmproj reuse issues in local vision models
Refined internal GGUF model lifecycle management
Documentation updates:
- Clarified project scope as a prompt generator for QwenImageEdit and Wan2.2
- Reorganized Credits and Dependencies for clearer attribution
- Updated llama-cpp-python installation notes to reference the JamePeng fork documentation
Internal structure preparation for future backend refactoring
No breaking changes to node interfaces
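
The mmproj reuse fix listed above can be pictured as a cache keyed on both the model path and the mmproj path, so that switching models never reuses a stale projector. The following is a minimal sketch under that assumption, not the repository's actual implementation: `GGUFModelCache`, `fake_loader`, and the file names are hypothetical, and a real loader would call into llama-cpp-python instead of returning a dict.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Optional, Tuple


@dataclass
class LoadedModel:
    model_path: str
    mmproj_path: Optional[str]
    handle: Any  # whatever the loader returned


class GGUFModelCache:
    """Hypothetical cache: keeps one model loaded, reloads if model OR mmproj changes."""

    def __init__(self, loader: Callable[[str, Optional[str]], Any],
                 unloader: Optional[Callable[[Any], None]] = None) -> None:
        self._loader = loader
        self._unloader = unloader
        self._current: Optional[LoadedModel] = None

    def get(self, model_path: str, mmproj_path: Optional[str] = None) -> Any:
        cur = self._current
        # Reuse only when BOTH paths match; a matching model with a
        # different mmproj still forces a full reload.
        if cur is not None and (cur.model_path, cur.mmproj_path) == (model_path, mmproj_path):
            return cur.handle
        self.unload()
        handle = self._loader(model_path, mmproj_path)
        self._current = LoadedModel(model_path, mmproj_path, handle)
        return handle

    def unload(self) -> None:
        if self._current is not None:
            if self._unloader is not None:
                self._unloader(self._current.handle)
            self._current = None


# Demonstration with a fake loader (assumption: stands in for the real backend).
loads: List[Tuple[str, Optional[str]]] = []


def fake_loader(model_path: str, mmproj_path: Optional[str]) -> dict:
    loads.append((model_path, mmproj_path))
    return {"model": model_path, "mmproj": mmproj_path}


cache = GGUFModelCache(fake_loader)
a = cache.get("qwen3-vl-8b.gguf", "mmproj-8b.gguf")
b = cache.get("qwen3-vl-8b.gguf", "mmproj-8b.gguf")  # same pair: reused
c = cache.get("qwen3-vl-4b.gguf", "mmproj-4b.gguf")  # new pair: reloaded
```

Keying on the pair rather than on the model path alone is the design choice that matters here: the 8B and 4B models each get their own mmproj on every switch.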
README.md
45 additions & 20 deletions
@@ -1,9 +1,10 @@
 # ComfyUI-MultiModal-Prompt-Nodes
 
-**Version:** 1.0.5
+**Version:** 1.0.6
 **License:** GPL-3.0
 
-Advanced multimodal prompt generation nodes for ComfyUI with local GGUF models (Qwen-VL) and cloud API support.
+Multimodal prompt generator nodes for ComfyUI, designed to generate prompts for **QwenImageEdit** and **Wan2.2**.
+Supports **local LLM / local GGUF models** (Qwen3-VL, Qwen-VL) and **Qwen API** for image and video prompt generation and enhancement.
 
 ---
 
@@ -17,6 +18,13 @@ Based on extensive testing, **Wan2.2** and **Qwen-Image-Edit** respond **signifi
 ### Vision Input Compatibility
 Vision input support varies by model and llama-cpp-python version. See Installation section for detailed compatibility information. Results may vary based on your specific environment.
 
+### Local GGUF Model Stability
+Starting from **v1.0.6**, internal GGUF model handling has been improved to ensure stable behavior
+when switching between different Qwen3-VL models (e.g. 8B ↔ 4B), with mmproj files now being
+properly reloaded as part of the model switching process.
+
+These changes are internal and do **not** affect node interfaces or workflows.
 **Note:** Vision input support may vary depending on your environment and configuration. In my setup, I have not been able to get vision input working with Qwen2.5-VL even with the JamePeng fork.
 
-**Recommended Installation (JamePeng fork for Qwen3-VL support):**
+**Recommended Installation (JamePeng fork for Qwen3-VL support):**
+Please follow the build and installation instructions provided in the JamePeng fork repository, as this fork requires a custom build and cannot be reliably installed via a simple `pip install`.
 ⚠️ **Disclaimer:** Your results may differ depending on system configuration, GPU drivers, and other factors. If you encounter issues, please verify your environment setup and consider reporting compatibility details.
 
+**Note:** When using Qwen3-VL GGUF models, switching between different model sizes
+(e.g. 8B ↔ 4B) is supported and stable as of v1.0.6.
+
 ### 4. Place Models
 
 Place your GGUF models in `ComfyUI/models/LLM/`:
@@ -406,19 +415,36 @@ For full details, see the [LICENSE](LICENSE) file and [AUTHORS.md](AUTHORS.md).
 
 ---
 
+## Internal Structure Notes (for Advanced Users)
+
+This repository may introduce internal structural changes over time
+(e.g. extracting Local GGUF or Cloud API implementations into separate modules)
+to improve maintainability and stability.
+
+- Node interfaces (INPUT / RETURN types) are intended to remain stable
+- Internal refactors will be documented in the changelog
+- The `backends/` directory added in v1.0.6 is a **non-functional placeholder**
+for future internal refactoring
+
+No user action is required.
+
+---
+
 ## Credits
 
-### Original Authors
+### Derived From / Inspirations
+This project is a restructured and extended ComfyUI custom node collection, derived from the following GPL-3.0 licensed projects:
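
The Internal Structure Notes in the hunk above promise that node interfaces (INPUT / RETURN types) stay fixed while implementations move into separate modules. A rough sketch of how that separation could look is given below. `PromptNodeSketch` and `EchoBackend` are hypothetical names, not nodes or classes from this repository; only the `INPUT_TYPES` / `RETURN_TYPES` / `FUNCTION` / `CATEGORY` attributes follow the usual ComfyUI custom node convention.

```python
class EchoBackend:
    """Hypothetical stand-in for a local GGUF or cloud API backend."""

    def generate(self, text: str) -> str:
        return f"enhanced: {text}"


class PromptNodeSketch:
    """Hypothetical node: the interface below stays fixed, the backend can change."""

    # Swapping this object is an internal refactor; workflows never see it.
    backend = EchoBackend()

    @classmethod
    def INPUT_TYPES(cls):
        return {"required": {"text": ("STRING", {"multiline": True, "default": ""})}}

    RETURN_TYPES = ("STRING",)
    FUNCTION = "run"
    CATEGORY = "sketch/prompt"

    def run(self, text: str):
        # ComfyUI nodes return a tuple matching RETURN_TYPES.
        return (self.backend.generate(text),)


result = PromptNodeSketch().run("a cat on a sofa")
```

Because the node only calls `backend.generate`, moving backend code into something like a `backends/` package would not change the declared inputs or outputs.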