Authors

Primary Author

kantan-kanto (@kantan-kanto) - 2026

Contributions

  • Vision LLM Node (vision_llm_node.py)

    • Local GGUF vision language model integration
    • Qwen2.5-VL and Qwen3-VL support
    • Multi-image input functionality
    • Multiple style presets (default, detailed, concise, creative, raw)
  • Qwen Image Edit Prompt Generator (qwen_nodes.py)

    • Dynamic model selection (local GGUF + cloud API)
    • Qwen3-VL manual mmproj selection
    • Multi-image editing support
  • Wan Video Prompt Generator (wan_nodes.py)

    • Text-to-Video and Image-to-Video prompt optimization
    • Wan2.2-specific prompt templates
    • Local Qwen3-VL integration for video tasks
  • Project Infrastructure

    • Repository structure and organization
    • GPL-3.0 licensing
    • Documentation (README, CHANGELOG, CONTRIBUTING)
    • Unified category system (multimodal/prompt)

Based On / Derived From

This project builds upon and derives from the following GPL-3.0 licensed projects:

ComfyUI-QwenPromptRewriter

ComfyUI-QwenVL

Acknowledgments

This project integrates with and depends on:

  • ComfyUI - Custom node architecture and workflow system
  • llama-cpp-python - GGUF model loading and inference (MIT-licensed, required dependency)
  • Aliyun Dashscope API - Cloud-based LLM services for Qwen models
  • Qwen Models - Vision language models from Alibaba Cloud

License

This project is licensed under the GNU General Public License v3.0.

All derived works must also be distributed under GPL-3.0 or a compatible license.

See the LICENSE file for the full license text.