fix: bypass gloo DDP for Windows single-GPU training by fanfan-love-meatmeat · Pull Request #2744 · RVC-Boss/GPT-SoVITS

fanfan-love-meatmeat · 2026-03-05T03:00:59Z

Problem

On Windows with a single GPU, dist.init_process_group() using the gloo backend fails with:
RuntimeError: unsupported gloo device

This is caused by virtual network adapters (VPN, VMware, Hyper-V, etc.) interfering with gloo's network interface detection.

Solution

Skip DDP initialization entirely for Windows single-GPU setups, rather than patching gloo environment variables.
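
The check behind this decision can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the helper name `should_skip_ddp` and its parameters are hypothetical, and the real scripts may detect the platform and GPU count differently.

```python
import sys


def should_skip_ddp(n_gpus: int, platform: str = sys.platform) -> bool:
    """Return True when distributed init can be skipped.

    Hypothetical helper: assumes "Windows single-GPU" means the platform
    string starts with "win" (sys.platform is "win32" on Windows) and
    exactly one GPU is in use.
    """
    return platform.startswith("win") and n_gpus == 1


# Intended use inside a training entry point (shown as comments because
# the surrounding training code is not part of this sketch):
#
#   if should_skip_ddp(n_gpus):
#       ...  # train in-process, no dist.init_process_group()
#   else:
#       ...  # call dist.init_process_group(backend="gloo", ...) as before
```

Keeping the decision in one small function means the multi-GPU and non-Windows paths stay exactly as they were.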

Changes

- s2_train.py: Skip dist.init_process_group() on Windows single-GPU; add DummyDDP wrapper to maintain .module interface compatibility
- s1_train.py: Set USE_LIBUV=0 to avoid socket conflicts; use strategy='auto' for single-GPU (bypasses gloo entirely)
- utils.py / bucket_sampler.py: Related compatibility fixes
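
For readers unfamiliar with the `.module` convention: code written for DistributedDataParallel usually reaches the underlying model through `wrapped.module`, so a stand-in wrapper needs to expose that attribute. The sketch below shows one way such a wrapper could look. It is an assumption about the shape of DummyDDP, not the PR's actual class, and `TinyModel` is a hypothetical stand-in so the example runs without PyTorch.

```python
class DummyDDP:
    """Pass-through wrapper exposing `.module`, like DistributedDataParallel does."""

    def __init__(self, module):
        self.module = module

    def __call__(self, *args, **kwargs):
        # Forward calls straight to the wrapped model.
        return self.module(*args, **kwargs)

    def __getattr__(self, name):
        # Only reached when normal attribute lookup fails.
        # Guard against recursion if `module` itself is not set yet.
        if name == "module":
            raise AttributeError(name)
        return getattr(self.module, name)


class TinyModel:
    """Hypothetical stand-in for a torch.nn.Module."""

    def __call__(self, x):
        return x * 2

    def state_dict(self):
        return {"scale": 2}


wrapped = DummyDDP(TinyModel())
print(wrapped(3))                   # forwarded call
print(wrapped.module.state_dict())  # DDP-style access keeps working
```

With this shape, existing lines such as `net_g.module.state_dict()` need no changes on the single-GPU path.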

Tested on

Windows 11, single NVIDIA RTX 5060 GPU, Python 3.10, PyTorch 2.5, CUDA 12.4

On Windows with a single GPU, dist.init_process_group() using the gloo backend frequently fails with 'unsupported gloo device', caused by virtual network adapters (VPN, VMware, Hyper-V, etc.).

Changes:
- s2_train.py: skip dist.init_process_group() on Windows single-GPU; add DummyDDP wrapper to maintain .module interface compatibility
- s1_train.py: set USE_LIBUV=0 to avoid socket conflicts; use strategy='auto' for single-GPU (bypasses gloo entirely), DDPStrategy only activated for multi-GPU setups
- utils.py, bucket_sampler.py: related compatibility adjustments

Tested on: Windows 11, single NVIDIA GPU, Python 3.10, PyTorch 2.5
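
The s1_train.py behaviour described in this commit could look roughly like the sketch below. The function names are hypothetical and the real script may structure this differently. The string "ddp" is Lightning's registered alias for DDPStrategy, used here so the example runs without Lightning installed.

```python
import os


def disable_libuv(env=os.environ) -> None:
    """Set USE_LIBUV=0, as the commit describes (hypothetical helper)."""
    env["USE_LIBUV"] = "0"


def choose_strategy(n_gpus: int) -> str:
    """Pick a Lightning strategy name from the GPU count.

    Assumption: one GPU (or none) uses "auto", which avoids creating a
    gloo process group; more than one GPU falls back to DDP.
    """
    return "auto" if n_gpus <= 1 else "ddp"


# Intended use (comments only, since Trainer is outside this sketch):
#
#   disable_libuv()
#   trainer = Trainer(strategy=choose_strategy(n_gpus), ...)
```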

…train_v3_lora

Extend the fix to v3 and LoRA training scripts:
- s2_train_v3.py: skip dist.init_process_group() + DummyDDP for Windows single-GPU
- s2_train_v3_lora.py: same fix applied to LoRA fine-tuning script

fanfan-love-meatmeat added 2 commits March 5, 2026 10:56

fanfan-love-meatmeat wants to merge 2 commits into RVC-Boss:main from fanfan-love-meatmeat:fix/windows-singlegpu-gloo
