Commit e6e4efd
authored
[0.5/3] Diffusion ckpt export for NVFP4 & FP8 (#783)
See #781
This is the MR that only includes the refactoring of the llm export,
please ignore the change on quantize.py from the diffusion example.
<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit
## Release Notes
* **New Features**
* Added `--hf-ckpt-dir` CLI option to save checkpoints in HuggingFace
format
* Enabled support for exporting Diffusers-based pipelines
* Unified export system now handles both transformer and diffusion model
architectures
<sub>✏️ Tip: You can customize this high-level summary in your review
settings.</sub>
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
---------
Signed-off-by: Jingyu Xin <jingyux@nvidia.com>1 parent 849a350 commit e6e4efd
4 files changed
Lines changed: 323 additions & 128 deletions
File tree
- examples
- diffusers/quantization
- llm_ptq
- llm_qat
- modelopt/torch/export
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
66 | 66 | | |
67 | 67 | | |
68 | 68 | | |
| 69 | + | |
69 | 70 | | |
70 | 71 | | |
71 | 72 | | |
| |||
348 | 349 | | |
349 | 350 | | |
350 | 351 | | |
| 352 | + | |
351 | 353 | | |
352 | 354 | | |
353 | 355 | | |
| |||
363 | 365 | | |
364 | 366 | | |
365 | 367 | | |
| 368 | + | |
| 369 | + | |
| 370 | + | |
366 | 371 | | |
367 | 372 | | |
368 | 373 | | |
| |||
862 | 867 | | |
863 | 868 | | |
864 | 869 | | |
| 870 | + | |
| 871 | + | |
| 872 | + | |
| 873 | + | |
| 874 | + | |
| 875 | + | |
| 876 | + | |
| 877 | + | |
| 878 | + | |
| 879 | + | |
| 880 | + | |
| 881 | + | |
| 882 | + | |
| 883 | + | |
865 | 884 | | |
866 | 885 | | |
867 | 886 | | |
| |||
994 | 1013 | | |
995 | 1014 | | |
996 | 1015 | | |
| 1016 | + | |
| 1017 | + | |
| 1018 | + | |
| 1019 | + | |
| 1020 | + | |
997 | 1021 | | |
998 | 1022 | | |
999 | 1023 | | |
| |||
1070 | 1094 | | |
1071 | 1095 | | |
1072 | 1096 | | |
| 1097 | + | |
1073 | 1098 | | |
1074 | 1099 | | |
1075 | 1100 | | |
| |||
1125 | 1150 | | |
1126 | 1151 | | |
1127 | 1152 | | |
| 1153 | + | |
| 1154 | + | |
| 1155 | + | |
1128 | 1156 | | |
1129 | 1157 | | |
1130 | 1158 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
39 | | - | |
| 39 | + | |
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
| |||
243 | 243 | | |
244 | 244 | | |
245 | 245 | | |
246 | | - | |
| 246 | + | |
247 | 247 | | |
248 | 248 | | |
249 | 249 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
23 | 23 | | |
24 | 24 | | |
25 | 25 | | |
26 | | - | |
| 26 | + | |
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| |||
81 | 81 | | |
82 | 82 | | |
83 | 83 | | |
84 | | - | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
85 | 87 | | |
86 | 88 | | |
87 | 89 | | |
| |||
0 commit comments