Commit 0ef1b98
Extract cast_mxfp4_to_nvfp4 quant_cfg mutation into helper
Move the inline weight-quantizer block_sizes='static' rewrite out of
quantize_main() into a public force_weight_quantizers_static() helper in
cast_mxfp4_to_nvfp4.py, keeping the cast-specific config logic colocated
with the rest of the cast flow.
Addresses review feedback on PR #1372.
Signed-off-by: Chenjie Luo <chenjiel@nvidia.com>
1 parent 024c428 commit 0ef1b98
2 files changed
Lines changed: 19 additions & 12 deletions
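The helper body itself is not preserved in this view, so the following is only a minimal sketch of what a `force_weight_quantizers_static()` helper of the kind described above might look like. The config shape (a `"quant_cfg"` mapping from quantizer name patterns to settings, each with a `block_sizes` dict carrying a `"type"` key), the pattern match on `weight_quantizer`, and the copy-and-return signature are all assumptions made for illustration, not the committed implementation.

```python
import copy
from typing import Any


def force_weight_quantizers_static(quant_cfg: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of quant_cfg whose weight-quantizer entries use static block sizes.

    Hypothetical sketch: the real helper may mutate in place and may use a
    different config layout.
    """
    new_cfg = copy.deepcopy(quant_cfg)
    for pattern, entry in new_cfg.get("quant_cfg", {}).items():
        # Assumption: weight quantizers are keyed by patterns ending in "weight_quantizer".
        if not (isinstance(pattern, str) and pattern.endswith("weight_quantizer")):
            continue
        if not isinstance(entry, dict):
            continue
        block_sizes = entry.get("block_sizes")
        # Assumption: block_sizes is a dict whose "type" key selects dynamic vs. static.
        if isinstance(block_sizes, dict):
            block_sizes["type"] = "static"
    return new_cfg


# Usage with an illustrative (made-up) config.
example_cfg = {
    "quant_cfg": {
        "*weight_quantizer": {"block_sizes": {-1: 16, "type": "dynamic"}},
        "*input_quantizer": {"block_sizes": {-1: 16, "type": "dynamic"}},
    },
    "algorithm": "max",
}
static_cfg = force_weight_quantizers_static(example_cfg)
assert static_cfg["quant_cfg"]["*weight_quantizer"]["block_sizes"]["type"] == "static"
```

Keeping this logic in one named function, rather than inline in `quantize_main()`, lets the caller reduce to a single call and keeps the cast-specific rule next to the rest of the cast code, which is the motivation the commit message gives.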
File 1 of 2 (diff content not preserved): 17 lines added as new lines 294 to 310, inserted between original lines 293 and 294.
File 2 of 2 (diff content not preserved): 1 line added as new line 28; original lines 1091 to 1095 and 1098 to 1104 removed (12 lines); 1 line added as new line 1094.