Skip to content

Commit b5f4b56

Browse files
committed
fix docstring
Signed-off-by: Jennifer Chen <jennifchen@nvidia.com>
1 parent ef37456 commit b5f4b56

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

modelopt_recipes/models/Nemotron-3-Super-120B-A12B/super-nvfp4-fp8-sweep-stride4.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@
1313
# See the License for the specific language governing permissions and
1414
# limitations under the License.
1515

16-
# Mirrors the published nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 hf_quant_config.json:
16+
# Mirrors the published nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 hf_quant_config.json BUT adds a stride=4 for FP8 scale sweep, which is useful for large models to improve PTQ efficiency.
1717
# - MoE routed experts: NVFP4 W4A4 weight MSE, group_size 16
1818
# HF names: mixer.experts.<N>.{up,down}_proj
1919
# Megatron-Core names: mlp.experts.local_experts.<N>.linear_fc{1,2}

0 commit comments

Comments
 (0)