Adding A3mega Llama3.1-70b recipe with trtllm by Priya-Quad · Pull Request #221 · AI-Hypercomputer/gpu-recipes

Priya-Quad · 2026-05-07T05:52:59Z

No description provided.

depksingh · 2026-05-07T08:00:48Z

+# Copyright 2025 Google LLC
+#
+# Licensed under the Apache License, Version 2.0 (the "License");


Please remove this template if we already have it in the trt-inference folder.

depksingh · 2026-05-07T08:01:12Z

+apiVersion: v1
+kind: ConfigMap
+metadata:
+  name: "{{ .Release.Name }}-config"


Please update this file, and all the other files, to follow the same format as the vllm configs.

depksingh · 2026-05-07T08:01:51Z

+  configFile: serving-args.yaml
+  configPath: /workload/configs
+  envs:
+    - name: LAUNCHER_SCRIPT


Move these to the launcher, or to another file that already holds the env variables.

depksingh · 2026-05-07T08:03:29Z

    pp_size=${SERVING_CONFIG_DICT["pp_size"]:=1}
    ep_size=${SERVING_CONFIG_DICT["ep_size"]:=1}
-    backend=${SERVING_CONFIG_DICT["backend"]:="tensorrt"}
+    backend="tensorrt"


Please remove the hardcoding. The file is already correct, since it is used by multiple models. Please debug the underlying issue instead.
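For context on why the hardcoding matters, here is a minimal sketch of how the `:=` default expansion in the original line behaves compared with the hardcoded assignment. It assumes `SERVING_CONFIG_DICT` is a bash associative array, as the subscript syntax in the diff suggests; the empty array and the `pytorch` value below are illustrative stand-ins, not taken from this PR.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for the launcher's parsed serving config.
# The real parsing code is not shown in this PR; this is an assumption.
declare -A SERVING_CONFIG_DICT=()

# Case 1: key absent. ":=" assigns the default to the array element
# and expands to it, so backend falls back to "tensorrt".
backend=${SERVING_CONFIG_DICT["backend"]:="tensorrt"}
backend_when_unset="$backend"

# Case 2: key present (illustrative value). The configured value wins
# and the default is ignored.
SERVING_CONFIG_DICT["backend"]="pytorch"
backend=${SERVING_CONFIG_DICT["backend"]:="tensorrt"}
backend_when_set="$backend"

# Case 3: the hardcoded form from the diff. Whatever the config says
# is discarded, for every model that shares this script.
backend="tensorrt"
backend_hardcoded="$backend"

echo "unset key:  $backend_when_unset"
echo "set key:    $backend_when_set"
echo "hardcoded:  $backend_hardcoded"
```

If this sketch matches the launcher's behavior, a recipe that needs `tensorrt` can either omit the key or set it explicitly in its own config, and the shared script can stay unchanged.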

depksingh · 2026-05-07T08:03:50Z

    # If custom_dataset is not set, generate a textual dataset with tokens sampled in normal distribution
    if [ -z "$dataset_file" ]; then
-        dataset_file="/ssd/token-norm-dist_${model_name##*/}_${isl}_${osl}_tp${tp_size}.json"
+        dataset_file="/scratch/token-norm-dist_${model_name##*/}_${isl}_${osl}_tp${tp_size}.json"


Make the changes in your added files instead of changing this one.

Priya-Quad added 4 commits May 5, 2026 12:23

Adding A3mega Recipe

7af040c

feat(a3mega): implement manual hardware binding and deep driver discovery in serving-launcher

8d8f5e7

fix(a3mega): force tensorrt backend, fix launcher syntax, and update helm templates

c80e357

Merge branch 'AI-Hypercomputer:main' into main

13befc9

depksingh reviewed May 7, 2026

Priya-Quad added 2 commits May 8, 2026 09:17

fix(a3mega): apply NCCL stability flags and bash-level overrides for Llama 3.1 70B sharding

e8eaafb

fix(a3mega): update serving launcher native env vars and service template

1e67d1a

Priya-Quad wants to merge 6 commits into AI-Hypercomputer:main from Priya-Quad:main