Skip to content

vllm addition to torchtalk#10

Open
arkadip-maitra wants to merge 5 commits into
mainfrom
feat/torchtalk_vllm_addition
Open

vllm addition to torchtalk#10
arkadip-maitra wants to merge 5 commits into
mainfrom
feat/torchtalk_vllm_addition

Conversation

@arkadip-maitra

Copy link
Copy Markdown
Collaborator

Fixes #5

Extend TorchTalk with first-class vllm support through a framework adapter architecture, while preserving PyTorch as the default path. This adds a real VllmAdapter, static vLLM indexing across API/engine/registry/IR/native-binding layers, condition-aware graph/proof traces, and framework-aware CLI/MCP behavior.

Tested thoroughly across both repos: TorchTalk lint passed, the full TorchTalk suite passed with PYTORCH_SOURCE=/data/pytorch VLLM_SOURCE=/data/vllm (487 tests), and the new vLLM-focused TorchTalk integration/quality suites all passed (45 tests) covering adapter bootstrap, CLI paths, MCP-facing get_status / search / trace / graph, unsupported-tool behavior, entity-count thresholds, proof traces, and conditional graph evidence. I also source-built vLLM in a fresh conda env, verified imports/native bindings, ran focused vLLM native tests (1508 passed), and confirmed real runtime behavior on H200 with offline generation, pooling/embeddings, isolated OpenAI-compatible serve, and a 2-GPU tensor-parallel serve smoke.

@arkadip-maitra arkadip-maitra changed the title [wip] vllm addition to torchtalk vllm addition to torchtalk Jun 4, 2026
@arkadip-maitra arkadip-maitra requested a review from adabeyta June 4, 2026 07:55
Github Executorch and others added 3 commits June 22, 2026 10:26
Restore the hybrid-discovery vLLM graph fixes so TorchTalk stays resilient to current vLLM layout drift, and add a clean LLM + TorchTalk(vLLM) vs LLM-only benchmark package that shows the query quality, outputs, and summary results.
Apply Ruff formatting to the vLLM index builder so the src/torchtalk format check passes in CI.
llm models are gpt 5.4 and claude opus 4.6
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[RFC] Extend TorchTalk with vLLM support

1 participant