Bump transformers to 5.0.0 by AlexanderDokuchaev · Pull Request #4060 · openvinotoolkit/nncf

AlexanderDokuchaev · 2026-05-01T05:51:30Z

Changes

Bump transformers to 5.0.0

apply_chat_template returns BatchFeature instance instead of dict
Install eval_llm with [hf] to install all requaread packages
Removed StaticCacheConfig
Updated WWB fork
Removed dummy_llama tests for SparsityActivation alogrimth as it falls

Reason for changes

Related tickets

Tests

Test examples - success
Weight compression - success
PTQ-873

Copilot

Pull request overview

This PR updates NNCF’s test and example environments to be compatible with transformers==5.0.0, adjusting call sites for API changes (notably apply_chat_template return type) and updating generation static-cache configuration accordingly.

Changes:

Bump transformers to 5.0.0 across test/example requirements and update related dependencies (e.g., sentence-transformers, tensorflow-io, whowhatbench, lm_eval[hf]).
Update example code to handle tokenizer.apply_chat_template(...) returning a BatchFeature (extracting input_ids explicitly).
Remove StaticCacheConfig usage and switch generation cache_config to a dict-based configuration; remove the dummy_llama sparsify-activations unit test helper/testcase.

Reviewed changes

Copilot reviewed 25 out of 25 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/torch/requirements.txt	Pins `transformers==5.0.0` and updates `sentence-transformers` for torch test environment.
tests/torch/function_hook/sparsify_activations/test_algo.py	Removes `dummy_llama` sparsify-activations algorithm test case.
tests/torch/function_hook/sparsify_activations/helpers.py	Removes `dummy_llama_model` helper and related transformers import.
tests/post_training/requirements.txt	Pins `transformers==5.0.0`, bumps `tensorflow-io`, updates WWB fork commit.
tests/post_training/pipelines/fx_modelling.py	Drops `StaticCacheConfig` and updates static cache setup for Transformers 5.
tests/openvino/requirements.txt	Pins `transformers==5.0.0` for OpenVINO tests.
examples/llm_compression/torch/downstream_qat_with_nls/requirements.txt	Pins `transformers==5.0.0` and switches to `lm_eval[hf]`.
examples/llm_compression/torch/distillation_qat_with_lora/requirements.txt	Pins `transformers==5.0.0` and switches to `lm_eval[hf]`.
examples/llm_compression/torch_fx/tiny_llama/requirements.txt	Pins `transformers==5.0.0` for the torch.fx tiny-llama example.
examples/llm_compression/torch_fx/tiny_llama/modelling.py	Updates static cache configuration to dict and ensures `use_cache` is enabled.
examples/llm_compression/torch_fx/tiny_llama/main.py	Adjusts chat template tokenization to handle `BatchFeature` output.
examples/llm_compression/openvino/tiny_llama/requirements.txt	Pins `transformers==5.0.0` for OpenVINO tiny-llama example.
examples/llm_compression/openvino/tiny_llama/main.py	Adjusts chat template tokenization to handle `BatchFeature` output.
examples/llm_compression/openvino/tiny_llama_synthetic_data/requirements.txt	Pins `transformers==5.0.0` for synthetic-data example.
examples/llm_compression/openvino/tiny_llama_find_hyperparams/requirements.txt	Pins `transformers==5.0.0` and updates WWB fork commit.
examples/llm_compression/openvino/smollm2_360m_fp8/requirements.txt	Pins `transformers==5.0.0` for FP8 example.
examples/llm_compression/openvino/smollm2_360m_fp8/main.py	Adjusts chat template tokenization to handle `BatchFeature` output.
examples/llm_compression/openvino/smollm2_360m_codebook/requirements.txt	Pins `transformers==5.0.0` for codebook example.
examples/llm_compression/openvino/smollm2_360m_codebook/main.py	Adjusts chat template tokenization to handle `BatchFeature` output.
examples/llm_compression/openvino/smollm2_360m_adaptive_codebook/requirements.txt	Pins `transformers==5.0.0` for adaptive codebook example.
examples/llm_compression/openvino/smollm2_360m_adaptive_codebook/main.py	Adjusts chat template tokenization to handle `BatchFeature` output.
examples/llm_compression/onnx/tiny_llama/requirements.txt	Pins `transformers==5.0.0` for ONNX tiny-llama example.
examples/llm_compression/onnx/tiny_llama/main.py	Adjusts chat template tokenization to handle `BatchFeature` output.
examples/llm_compression/onnx/tiny_llama_scale_estimation/requirements.txt	Pins `transformers==5.0.0` for scale-estimation example.
examples/llm_compression/onnx/tiny_llama_scale_estimation/main.py	Adjusts chat template tokenization to handle `BatchFeature` output.

github-actions Bot added NNCF PT Pull requests that updates NNCF PyTorch NNCF OpenVINO Pull requests that updates NNCF OpenVINO NNCF PTQ Pull requests that updates NNCF PTQ labels May 1, 2026

transformers==5.0.0rc3

66c7934

AlexanderDokuchaev force-pushed the ad/bump_transformers branch from b4dd5f6 to 66c7934 Compare June 16, 2026 14:12

AlexanderDokuchaev added 4 commits June 16, 2026 17:15

sentence-transformers==5.6.0

5d3d5d8

f

36acba9

transformers==5.0.0

fbfada8

f

bc5e2dc

AlexanderDokuchaev changed the title ~~Bump transformers to 5.0.0rc3~~ Bump transformers to 5.0.0 Jun 17, 2026

AlexanderDokuchaev added 2 commits June 17, 2026 11:35

fx

fd363b0

f

3febab3

AlexanderDokuchaev marked this pull request as ready for review June 17, 2026 11:23

AlexanderDokuchaev requested a review from a team as a code owner June 17, 2026 11:23

Copilot AI review requested due to automatic review settings June 17, 2026 11:23

Copilot started reviewing on behalf of AlexanderDokuchaev June 17, 2026 11:23 View session

AlexanderDokuchaev requested review from andreyanufr and anzr299 June 17, 2026 11:26

Copilot AI reviewed Jun 17, 2026

View reviewed changes

Comment thread tests/torch/function_hook/sparsify_activations/test_algo.py

AlexanderDokuchaev added 2 commits June 17, 2026 18:08

Merge branch 'develop' into ad/bump_transformers

dc696c0

gpt

5e6662c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bump transformers to 5.0.0#4060

Bump transformers to 5.0.0#4060
AlexanderDokuchaev wants to merge 9 commits into
openvinotoolkit:developfrom
AlexanderDokuchaev:ad/bump_transformers

AlexanderDokuchaev commented May 1, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

AlexanderDokuchaev commented May 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Reason for changes

Related tickets

Tests

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

AlexanderDokuchaev commented May 1, 2026 •

edited

Loading