Skip to content

Antalya 26.3: Fix condition for using parquet metadata cache#1751

Open
zvonand wants to merge 3 commits intoantalya-26.3from
feature/antalya-26.3/pr-1631
Open

Antalya 26.3: Fix condition for using parquet metadata cache#1751
zvonand wants to merge 3 commits intoantalya-26.3from
feature/antalya-26.3/pr-1631

Conversation

@zvonand
Copy link
Copy Markdown
Collaborator

@zvonand zvonand commented May 6, 2026

Changelog category (leave one):

  • Bug Fix (user-visible misbehavior in an official stable release)

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Fix apache iceberg queries not hitting the parquet metadata cache (#1631 by @arthurpassos).

CI/CD Options

Exclude tests:

  • Fast test
  • Integration Tests
  • Stateless tests
  • Stateful tests
  • Performance tests
  • All with ASAN
  • All with TSAN
  • All with MSAN
  • All with UBSAN
  • All with Coverage
  • All with Aarch64
  • All Regression
  • Disable CI Cache

Regression jobs to run:

  • Fast suites (mostly <1h)
  • Aggregate Functions (2h)
  • Alter (1.5h)
  • Benchmark (30m)
  • ClickHouse Keeper (1h)
  • Iceberg (2h)
  • LDAP (1h)
  • Parquet (1.5h)
  • RBAC (1.5h)
  • SSL Server (1h)
  • S3 (2h)
  • S3 Export (2h)
  • Swarms (30m)
  • Tiered Storage (2h)

Cherry-picked from #1631.


Apache Iceberg queries were not htiting the parquet metadata cache because object_info->getFileFormat() resolves to IcebergDataObjectInfo::getFileFormat, which gets its return value from IcebergObjectSerializableInfo. This field is filled with the value from Apache Iceberg manifest file, and it is upper case by default, which then fails clickhouse check for parquet metadata cache usage.

Documentation entry for user-facing changes

...

@zvonand zvonand added releasy Created/managed by RelEasy ai-resolved Port conflict auto-resolved by Claude labels May 6, 2026
@github-actions
Copy link
Copy Markdown

github-actions Bot commented May 6, 2026

Workflow [PR], commit [d408ba5]

zvonand and others added 2 commits May 7, 2026 13:49
…next commit)

---
Original cherry-pick message follows:

Merge pull request #1631 from Altinity/arthurpassos-patch-11

Fix condition for using parquet metadata cache
# Conflicts:
#	src/Storages/ObjectStorage/StorageObjectStorageSource.cpp
#	tests/integration/test_storage_iceberg_with_spark/test_read_constant_columns_optimization.py
@zvonand zvonand force-pushed the feature/antalya-26.3/pr-1631 branch from 98cefd0 to 304b298 Compare May 7, 2026 11:52
arthurpassos
arthurpassos previously approved these changes May 7, 2026
Copy link
Copy Markdown
Collaborator

@arthurpassos arthurpassos left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ai-resolved Port conflict auto-resolved by Claude releasy Created/managed by RelEasy

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants