Spark, Hive: Fix snapshot procedure for tables with Variant columns by nssalian · Pull Request #15964 · apache/iceberg

nssalian · 2026-04-13T16:44:11Z

Summary

The snapshot procedure fails on Spark tables with Variant columns because the Hive catalog stores LazySimpleSerDe instead of ParquetHiveSerDe for these tables. The SerDe-based format detection does not recognize LazySimpleSerDe and throws.
After fixing that, the test provided in the issue hits a second failure in HiveSchemaUtil.convertToTypeString, which has no case for VARIANT.

Changes

This adds a resolveFileFormat helper that falls back to table.provider() when the SerDe doesn't match a known format, and maps VARIANT to "unknown" in the Hive schema conversion, following the conversation here: #15228

Made changes in Spark v4.0 and v4.1
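
For readers following along, here is a minimal sketch of the fallback idea described above, not the code in this PR. The class name, the string-based signatures, and the substring matching on the SerDe class name are all assumptions made for illustration; the actual helper works against Spark's catalog table metadata.

```java
import java.util.Locale;

// Hypothetical, simplified model of the fallback; not the PR's implementation.
final class FileFormatFallbackSketch {

  // Maps a name (SerDe class name or provider) to a known format, or null if none matches.
  static String matchKnownFormat(String name) {
    if (name == null) {
      return null;
    }
    String lower = name.toLowerCase(Locale.ROOT);
    if (lower.contains("parquet")) {
      return "parquet";
    } else if (lower.contains("avro")) {
      return "avro";
    } else if (lower.contains("orc")) {
      return "orc";
    }
    return null;
  }

  // Tries the SerDe first, then falls back to the table provider.
  static String resolveFileFormat(String serde, String provider) {
    String fromSerde = matchKnownFormat(serde);
    if (fromSerde != null) {
      return fromSerde;
    }
    String fromProvider = matchKnownFormat(provider);
    if (fromProvider != null) {
      return fromProvider;
    }
    throw new UnsupportedOperationException(
        "Unknown file format: serde=" + serde + ", provider=" + provider);
  }

  public static void main(String[] args) {
    // LazySimpleSerDe matches no known format, so the provider decides.
    System.out.println(resolveFileFormat(
        "org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe", "parquet")); // prints "parquet"
  }
}
```

In this sketch the SerDe still takes precedence when it is recognized, so tables that already worked keep resolving the same way; only the previously failing case reaches the provider.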

Test plan

Expanded the test from the issue to cover both partitioned and unpartitioned tables in both Spark 4.0 and 4.1. The test is skipped for the hive and spark_catalog catalogs until Hive 4 lands.
Added a unit test for the HiveSchemaUtil VARIANT conversion.
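
As a rough illustration of what the conversion check covers, the sketch below models a type-to-Hive-type-string mapping with a VARIANT case that returns "unknown". The enum and class here are hypothetical stand-ins; the real HiveSchemaUtil.convertToTypeString operates on Iceberg types and handles many more cases.

```java
// Hypothetical, reduced model of a Hive type-string conversion; not the Iceberg class.
final class HiveTypeStringSketch {

  // Assumed subset of type ids, for illustration only.
  enum TypeId { BOOLEAN, INTEGER, LONG, STRING, VARIANT }

  static String convertToTypeString(TypeId type) {
    switch (type) {
      case BOOLEAN:
        return "boolean";
      case INTEGER:
        return "int";
      case LONG:
        return "bigint";
      case STRING:
        return "string";
      case VARIANT:
        // The case that was missing before: map VARIANT to "unknown".
        return "unknown";
      default:
        throw new UnsupportedOperationException("Unsupported type: " + type);
    }
  }

  public static void main(String[] args) {
    System.out.println(convertToTypeString(TypeId.VARIANT)); // prints "unknown"
  }
}
```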


nssalian · 2026-04-13T17:34:37Z

@RussellSpitzer @pvary @aihuaxu @steveloughran @huaxingao PTAL

steveloughran · 2026-04-14T12:42:18Z

@nssalian will do

nssalian added 2 commits April 13, 2026 08:45

Spark, Hive: Fix snapshot procedure for tables with Variant columns

46f335c

Change Hive variant type to unknown and skip hive and spark_catalog f…

bdf1f5e

…or now

github-actions bot added spark hive labels Apr 13, 2026

nssalian marked this pull request as ready for review April 13, 2026 17:33

nssalian wants to merge 2 commits into apache:main from nssalian:snapshot-procedure-variant-fix
