feat: support DROP INDEX DDL by LuciferYang · Pull Request #371 · lance-format/lance-spark

LuciferYang · 2026-03-31T14:35:42Z

Summary

Add support for dropping indexes on Lance tables through the Spark SQL interface:

ALTER TABLE catalog.db.table DROP INDEX index_name;

This is the counterpart to the existing CREATE INDEX and SHOW INDEXES commands, completing the index lifecycle management in lance-spark.

Design

DROP INDEX is a metadata-only operation — it removes the index entry from the dataset manifest via lance-core's dataset.dropIndex(name) API. Physical index files are not deleted; they are cleaned up by VACUUM during garbage collection.

The implementation follows the standard lance-spark SQL extension pipeline:

ANTLR grammar — ALTER TABLE ... DROP INDEX indexName rule + DROP token
AST builder — visitDropIndex in all version-specific builders (3.4, 3.5, 4.0, 4.1)
Logical plan — LanceDropIndex(table, indexName) with output schema (index_name, status)
Physical exec — LanceDropIndexExec calls dataset.dropIndex(indexName) on the driver
Strategy — LanceDropIndex → LanceDropIndexExec mapping with indexName.toLowerCase for consistency with CREATE INDEX

Note: The logical plan and physical exec are named LanceDropIndex / LanceDropIndexExec (not DropIndex / DropIndexExec) to avoid classpath collisions with Spark's built-in classes of the same name in spark-catalyst and spark-sql, which have different constructor signatures.

Changes

LanceSqlExtensions.g4 — Added dropIndex grammar rule and DROP token
LanceSqlExtensionsAstBuilder.scala (3.4, 3.5, 4.0, 4.1) — Added visitDropIndex visitor
DropIndex.scala — New LanceDropIndex logical plan
DropIndexExec.scala — New LanceDropIndexExec physical execution node
LanceDataSourceV2Strategy.scala — Added LanceDropIndex → LanceDropIndexExec dispatch
BaseAddIndexTest.java — Added testDropIndex and testDropIndexThenRecreate
test_lance_spark.py — Added test_drop_index and test_drop_index_then_recreate integration tests
drop-index.md — New documentation page

Test plan

Unit tests (BaseAddIndexTest.java)

testDropIndex — Creates index, drops it, verifies schema/output, confirms index is gone
testDropIndexThenRecreate — Full lifecycle: create → drop → recreate → verify query works
All 9 AddIndexTest tests pass (7 existing + 2 new) across all modules (3.4, 3.5, 4.0, 4.1)
Full test suite passes, 0 failures

Integration tests (test_lance_spark.py)

test_drop_index — Creates BTree index, drops it via SQL, verifies output schema (index_name, status), confirms SHOW INDEXES no longer lists it
test_drop_index_then_recreate — Full lifecycle: create → drop → recreate, verifies recreated index appears in SHOW INDEXES and queries still return correct results

hamersaw

Looks great thanks! Can you just add an integration test?

Add two integration tests to TestDDLIndex: - test_drop_index: creates index, drops it, verifies output schema and that SHOW INDEXES no longer lists it - test_drop_index_then_recreate: full lifecycle create -> drop -> recreate, verifies the recreated index works with queries

LuciferYang

Thanks for the review! Added two integration tests in docker/tests/test_lance_spark.py under TestDDLIndex:

test_drop_index — creates a BTree index, drops it via ALTER TABLE ... DROP INDEX, verifies the output schema (index_name, status), and confirms the index no longer appears in SHOW INDEXES.
test_drop_index_then_recreate — full lifecycle: create → drop → recreate, then verifies the recreated index exists in SHOW INDEXES and queries still work correctly.

SHOW INDEXES output schema uses column 'name', not 'index_name'. The DROP INDEX output uses 'index_name' — these are different schemas.

LuciferYang · 2026-04-02T03:15:34Z

need to further investigate the Docker testing

Spark's spark-catalyst JAR contains its own org.apache.spark.sql.catalyst.plans.logical.DropIndex with a 3-parameter constructor (LogicalPlan, String, boolean). At runtime Spark's class takes precedence on the classpath, causing NoSuchMethodError when the AST builder tries to call lance-spark's 2-parameter constructor. Rename to LanceDropIndex / LanceDropIndexOutputType to avoid the collision, consistent with how other lance-spark classes (AddIndex, ShowIndexes, etc.) already use names that don't conflict with Spark built-ins.

…llision Spark's spark-sql JAR also contains DropIndexExec in the same package (org.apache.spark.sql.execution.datasources.v2) with a different constructor signature. Rename to LanceDropIndexExec for the same reason as the LanceDropIndex rename.

LuciferYang · 2026-04-02T06:53:58Z

all test passed

hamersaw

Great add! Thanks.

LuciferYang · 2026-04-02T16:35:59Z

thanks @hamersaw

LuciferYang · 2026-04-02T16:42:02Z

By the way, I would like to mention that I will be on a 4-day holiday. During this period, I may not respond promptly to code fix suggestions.

hamersaw · 2026-04-02T17:34:22Z

By the way, I would like to mention that I will be on a 4-day holiday. During this period, I may not respond promptly to code fix suggestions.

Enjoy the break!

LuciferYang added 2 commits March 31, 2026 22:30

init

35ead90

add doc

fd49173

github-actions bot added the enhancement New feature or request label Mar 31, 2026

hamersaw reviewed Apr 1, 2026

View reviewed changes

LuciferYang commented Apr 2, 2026

View reviewed changes

fix: use correct column name 'name' for SHOW INDEXES results

00d0f09

SHOW INDEXES output schema uses column 'name', not 'index_name'. The DROP INDEX output uses 'index_name' — these are different schemas.

LuciferYang added 2 commits April 2, 2026 14:07

hamersaw approved these changes Apr 2, 2026

View reviewed changes

hamersaw merged commit ca54fa9 into lance-format:main Apr 2, 2026
18 checks passed

hamersaw mentioned this pull request Apr 6, 2026

Drop index DDL #244

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support DROP INDEX DDL#371

feat: support DROP INDEX DDL#371
hamersaw merged 6 commits intolance-format:mainfrom
LuciferYang:feat/drop-index-v2

LuciferYang commented Mar 31, 2026 •

edited

Loading

Uh oh!

hamersaw left a comment

Uh oh!

LuciferYang left a comment

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

hamersaw left a comment

Uh oh!

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

hamersaw commented Apr 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

LuciferYang commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Design

Changes

Test plan

Unit tests (BaseAddIndexTest.java)

Integration tests (test_lance_spark.py)

Uh oh!

hamersaw left a comment

Choose a reason for hiding this comment

Uh oh!

LuciferYang left a comment

Choose a reason for hiding this comment

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

hamersaw left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

LuciferYang commented Apr 2, 2026

Uh oh!

hamersaw commented Apr 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

LuciferYang commented Mar 31, 2026 •

edited

Loading