Skip to content

[SPARK-56838][SDP] Introduce AutoCDC parameters dataclass#55836

Open
AnishMahto wants to merge 3 commits into
apache:masterfrom
AnishMahto:SPARK-56838-introduce-ChangeArgs
Open

[SPARK-56838][SDP] Introduce AutoCDC parameters dataclass#55836
AnishMahto wants to merge 3 commits into
apache:masterfrom
AnishMahto:SPARK-56838-introduce-ChangeArgs

Conversation

@AnishMahto
Copy link
Copy Markdown
Contributor

@AnishMahto AnishMahto commented May 12, 2026

Approved AutoCDC SPIP: https://lists.apache.org/thread/j6sj9wo9odgdpgzlxtvhoy7szs0jplf7


Introduce ChangeArgs as the dataclass that represents AutoCDC API parameters. In future PRs:

  1. ChangeArgs will be constructed, populated, and propagated by SDP SQL/Python flow registration API.
  2. ChangeArgs will be referenced by SCD1/SCD2 algorithm implementations, to respect user specified configurations.
  3. Advanced AutoCDC parameters (as per the SPIP) such as ignoreNull or trackHistoryColumns will be added and supported.

Additionally introduce ColumnSelection helper class, to encode the notion of user selecting a list of columns for inclusion/exclusion directly into a data type, rather than relying on implicit understanding of a raw string list.

@dongjoon-hyun dongjoon-hyun changed the title [SPARK-56838][SDP][AutoCDC] Introduce AutoCDC parameters dataclass [SPARK-56838][SDP] Introduce AutoCDC parameters dataclass May 13, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant