[Bug]: Filesystem partition saving run with overwrite removes all files not just the related ones #2239

@stloyd

Description

What happened?

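Currently, when two data frames write to the same partitioned output directory using the native (or any) filesystem, running the second data frame in overwrite mode removes the files written by the first one. Overwrite should only replace the files that belong to the data frame being run (here `sales.csv`), not every file in the partition.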

How to reproduce?

df()
    ->extract(/* */)
    ->partitionBy('order-year', 'order-month')
    ->mode(overwrite())
    ->load(to_csv(__DIR__.'/output/orders.csv'))
    ->run()
;

df()
    ->extract(/* */)
    ->partitionBy('order-year', 'order-month')
    ->mode(overwrite())
    ->load(to_csv(__DIR__.'/output/sales.csv'))
    ->run()
;
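
The expected behaviour can be sketched in plain PHP, without the Flow API. This is only an illustration, not Flow's implementation: `overwrite_partition_file()` is a hypothetical helper, and both the Hive-style `order-year=2023/order-month=01` directory layout and the temporary output path are assumptions made for the example.

```php
<?php

declare(strict_types=1);

/**
 * Hypothetical helper (not part of Flow): overwrite a single file inside
 * a partition directory while leaving sibling files untouched.
 */
function overwrite_partition_file(string $partitionDir, string $fileName, string $contents): void
{
    if (!is_dir($partitionDir)) {
        mkdir($partitionDir, 0777, true);
    }

    $target = $partitionDir . DIRECTORY_SEPARATOR . $fileName;

    // Remove only a previous version of this exact file.
    if (is_file($target)) {
        unlink($target);
    }

    file_put_contents($target, $contents);
}

// Assumed partition layout, created under the system temp directory.
$partition = sys_get_temp_dir() . '/flow-issue-' . uniqid() . '/order-year=2023/order-month=01';

overwrite_partition_file($partition, 'orders.csv', "id\n1\n"); // first data frame
overwrite_partition_file($partition, 'sales.csv', "id\n2\n");  // second data frame
overwrite_partition_file($partition, 'sales.csv', "id\n3\n");  // second data frame, re-run

// List the partition contents without the "." and ".." entries.
$files = array_values(array_diff(scandir($partition), ['.', '..']));
sort($files);

print_r($files); // both orders.csv and sales.csv are still present
```

With this approach, re-running the second pipeline replaces `sales.csv` only, so `orders.csv` from the first pipeline survives. The reported bug corresponds to clearing the whole partition directory before writing instead.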

Version

0.30+
