You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat(writer): auto-set referenced_data_file on PositionDeleteFileWriter (#169)
* feat(writer): auto-set referenced_data_file on PositionDeleteFileWriter close
When a position delete writer closes with exactly one output file and all
written entries reference a single distinct data file path, set
referenced_data_file on the resulting DataFile. This allows readers to
scope delete file application without scanning the file_path column,
matching Iceberg Java behavior and eliminating unnecessary I/O.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
* fix(writer): set referenced_data_file on all rolled position delete files
The builders.len() == 1 guard prevented referenced_data_file from being
set when the rolling writer produced multiple output files due to size
rollover. Since rolling is purely size-driven, distinct_paths.len() == 1
already guarantees that every rolled output file contains entries for
the same single data file. Removing the builders.len() check ensures the
field is set on all output files, so readers can skip them for
non-matching data files even in high-volume single-file delete scenarios.
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
* refactor(writer): clean up test comment in position_delete_file_writer
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
---------
Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>
0 commit comments