feat!(excelreader): only load dtypes for columns specified via `use_columns` #329

lukapeschke · 2025-02-17T10:29:46Z

This changes our logic regarding the information we extract on sheet load. Rather than loading dtype information for all available columns, we only load it for selected columns. This introduces two breaking changes:

For ExcelTable and ExcelSheet, available_columns is no longer a property. It is now a method, and information regarding available columns is computed on-demand (the information is cached after having been computed once).
If use_columns is a callable, it no longer receives a ColumnInfo object, but a ColumnInfoBuilder instead (no dtype information).
ColumnInfoBuilder is now part of the Python API.

@PrettyWood If you have a better naming suggestion for ColumnInfoBuilder, I'd love to hear it. Maybe ColumnNameInfo or ColumnInfoNoDtype ?

closes #327

…than a property Signed-off-by: Luka Peschke <[email protected]>

This allows to return FastExcelResult<T> in pymethods Signed-off-by: Luka Peschke <[email protected]>

…olumns` Signed-off-by: Luka Peschke <[email protected]>

PrettyWood

Looks great! I prefer ColumnInfoNoDtype

Signed-off-by: Luka Peschke <[email protected]>

lukapeschke added 3 commits February 17, 2025 11:22

feat!(excelsheet,exceltable): make available_columns a method rather …

51d5c74

…than a property Signed-off-by: Luka Peschke <[email protected]>

refactor: impl From<FastExcelError> for PyErr

6f529f7

This allows to return FastExcelResult<T> in pymethods Signed-off-by: Luka Peschke <[email protected]>

feat!(excelreader): only load dtypes for columns specified via `use_c…

a274276

…olumns` Signed-off-by: Luka Peschke <[email protected]>

lukapeschke added bug Something isn't working enhancement New feature or request ✋ need review ✋ 🦀 rust 🦀 Pull requests that edit Rust code labels Feb 17, 2025

lukapeschke requested a review from PrettyWood February 17, 2025 10:29

lukapeschke self-assigned this Feb 17, 2025

PrettyWood reviewed Feb 18, 2025

View reviewed changes

lukapeschke added 2 commits February 18, 2025 15:49

Merge branch 'main' into make-selected-columns-a-method

2dd6766

refactor: rename ColumnInfoBuilder to ColumnInfoNoDtype

176d8d6

Signed-off-by: Luka Peschke <[email protected]>

lukapeschke requested a review from PrettyWood February 19, 2025 14:29

PrettyWood approved these changes Feb 19, 2025

View reviewed changes

PrettyWood merged commit bf4a229 into main Feb 19, 2025
23 checks passed

PrettyWood deleted the make-selected-columns-a-method branch February 19, 2025 14:54

PrettyWood removed the ✋ need review ✋ label Feb 19, 2025

lukapeschke mentioned this pull request Feb 19, 2025

When using use_columns it still reads other columns #327

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat!(excelreader): only load dtypes for columns specified via `use_columns` #329

feat!(excelreader): only load dtypes for columns specified via `use_columns` #329

lukapeschke commented Feb 17, 2025

PrettyWood left a comment

feat!(excelreader): only load dtypes for columns specified via use_columns #329

feat!(excelreader): only load dtypes for columns specified via use_columns #329

Conversation

lukapeschke commented Feb 17, 2025

PrettyWood left a comment

Choose a reason for hiding this comment

feat!(excelreader): only load dtypes for columns specified via `use_columns` #329

feat!(excelreader): only load dtypes for columns specified via `use_columns` #329